PulseAugur
实时 07:13:02

PIIGuard shields webpages from LLM PII harvesting via adversarial fragments

Researchers have developed PIIGuard, a novel webpage-level defense system designed to prevent large language models (LLMs) from harvesting personally identifiable information (PII). This system embeds hidden HTML fragments within webpages that subtly redirect LLMs away from disclosing sensitive data. PIIGuard demonstrated a defense success rate of at least 97.0% across several LLM models, including GPT-5.4-nano, Claude-haiku-4.5, and DeepSeek-chat, while maintaining the page's utility for standard question-answering tasks. AI

影响 Offers a new method for website owners to protect user data from LLM-based scraping.

排序理由 Academic paper detailing a new method for mitigating PII leakage from LLMs.

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

PIIGuard shields webpages from LLM PII harvesting via adversarial fragments

报道来源 [2]

  1. arXiv cs.CL TIER_1 English(EN) · Mingshuo Liu, Yiwei Zha, Min Chen ·

    PIIGuard: Mitigating PII Harvesting under Adversarial Sanitization

    arXiv:2605.03129v1 Announce Type: cross Abstract: Browsing-enabled LLM assistants can fetch webpages and answer contact-seeking queries, creating a practical channel for scraping contact-style personally identifiable information (PII) from public pages. Many prior defenses are de…

  2. arXiv cs.CL TIER_1 English(EN) · Min Chen ·

    PIIGuard: Mitigating PII Harvesting under Adversarial Sanitization

    Browsing-enabled LLM assistants can fetch webpages and answer contact-seeking queries, creating a practical channel for scraping contact-style personally identifiable information (PII) from public pages. Many prior defenses are deployed at the model, service, or agent layer rathe…