PulseAugur
LIVE 07:39:08
research · [2 sources] ·
0
research

PIIGuard shields webpages from LLM PII harvesting via adversarial fragments

Researchers have developed PIIGuard, a novel webpage-level defense system designed to prevent large language models (LLMs) from harvesting personally identifiable information (PII). This system embeds hidden HTML fragments within webpages that subtly redirect LLMs away from disclosing sensitive data. PIIGuard demonstrated a defense success rate of at least 97.0% across several LLM models, including GPT-5.4-nano, Claude-haiku-4.5, and DeepSeek-chat, while maintaining the page's utility for standard question-answering tasks. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Offers a new method for website owners to protect user data from LLM-based scraping.

RANK_REASON Academic paper detailing a new method for mitigating PII leakage from LLMs.

Read on arXiv cs.CL →

COVERAGE [2]

  1. arXiv cs.CL TIER_1 · Mingshuo Liu, Yiwei Zha, Min Chen ·

    PIIGuard: Mitigating PII Harvesting under Adversarial Sanitization

    arXiv:2605.03129v1 Announce Type: cross Abstract: Browsing-enabled LLM assistants can fetch webpages and answer contact-seeking queries, creating a practical channel for scraping contact-style personally identifiable information (PII) from public pages. Many prior defenses are de…

  2. arXiv cs.CL TIER_1 · Min Chen ·

    PIIGuard: Mitigating PII Harvesting under Adversarial Sanitization

    Browsing-enabled LLM assistants can fetch webpages and answer contact-seeking queries, creating a practical channel for scraping contact-style personally identifiable information (PII) from public pages. Many prior defenses are deployed at the model, service, or agent layer rathe…