Researchers have developed PIIGuard, a novel webpage-level defense designed to prevent large language models (LLMs) from harvesting personally identifiable information (PII). The system embeds hidden HTML fragments in a webpage, and these fragments subtly steer LLMs away from disclosing sensitive data. PIIGuard achieved a defense success rate of at least 97.0% across several models, including GPT-5.4-nano, Claude-haiku-4.5, and DeepSeek-chat, while preserving the page's utility for standard question-answering tasks.
Summary written by gemini-2.5-flash-lite from 2 sources.
IMPACT Offers a new method for website owners to protect user data from LLM-based scraping.
RANK_REASON Academic paper detailing a new method for mitigating PII leakage from LLMs.