Researchers have developed PIIGuard, a novel webpage-level defense system designed to prevent large language models (LLMs) from harvesting personally identifiable information (PII). This system embeds hidden HTML fragments within webpages that subtly redirect LLMs away from disclosing sensitive data. PIIGuard demonstrated a defense success rate of at least 97.0% across several LLM models, including GPT-5.4-nano, Claude-haiku-4.5, and DeepSeek-chat, while maintaining the page's utility for standard question-answering tasks. AI
IMPACT Offers a new method for website owners to protect user data from LLM-based scraping.
RANK_REASON Academic paper detailing a new method for mitigating PII leakage from LLMs.
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →