New benchmark reveals AI agents leak private data via query aggregation

By PulseAugur Editorial · [1 sources] · 2026-06-01 04:00

Researchers have identified a significant privacy risk in AI agents that combine private documents with external tools, such as web searches. This risk, termed the "mosaic effect," occurs when individual queries seem innocuous but reveal sensitive information when aggregated. A new benchmark, MosaicLeaks, was developed to test this vulnerability across 1,001 tasks. Experiments showed that current AI models frequently leak private information, with standard privacy prompts offering only partial mitigation and performance-focused reinforcement learning exacerbating the issue. A novel RL framework, PA-DR, was introduced to balance task success with privacy, successfully improving accuracy and reducing leakage in tests. AI

IMPACT Highlights critical privacy vulnerabilities in AI agents, necessitating new training methods like PA-DR to secure sensitive enterprise data.

RANK_REASON The cluster contains a research paper detailing a new benchmark and methodology for evaluating privacy risks in AI agents. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New benchmark reveals AI agents leak private data via query aggregation

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Alexander Gurung, Spandana Gella, Alexandre Drouin, Issam H. Laradji, Perouz Taslakian, Rafael Pardinas · 2026-06-01 04:00

MosaicLeaks:Privacy Risks in Querying-in-the-Open for Deep Research Agents

arXiv:2605.30727v1 Announce Type: new Abstract: Deep research agents increasingly combine private local documents with external tools like web retrieval, creating a privacy risk: an agent's external queries may leak sensitive information from its local context. This risk is ampli…

COVERAGE [1]

MosaicLeaks:Privacy Risks in Querying-in-the-Open for Deep Research Agents

RELATED TOPICS