New benchmark reveals enterprise LLM agents leak sensitive data

作者 PulseAugur 编辑部 · [1 个来源] · 2026-04-23 06:00

A new benchmark called CI-Work has been developed to assess the contextual integrity of enterprise LLM agents, focusing on their ability to handle sensitive information. Evaluations of current leading models show significant privacy failures, with violation rates between 15.8% and 50.9%. The research highlights a trade-off where improved task utility often leads to increased privacy risks, suggesting that current scaling approaches are insufficient for secure enterprise deployment. AI

影响 Highlights critical privacy risks in enterprise LLM agents, necessitating new context-aware architectures for secure deployment.

排序理由 Academic paper introducing a new benchmark for LLM agents.

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Dongmei Zhang · 2026-04-23 06:00

CI-Work: Benchmarking Contextual Integrity in Enterprise LLM Agents

Enterprise LLM agents can dramatically improve workplace productivity, but their core capability, retrieving and using internal context to act on a user's behalf, also creates new risks for sensitive information leakage. We introduce CI-Work, a Contextual Integrity (CI)-grounded …

报道来源 [1]

CI-Work: Benchmarking Contextual Integrity in Enterprise LLM Agents

相关实体

相关话题