New benchmark reveals safety risks in LLM long-term memory

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-04 04:00

A new benchmark called PersistBench has been developed to evaluate the safety risks associated with long-term memory integration in large language models. The benchmark identifies two key risks: cross-domain leakage, where irrelevant stored information is injected into conversations, and memory-induced sycophancy, where biases are reinforced. Testing revealed significant failure rates across 18 frontier and open-source LLMs, highlighting the need for more robust memory management in conversational AI. AI

影响 Highlights critical safety vulnerabilities in LLM memory systems, potentially impacting the deployment of personalized AI assistants.

排序理由 The cluster contains an academic paper introducing a new benchmark for evaluating LLM safety. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Sidharth Pulipaka, Oliver Chen, Manas Sharma, Taaha S Bajwa, Vyas Raina, Ivaxi Sheth · 2026-06-04 04:00

PersistBench: When Should Long-Term Memories Be Forgotten by LLMs?

arXiv:2602.01146v2 Announce Type: replace Abstract: Conversational assistants are increasingly integrating long-term memory with large language models (LLMs). This persistence of memories, e.g., the user is vegetarian, can enhance personalization in future conversations. However,…

报道来源 [1]

PersistBench: When Should Long-Term Memories Be Forgotten by LLMs?

相关实体

相关话题