English(EN) Research reveals that large language models can silently corrupt documents when users delegate editing tasks. A study testing 19 LLMs found that even top models

研究发现：LLM在编辑任务中会悄无声息地损坏文档

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-08 21:53

一项最新研究发现，大型语言模型在被赋予编辑任务时，可能会无意中损坏文档。研究人员测试了包括Gemini Pro和Claude Opus在内的19个LLM，发现在20次交互后，这些模型大约会修改25%的内容。研究表明，能力较弱的模型倾向于删除内容，而更复杂的模型则会引入看似合理但错误的信息，并且随着上下文窗口增大和文件类型复杂化，文档损坏会加剧。 AI

影响凸显了AI代理在执行文档编辑任务时存在的一个关键安全隐患，可能影响用户信任和数据完整性。

排序理由该集群报告了LLM行为研究的发现。[lever_c_demoted from research: ic=1 ai=1.0]

在 Mastodon — fosstodon.org 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-06-08 21:53

Research reveals that large language models can silently corrupt documents when users delegate editing tasks. A study testing 19 LLMs found that even top models

Research reveals that large language models can silently corrupt documents when users delegate editing tasks. A study testing 19 LLMs found that even top models like Gemini Pro and Claude Opus corrupted 25% of content after 20 interactions. Weaker models delete content while smar…

链接 kdnuggets.com/why-do-llms-corrupt-your-do…

报道来源 [1]

Research reveals that large language models can silently corrupt documents when users delegate editing tasks. A study testing 19 LLMs found that even top models

相关实体

相关话题