English(EN) Don't use an LLM to decide what your AI agent is allowed to do

AI 安全专家警告不要使用基于 LLM 的代理操作控制

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-21 16:15

一位 AI 安全专家反对使用大型语言模型 (LLM) 作为 AI 代理操作的最终决策者，并提出了两个主要担忧。首先，使用一个 LLM 来判断另一个 LLM 的操作会引入相同的提示注入和操纵漏洞，本质上是创建了一个“模型监视模型”，具有相同的固有弱点。其次，LLM 内在的非确定性意味着关键操作可能一天被允许，第二天被拒绝，这使得它在需要一致、可审计结果的安全敏感决策方面不可靠。 AI

影响由于安全和可靠性问题，建议不应让 LLM 单独裁定 AI 代理的操作。

排序理由来自 AI 领域个人关于技术方法的观点文章。

在 dev.to — LLM tag 阅读 →

Aarm
LLM

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

dev.to — LLM tag TIER_1 English(EN) · Brian Hall · 2026-06-21 16:15

Don't use an LLM to decide what your AI agent is allowed to do

<p>I'm in a group called AARM. It's a bunch of people trying to work out how you actually secure what an AI agent can do once it's running, and the basic idea is that the control has to sit right at the action. You check a tool call before it runs, and the agent can't wriggle aro…

报道来源 [1]

Don't use an LLM to decide what your AI agent is allowed to do

相关实体

相关话题