English(EN) Content-Aware Attack Detection in LLM Agent Tool-Call Traffic: An Empirical Study of Features, Architectures, and Evaluation Protocols

LLM Agent 工具调用流量检测框架采用图神经网络

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-25 04:00

研究人员开发了一个新颖的框架，用于检测大型语言模型 (LLM) Agent 的工具调用流量中的攻击。该系统将 Agent 会话表示为图，并结合工具参数和响应的句子嵌入特征来将流量分类为良性或恶意。研究发现，内容级特征对于有效检测至关重要，其性能显著优于仅使用元数据的方法，并指出了一个可能夸大性能指标的常见评估陷阱。 AI

影响这项研究通过检测恶意的工具使用，引入了一种更强大的保护 LLM Agent 的方法，这可能会提高与外部服务交互的 AI 系统的安全性和可靠性。

排序理由学术论文，详细介绍了 LLM Agent 工具调用流量的新检测框架。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Sultan Zavrak · 2026-05-25 04:00

LLM 代理工具调用流量中的内容感知攻击检测：特征、架构和评估协议的实证研究

arXiv:2605.11053v3 Announce Type: replace-cross Abstract: The Model Context Protocol (MCP) has become a widely adopted interface for LLM agents to invoke external tools, yet learned monitoring of MCP tool-call traffic remains underexplored. In this article, the proposed detector …

报道来源 [1]

LLM 代理工具调用流量中的内容感知攻击检测：特征、架构和评估协议的实证研究

相关实体

相关话题