PulseAugur
实时 09:36:01
English(EN) AgentWard: A Lifecycle Security Architecture for Autonomous AI Agents

研究人员提出用于保护人工智能代理和多代理系统的新框架

2026年4月发布的多篇研究论文解决了自主AI代理系统中日益增长的安全挑战。这些论文提出了增强交互式AI代理的安全、可信赖和治理的框架和方法论,特别是在网络安全和企业系统等高风险领域。关键主题包括去中心化架构、形式化验证方法、运行时安全执行以及强大的审计机制,以减轻对抗性攻击、数据投毒和未经授权的操作等风险。 AI

影响 这些框架旨在提高AI代理的安全性与可信赖性,可能加速其在关键应用中的采用。

排序理由 多篇在arXiv上发表的研究论文提出了新的人工智能代理安全框架和方法论。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 30 个来源。 我们如何撰写摘要 →

研究人员提出用于保护人工智能代理和多代理系统的新框架

报道来源 [30]

  1. arXiv cs.AI TIER_1 English(EN) · Majed El Helou, Benjamin Ryder, Chiara Troiani, Jean Diaconu, Herv\'e Muyal, Marcelo Yannuzzi ·

    零信任代理AI中的混合检查和基于任务的访问控制

    arXiv:2605.02682v1 Announce Type: new Abstract: Authorizing Large Language Model (LLM)-driven agents to dynamically invoke tools and access protected resources introduces significant security risks, and the risks grow dramatically as agents engage in multi-turn conversations and …

  2. arXiv cs.AI TIER_1 English(EN) · Marcelo Yannuzzi ·

    零信任代理AI中的混合检查和基于任务的访问控制

    Authorizing Large Language Model (LLM)-driven agents to dynamically invoke tools and access protected resources introduces significant security risks, and the risks grow dramatically as agents engage in multi-turn conversations and scale toward distributed collaboration. A compro…

  3. arXiv cs.AI TIER_1 English(EN) · Luyao Xu, Xiang Chen ·

    面向自主代理框架的安全攻击与防御策略:以OpenClaw为例的分层综述

    arXiv:2604.27464v1 Announce Type: cross Abstract: Autonomous agent frameworks built upon large language models (LLMs) are evolving into complex, tool-integrated, and continuously operating systems, introducing security risks beyond traditional prompt-level vulnerabilities. As thi…

  4. arXiv cs.AI TIER_1 English(EN) · Akshay Mittal, Elyson De La Cruz ·

    Agent Name Service (ANS):Kubernetes 中安全 AI Agent 发现、身份和治理的 POC 信任层

    arXiv:2604.26997v1 Announce Type: cross Abstract: Autonomous AI agent ecosystems require stronger mechanisms for secure discovery, identity verification, capability attestation, and policy governance. Current deployments frequently lack (1) uniform agent discovery, (2) cryptograp…

  5. arXiv cs.AI TIER_1 English(EN) · Yuan Sun ·

    安全双层委托(SBD):多智能体系统中运行时委托安全的形式化框架

    arXiv:2604.27358v1 Announce Type: new Abstract: As large language model (LLM) agents are deployed in high-stakes environments, the question of how safely to delegate subtasks to specialized sub-agents becomes critical. Existing work addresses multi-agent architecture selection at…

  6. arXiv cs.AI TIER_1 English(EN) · Andrei Kojukhov, Arkady Bovshover ·

    Agentic AI for Cybersecurity: A Meta-Cognitive Architecture for Governable Autonomy

    arXiv:2602.11897v3 Announce Type: replace-cross Abstract: Cybersecurity decision-making increasingly occurs in environments characterized by uncertainty, partial observability, and adversarial manipulation, where heterogeneous signals from multiple sources are often incomplete, a…

  7. arXiv cs.AI TIER_1 English(EN) · Yu-Chao Huang, Zhen Tan, Mohan Zhang, Pingzhi Li, Zhuo Zhang, Tianlong Chen ·

    TRUST:去中心化AI服务的框架 v.0.1

    arXiv:2604.27132v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) and Multi-Agent Systems (MAS) in high-stakes domains demand reliable verification, yet centralized approaches suffer four limitations: (1) Robustness, with single points of failure vulnerable to attacks…

  8. arXiv cs.AI TIER_1 English(EN) · Christian Schroeder de Witt, Klaudia Krawiecka, Igor Krawczuk, Ben Hagag, William L. Anderson, Peter Belcak, Ben Bucknall, Xiaohong Cai, Ayush Chopra, Doron Cohen, Ron F. Del Rosario, Andis Draguns, Annie Gray, Keren Katz, Vasilios Mavroudis, Jaron Mink, ·

    多智能体安全中的开放性挑战:迈向交互式人工智能智能体安全系统

    arXiv:2505.02077v2 Announce Type: replace-cross Abstract: AI agents are beginning to interact with each other directly and across internet platforms and physical environments, creating security challenges beyond traditional cybersecurity and AI safety frameworks. Free-form protoc…

  9. arXiv cs.AI TIER_1 English(EN) · Hung Dang ·

    强制执行良性轨迹:结构化工作流AI代理的行为防火墙

    arXiv:2604.26274v1 Announce Type: cross Abstract: Structured-workflow agents driven by large language models execute tool calls against sensitive external environments. We propose \codename, a telemetry-driven behavioral anomaly detection firewall. Drawing on sequence-based intru…

  10. arXiv cs.AI TIER_1 English(EN) · Hung Dang ·

    强制执行良性轨迹:结构化工作流AI代理的行为防火墙

    Structured-workflow agents driven by large language models execute tool calls against sensitive external environments. We propose \codename, a telemetry-driven behavioral anomaly detection firewall. Drawing on sequence-based intrusion detection, \codename\ compiles verified benig…

  11. arXiv cs.AI TIER_1 English(EN) · Ignacio Peyrano ·

    从CRUD到自主代理:AI原生企业系统中语义网关的形式化验证与零信任安全

    Enterprise software engineering is shifting away from deterministic CRUD/REST architectures toward AI-native systems where large language models act as cognitive orchestrators. This transition introduces a critical security tension: probabilistic LLMs weaken classical mechanisms …

  12. arXiv cs.AI TIER_1 English(EN) · Yixiang Zhang, Xinhao Deng, Jiaqing Wu, Yue Xiao, Ke Xu, Qi Li ·

    AgentWard:自主人工智能代理的生命周期安全架构

    arXiv:2604.24657v1 Announce Type: cross Abstract: Autonomous AI agents extend large language models into full runtime systems that load skills, ingest external content, maintain memory, plan multi-step actions, and invoke privileged tools. In such systems, security failures rarel…

  13. arXiv cs.CL TIER_1 English(EN) · Mikko Lempinen, Joni Kemppainen, Niklas Raesalmi ·

    AVISE:评估人工智能系统安全性的框架

    arXiv:2604.20833v2 Announce Type: replace-cross Abstract: As artificial intelligence (AI) systems are increasingly deployed across critical domains, their security vulnerabilities pose growing risks of high-profile exploits and consequential system failures. Yet systematic approa…

  14. arXiv cs.AI TIER_1 English(EN) · Tam Nguyen, Moses Ndebugre, Dheeraj Arremsetty ·

    多智能体系统的安全考量

    arXiv:2603.09002v2 Announce Type: replace-cross Abstract: Multi-agent artificial intelligence systems or MAS are systems of autonomous agents that exercise delegated tool authority, share persistent memory, and coordinate via inter-agent communication. MAS introduces qualitativel…

  15. arXiv cs.AI TIER_1 English(EN) · Jiaqi Li, Yang Zhao, Bin Sun, Yang Yu, Jian Chang, Lidong Zhai ·

    海报:ClawdGo:自主人工智能代理的内生安全意识培训

    arXiv:2604.24020v1 Announce Type: cross Abstract: Autonomous AI agents deployed on platforms such as OpenClaw face prompt injection, memory poisoning, supply-chain attacks, and social engineering, yet existing defences address only the platform perimeter, leaving the agent's own …

  16. arXiv cs.LG TIER_1 English(EN) · Ben Hagag, William L. Anderson, Christian Schroeder de Witt, Sarah Scheffler ·

    架构对多智能体安全至关重要

    arXiv:2604.23459v1 Announce Type: cross Abstract: Multi-agent systems (MAS), composed of networks of two or more autonomous AI agents, have become increasingly popular in production deployments, yet introduce security risks that do not arise in single-agent settings. Even if indi…

  17. arXiv cs.LG TIER_1 English(EN) · Kexin Chu ·

    从无状态查询到自主行动:Agentic AI系统的分层安全框架

    arXiv:2604.23338v1 Announce Type: cross Abstract: Agentic AI systems face security challenges that stateless large language models do not. They plan across extended horizons, maintain persistent memory, invoke external tools, and coordinate with peer agents. Existing security ana…

  18. Hugging Face Daily Papers TIER_1 English(EN) ·

    面向不可信代理技能的结构化安全审计与鲁棒性增强

    Agent Skills package SKILL.md files, scripts, reference documents, and repository context into reusable capability units, turning pre-load auditing from single-prompt filtering into cross-file security review. Existing guardrails often flag risk but recover malicious intent incon…

  19. arXiv cs.AI TIER_1 English(EN) · Qi Li ·

    AgentWard:自主人工智能代理的生命周期安全架构

    Autonomous AI agents extend large language models into full runtime systems that load skills, ingest external content, maintain memory, plan multi-step actions, and invoke privileged tools. In such systems, security failures rarely remain confined to a single interface; instead, …

  20. Hugging Face Daily Papers TIER_1 English(EN) ·

    AI Agent 安全护栏的比较评估

    This report presents a comparative evaluation of DKnownAI Guard in AI agent security scenarios, benchmarked against three competing products: AWS Bedrock Guardrails, Azure Content Safety, and Lakera Guard. Using human annotation as the ground truth, we assess each guardrail's abi…

  21. Hugging Face Daily Papers TIER_1 English(EN) ·

    海报:ClawdGo:自主人工智能代理的内生安全意识培训

    Autonomous AI agents deployed on platforms such as OpenClaw face prompt injection, memory poisoning, supply-chain attacks, and social engineering, yet existing defences address only the platform perimeter, leaving the agent's own threat judgement entirely untrained. We present Cl…

  22. Hugging Face Daily Papers TIER_1 English(EN) ·

    通过分权架构实现AI代理目标完整性的结构化执行

    Recent evidence suggests that frontier AI systems can exhibit agentic misalignment, generating and executing harmful actions derived from internally constructed goals, even without explicit user requests. Existing mitigation methods, such as Reinforcement Learning from Human Feed…

  23. Mastodon — sigmoid.social TIER_1 English(EN) · [email protected] ·

    西方主要安全机构发布关于安全采用代理式AI服务的联合指南。关键考虑因素、新兴风险和最佳实践

    Joint guidance just released from leading Western security agencies on safely adopting agentic AI services. Key considerations, emerging risks, and best practices for implementation. Worth a read. #AI #AgenticAI #Cybersecurity #ASD #ACSC #CISA #NSA #CCCS #NCSCNZ #NCSCUK Careful A…

  24. Mastodon — sigmoid.social TIER_1 Italiano(IT) · [email protected] ·

    面向代理式企业的AI运行时安全 #AgenticAI #AgenticArtificialIntelligence #AI #ArtificialIntelligence #

    https://www. europesays.com/2954859/ AI runtime security for the agentic enterprise # AgenticAI # AgenticArtificialIntelligence # AI # ArtificialIntelligence # EnterprisesTurnToRuntimeSecurityToCloseTheAgenticAITrustGap # KellyKnight # SiliconANGLE

  25. Mastodon — sigmoid.social TIER_1 English(EN) · [email protected] ·

    SAS 为 Agentic AI 就绪的数据管理奠定基础 #AgenticAI #AgenticArtificialIntelligence #AI #ArtificialIntell

    https://www. europesays.com/2954857/ SAS cements agentic AI-ready data management foundation # AgenticAI # AgenticArtificialIntelligence # AI # ArtificialIntelligence

  26. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    CISA、NSA及合作伙伴机构发布了关于安全采用AI代理的最新指南。我们分析了这对身份控制、权限边界意味着什么,以及

    CISA, NSA, and partner agencies published fresh guidance on secure AI agent adoption. We analyzed what it means for identity controls, privilege boundaries, and deployment governance in real systems. https:// go.aintelligencehub.com/ma-usc yberagenciesaiagen # AI # Cybersecurity …

  27. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    西方主要安全机构发布关于安全采用代理式AI服务的联合指南。关键考虑因素、新兴风险和最佳实践

    Joint guidance just released from leading Western security agencies on safely adopting agentic AI services. Key considerations, emerging risks, and best practices for implementation. Worth a read. #AI #AgenticAI #Cybersecurity #ASD #ACSC #CISA #NSA #CCCS #NCSCNZ #NCSCUK Careful A…

  28. Mastodon — mastodon.social TIER_1 English(EN) · [email protected] ·

    Agentic AI 安全警报 ⚠️ CISA 指南:自主 AI 系统将攻击面扩展到传统控制之外。风险:• 未经检查的操作 • API 暴露 •

    Agentic AI security alert ⚠️ CISA guidance: Autonomous AI systems expand attack surface beyond traditional controls. Risks: • Unchecked actions • API exposure • Data leakage AI = operational entity now. Source: https://www. cisa.gov/resources-tools/resou rces/careful-adoption-age…

  29. Mastodon — mastodon.social TIER_1 Français(FR) · [email protected] ·

    自主AI代理编写代码、调用API、编排系统……通常无需人工监督。信任问题在于此合作

    Les agents IA autonomes écrivent du code, appellent des APIs, orchestrent des systèmes… souvent sans supervision humaine. La question de confiance dans cette couche "agentique" devient centrale en sécurité. Qui audite ce que l'agent a décidé tout seul à 3h du matin ? 🤖 La surface…

  30. Mastodon — mastodon.social TIER_1 English(EN) · [email protected] ·

    Megent Megent:AI代理的开源运行时策略。Megent通过强制执行允许/拒绝工具策略、屏蔽敏感数据和日志记录来帮助保护AI代理的安全

    Megent Megent: Open-Source Runtime Policy for AI Agents. Megent helps secure AI agents by enforcing allow/deny tool policies, masking sensitive data, and logging every tool decision. A zero-trust safety layer for AI agents. Wrap tools fast. Ship with confidence. Control every age…