PulseAugur
实时 12:57:13

新框架验证了学习到的多智能体通信策略的安全性

研究人员开发了一个新颖的框架,用于形式化验证多智能体强化学习(MARL)系统中学习到的通信策略的安全性。该方法将复杂的神经网络策略提炼成可解释的决策树,然后使用PRISM等概率模型检查器对其进行严格验证。该框架已成功证明了多无人机协调的安全属性,并且验证的属性可以转移到原始神经网络上。 AI

影响 增强了多智能体系统的信任和安全性,这对于自动驾驶车队和无人机群等应用至关重要。

排序理由 该集群包含两篇arXiv论文,详细介绍了多智能体强化学习和形式化验证技术方面的新研究。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

新框架验证了学习到的多智能体通信策略的安全性

报道来源 [4]

  1. arXiv cs.AI TIER_1 English(EN) · Ahmad Farooq, Kamran Iqbal ·

    Formal Verification of Learned Multi-Agent Communication Policies via Decision Tree Distillation

    arXiv:2606.19632v1 Announce Type: cross Abstract: Multi-agent reinforcement learning (MARL) enables agents to develop coordination strategies through emergent communication, but neural policies lack the formal safety guarantees required for safety-critical robotic deployment in d…

  2. arXiv cs.MA (Multiagent) TIER_1 English(EN) · Kamran Iqbal ·

    通过决策树蒸馏对学习到的多智能体通信策略进行形式化验证

    Multi-agent reinforcement learning (MARL) enables agents to develop coordination strategies through emergent communication, but neural policies lack the formal safety guarantees required for safety-critical robotic deployment in drone swarms and autonomous vehicle fleets. We pres…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    Formal Verification of Learned Multi-Agent Communication Policies via Decision Tree Distillation

    Multi-agent reinforcement learning (MARL) enables agents to develop coordination strategies through emergent communication, but neural policies lack the formal safety guarantees required for safety-critical robotic deployment in drone swarms and autonomous vehicle fleets. We pres…

  4. arXiv cs.MA (Multiagent) TIER_1 English(EN) · Almond Kiruthu Murimi ·

    BARD-MARL:多智能体强化学习中学习通信的拜占庭智能体检测

    Learned communication improves coordination in cooperative multi-agent reinforcement learning, but it also creates a trust problem: a trained policy may route information through agents that have become faulty or adversarial. This paper studies Byzantine-agent detection for learn…