Using large language models for embodied planning introduces systematic safety risks

By PulseAugur Editorial Team · [2 sources] · 2026-05-05 04:00

A new benchmark, DESPITE, systematically evaluates the safety risks of using large language models for embodied planning in robotics. The research indicates that even models with high planning accuracy can exhibit significant safety failures, and that safety awareness does not scale proportionally with model size. The findings suggest that improving safety awareness is a critical challenge for deploying LLM-based planners in real-world robotic systems.

Impact: Highlights critical safety challenges for LLM-based robotic planners, emphasizing the need to improve danger avoidance rather than planning ability alone.

Ranking rationale: The cluster contains two arXiv papers on safety risks in AI: one on LLMs in embodied planning, and one a broader survey of safety in embodied AI.

AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

arXiv cs.LG TIER_1 English(EN) · Tao Zhang, Kaixian Qu, Zhibin Li, Jiajun Wu, Marco Hutter, Manling Li, Fan Shi · 2026-05-05 04:00

Using large language models for embodied planning introduces systematic safety risks

arXiv:2604.18463v2 Announce Type: replace-cross Abstract: Large language models are increasingly used as planners for robotic systems, yet how safely they plan remains an open question. To evaluate safe planning systematically, we introduce DESPITE, a benchmark of 12,279 tasks sp…
arXiv cs.CV TIER_1 English(EN) · Xiao Li, Xiang Zheng, Yifeng Gao, Xinyu Xia, Yixu Wang, Xin Wang, Ye Sun, Yunhan Zhao, Ming Wen, Jiayu Li, Xun Gong, Yi Liu, Yige Li, Yutao Wu, Cong Wang, Jun Sun, Yixin Cao, Zhineng Chen, Jingjing Chen, Tao Gui, Qi Zhang, Zuxuan Wu, Xipeng Qiu, Xuanjing · 2026-05-06 04:00

Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses

arXiv:2605.02900v1 Announce Type: cross Abstract: Embodied Artificial Intelligence (Embodied AI) integrates perception, cognition, planning, and interaction into agents that operate in open-world, safety-critical environments. As these systems gain autonomy and enter domains such…
