English(EN) CoWorld-VLA: Thinking in a Multi-Expert World Model for Autonomous Driving

新的VLA框架改进自动驾驶规划

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-11 12:01

研究人员推出CoWorld-VLA，一个旨在增强端到端自动驾驶系统的新型框架。这种多专家世界推理方法将互补的世界信息编码到Vision-Language-Action模型中的专家token。这些token显式地模拟了语义交互、几何结构、动态演化和自我轨迹，作为可访问的动作规划条件信号。在NAVSIM v1基准上的实验表明，CoWorld-VLA在场景生成和规划方面具有竞争力，特别是在避碰和轨迹精度方面。 AI

影响通过提供显式的、规划器可访问的动作生成条件信号来增强自动驾驶系统。

排序理由发布了一篇关于自动驾驶新框架的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Gong Che · 2026-05-11 12:01

CoWorld-VLA：面向自动驾驶的多专家世界模型思考

Vision-Language-Action (VLA) models have emerged as a promising paradigm for end-to-end autonomous driving. However, existing reasoning mechanisms still struggle to provide planning-oriented intermediate representations: textual Chain-of-Thought (CoT) fails to preserve continuous…

报道来源 [1]

CoWorld-VLA：面向自动驾驶的多专家世界模型思考

相关实体

相关话题