PulseAugur
实时 10:26:48
English(EN) OWMDrive: Causality-Aware End-to-End Autonomous Driving via 4D Occupancy World Model

新的自动驾驶模型使用世界建模实现更安全、更鲁棒的规划 · 跟踪2个来源

两篇新的研究论文介绍了用于端到端自动驾驶的先进世界建模技术。OWMDrive 专注于 4D 占用世界模型,用于多步 3D 占用预测,以指导基于扩散的规划,旨在实现更具前瞻性和鲁棒性的轨迹生成,尤其是在挑战性场景中。ExploreVLA 将世界建模与强化学习相结合,以实现超越专家演示的策略探索,使用未来图像生成作为密集世界建模目标和新颖性检测的内在奖励信号。 AI

影响 这些世界建模方法旨在提高自动驾驶系统在复杂和不可预测的交通场景中的安全性和适应性。

排序理由 arXiv 上发表了两篇研究论文,详细介绍了自动驾驶的新方法。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

新的自动驾驶模型使用世界建模实现更安全、更鲁棒的规划 · 跟踪2个来源

报道来源 [4]

  1. arXiv cs.CV TIER_1 English(EN) · Daniele De Martini ·

    PriorEye: Geospatial Visual Priors for End-to-End Autonomous Driving

    Most end-to-end autonomous driving methods rely solely on instantaneous sensor observations, limiting them to reactive behavior without the anticipatory foresight human drivers employ through prior experience. We introduce geospatial visual priors, street-level visual context anc…

  2. arXiv cs.CV TIER_1 English(EN) · Junjie Cheng, Ruiqi Song, Ye Wu, Nanxing Zeng, Ximiao Li, Yunfeng Ai ·

    OWMDrive:通过4D占用世界模型实现因果感知端到端自动驾驶

    arXiv:2606.30421v1 Announce Type: new Abstract: Autonomous driving systems are steadily moving toward end-to-end paradigms to mitigate the limited adaptability of rule-based pipelines in complex traffic environments. However, most existing learning-based methods still make decisi…

  3. arXiv cs.CV TIER_1 English(EN) · Zihao Sheng, Xin Ye, Jingru Luo, Sikai Chen, Liu Ren ·

    ExploreVLA:端到端自动驾驶的密集世界建模与探索

    arXiv:2604.02714v2 Announce Type: replace Abstract: End-to-end autonomous driving models based on Vision-Language-Action (VLA) architectures have shown promising results by learning driving policies through behavior cloning on expert demonstrations. However, imitation learning in…

  4. arXiv cs.CV TIER_1 English(EN) · Yunfeng Ai ·

    OWMDrive:通过4D占用世界模型实现因果感知端到端自动驾驶

    Autonomous driving systems are steadily moving toward end-to-end paradigms to mitigate the limited adaptability of rule-based pipelines in complex traffic environments. However, most existing learning-based methods still make decisions from static representations of the current s…