English(EN) AnyScene: Towards Highly Controllable Driving Scene Generation at Anywhere and Beyond

新AI框架增强自动驾驶场景生成

作者 PulseAugur 编辑部 · [6 个来源] · 2026-05-25 17:59

研究人员推出多个用于生成逼真且可控驾驶场景的新框架，这对于训练自动驾驶汽车至关重要。DriveWAM将视频扩散Transformer适配到自回归动作策略的创建中，整合了场景理解和记忆以实现长时规划。AnyScene提供了一个统一的以占用为中心的模型，能够从任意BEV布局进行精确控制，并生成时间上一致的多视图视频。DriveGen3D结合了高效视频扩散与3D场景重建，用于高质量、可控的动态场景，支持长驾驶视频和3D表示。此外，还策划了一个名为Nuplan-Occ的新数据集，以促进自动驾驶领域的大规模生成建模和下游应用。 AI

影响合成数据生成方面的这些进步，通过提供更逼真和可控的训练环境，有望加速自动驾驶系统的开发和测试。

排序理由多篇研究论文介绍了用于驾驶场景生成的新方法和数据集。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 6 个来源。我们如何撰写摘要 →

报道来源 [6]

arXiv cs.CV TIER_1 English(EN) · Chen Shi, Jinrui Xu, Shaoshuai Shi, Kehua Sheng, Bo Zhang, Li Jiang · 2026-05-28 04:00

DriveWAM: Video Generative Priors Enable Scalable World-Action Modeling for Autonomous Driving

arXiv:2605.28544v1 Announce Type: new Abstract: Pretrained foundation models have become an important basis for end-to-end autonomous driving. In contrast to vision-language models pretrained primarily on static image-text pairs, video generative models capture temporal dynamics …
arXiv cs.CV TIER_1 English(EN) · Li Jiang · 2026-05-27 14:36

DriveWAM: Video Generative Priors Enable Scalable World-Action Modeling for Autonomous Driving

Pretrained foundation models have become an important basis for end-to-end autonomous driving. In contrast to vision-language models pretrained primarily on static image-text pairs, video generative models capture temporal dynamics and motion priors that are naturally suited for …
arXiv cs.CV TIER_1 English(EN) · Haiming Zhang, Junfei Zhou, Feng Jiang, Jingzhong Li, Zhenglong Guo, Penglin Dai, Jifeng Dai, Yan Xie, Benjin Zhu · 2026-05-26 04:00

AnyScene: Towards Highly Controllable Driving Scene Generation at Anywhere and Beyond

arXiv:2605.26113v1 Announce Type: cross Abstract: Generating high-fidelity and controllable synthetic data is critical for advancing end-to-end autonomous driving, particularly for addressing the long tail of rare safety-critical scenarios. Existing occupancy-guided methods typic…
arXiv cs.CV TIER_1 English(EN) · Weijie Wang, Jiagang Zhu, Zeyu Zhang, Xiaofeng Wang, Zheng Zhu, Guosheng Zhao, Chaojun Ni, Haoxiao Wang, Guan Huang, Xinze Chen, Yukun Zhou, Wenkang Qin, Duochao Shi, Haoyun Li, Yicheng Xiao, Donny Y. Chen, Jiwen Lu · 2026-05-26 04:00

DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion

arXiv:2510.15264v3 Announce Type: replace Abstract: We present DriveGen3D, a novel framework for generating high-quality and highly controllable dynamic 3D driving scenes that addresses critical limitations in existing methodologies. Current approaches to driving scene synthesis …
arXiv cs.CV TIER_1 English(EN) · Bohan Li, Xin Jin, Hu Zhu, Hongsi Liu, Ruikai Li, Jiazhe Guo, Kaiwen Cai, Chao Ma, Yueming Jin, Hao Zhao, Xiaokang Yang, Wenjun Zeng · 2026-05-26 04:00

Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method

arXiv:2510.22973v2 Announce Type: replace Abstract: Driving scene generation is a critical domain for autonomous driving, enabling downstream applications, including perception and planning evaluation. Occupancy-centric methods have recently achieved state-of-the-art results by o…
arXiv cs.CV TIER_1 English(EN) · Benjin Zhu · 2026-05-25 17:59

AnyScene: Towards Highly Controllable Driving Scene Generation at Anywhere and Beyond

Generating high-fidelity and controllable synthetic data is critical for advancing end-to-end autonomous driving, particularly for addressing the long tail of rare safety-critical scenarios. Existing occupancy-guided methods typically rely on shallow conditioning mechanisms and r…

报道来源 [6]

DriveWAM: Video Generative Priors Enable Scalable World-Action Modeling for Autonomous Driving

DriveWAM: Video Generative Priors Enable Scalable World-Action Modeling for Autonomous Driving

AnyScene: Towards Highly Controllable Driving Scene Generation at Anywhere and Beyond

DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion

Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method

AnyScene: Towards Highly Controllable Driving Scene Generation at Anywhere and Beyond

相关实体

相关话题