PulseAugur
DreamX-World 1.0: A General-Purpose Interactive World Model

DreamX-World 1.0 unveiled for interactive long-horizon video generation

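Researchers have introduced DreamX-World 1.0, a general-purpose interactive world model that generates long-horizon video with scene persistence and camera control. The model relies on a new data engine that combines Unreal Engine renders, gameplay recordings, and real-world video, together with a new positional encoding method called E-PRoPE that provides camera awareness. DreamX-World 1.0 reaches 16 FPS on eight RTX 5090 GPUs and outperforms existing models such as HY-WorldPlay 1.5 and LingBot-World on overall score.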

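Impact: Enables more controllable and more persistent long-horizon video generation, with potential implications for creative industries and virtual environments.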

Ranking rationale: This cluster describes a research paper detailing a new interactive world model for video generation.

Read on Hugging Face Daily Papers →

AI-generated summary · Google Gemini · from 3 sources.

Sources [3]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    DreamX-World 1.0: A General-Purpose Interactive World Model

    DreamX-World 1.0 is an interactive text/image-to-video model that generates long-horizon content with camera control and scene persistence using specialized encoding, training techniques, and optimization methods.

  2. arXiv cs.CV TIER_1 English(EN) · DreamX Team, Yancheng Bai, Rui Chen, Xiangxiang Chu, Rujing Dang, Hao Dou, Bingjie Gao, Qiwen Gu, Siyu Hong, Jiachen Lei, Geng Li, Jifan Li, Ruimin Lin, Qingfeng Shi, Bingze Song, Lei Sun, Jing Tang, Ruitian Tian, Jun Wang, Jiahong Wu, Pengfei Zhang, She… ·

    DreamX-World 1.0: A General-Purpose Interactive World Model

    arXiv:2606.16993v1. Abstract: DreamX-World 1.0 is a general-purpose interactive text/image-to-video world model for controllable long-horizon generation. It supports camera navigation, revisits to previously observed regions, and promptable events across photore…

  3. arXiv cs.CV TIER_1 English(EN) · Jiashu Zhu ·

    DreamX-World 1.0: A General-Purpose Interactive World Model

    DreamX-World 1.0 is a general-purpose interactive text/image-to-video world model for controllable long-horizon generation. It supports camera navigation, revisits to previously observed regions, and promptable events across photorealistic, game-style, and stylized domains. Our d…