OrbitForge 通过视频合成和高斯溅射从文本生成 3D 场景

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-23 16:50

研究人员开发了 OrbitForge，一种利用文本到视频模型和高斯溅射从文本提示生成 3D 场景的新方法。OrbitForge 通过使用 3D 重建作为锚点来提高时间和空间一致性，将文本生成的视频转换为一致的 3D 场景。这种方法避免了特定任务的微调和每个提示的优化，同时还解决了 3D 场景生成中覆盖感知评估的需求。 AI

影响这项研究可能会推动从文本创建 3D 资产，影响虚拟现实和游戏开发等领域。

排序理由该集群包含一篇详细介绍 3D 场景生成新方法的 ist 研究论文。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Chenrui Fan, Paolo Favaro · 2026-06-24 04:00

OrbitForge：通过重建锚定的视频合成实现文本到3D场景生成

arXiv:2606.24799v1 Announce Type: cross Abstract: Generic text-to-video models can be used as rich open-world scene priors. Despite the high quality of today's generated videos, they do not directly yield reliable 3D assets: camera motion is difficult to control, view coverage is…
arXiv cs.AI TIER_1 English(EN) · Paolo Favaro · 2026-06-23 16:50

OrbitForge：通过重建锚定视频合成实现文本到3D场景生成

Generic text-to-video models can be used as rich open-world scene priors. Despite the high quality of today's generated videos, they do not directly yield reliable 3D assets: camera motion is difficult to control, view coverage is partial, and frames often contain inconsistencies…

报道来源 [2]

OrbitForge：通过重建锚定的视频合成实现文本到3D场景生成

OrbitForge：通过重建锚定视频合成实现文本到3D场景生成

相关实体

相关话题