English(EN) RayDer: Scalable Self-Supervised Novel View Synthesis from Real-World Video

RayDer transformer 扩展了真实世界视频的新视角合成能力

作者 PulseAugur 编辑部 · [3 个来源] · 2026-05-29 00:00

研究人员开发了 RayDer，这是一种新颖的 transformer 模型，旨在改进来自真实世界视频的自监督新视角合成。该统一模型将相机估计、场景重建和渲染整合到一个单一骨干网络中，从而能够对动态视频内容进行稳定训练。RayDer 在数据和计算方面表现出可预测的幂律缩放，在各种基准测试中取得了具有竞争力的零样本性能。 AI

影响通过利用通用视频数据，实现了更具可扩展性和鲁棒性的新视角合成，可能对 3D 重建和内容创作产生影响。

排序理由该集群包含一篇详细介绍新模型及其性能的学术论文。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。我们如何撰写摘要 →

报道来源 [3]

arXiv cs.AI TIER_1 English(EN) · Ulrich Prestel, Stefan Andreas Baumann, Nick Stracke, Bj\"orn Ommer · 2026-06-01 04:00

RayDer：来自真实世界视频的可扩展自监督新视角合成

arXiv:2605.31535v1 Announce Type: cross Abstract: Self-supervised novel view synthesis (NVS) remains challenging to scale, despite the abundance of video data, largely due to the brittleness of training on realistic videos and the hard-to-predict scaling behavior of multi-network…
arXiv cs.AI TIER_1 English(EN) · Björn Ommer · 2026-05-29 16:50

RayDer：来自真实世界视频的可扩展自监督新视角合成

Self-supervised novel view synthesis (NVS) remains challenging to scale, despite the abundance of video data, largely due to the brittleness of training on realistic videos and the hard-to-predict scaling behavior of multi-network system designs. We introduce RayDer, a unified, f…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-29 00:00

RayDer：来自真实世界视频的可扩展自监督新视角合成

RayDer is a unified feed-forward transformer that consolidates camera estimation, scene reconstruction, and rendering for self-supervised novel view synthesis, enabling stable training on real-world video through dynamic state absorption and demonstrating clean scaling behavior.

报道来源 [3]

RayDer：来自真实世界视频的可扩展自监督新视角合成

RayDer：来自真实世界视频的可扩展自监督新视角合成

RayDer：来自真实世界视频的可扩展自监督新视角合成

相关话题