PulseAugur
实时 13:36:27
English(EN) GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction

GenRecon 利用生成式先验推动三维场景重建

研究人员开发了 GenRecon,一种将生成式三维先验与多视图图像条件相结合的新型三维场景重建方法。该方法将场景重建视为在局部块上进行条件式三维生成,从而能够继承 Trellis.2 等最先进的生成式形状模型的保真度。该方法实现了高保真、多视图一致的几何和可编辑的 PBR 网格重建,性能比现有方法提高了 16%。此外,一个用于自动驾驶的新框架利用映射先验来改进三维物体检测,在 Waymo Open Dataset 上取得了最先进的成果。 AI

影响 三维场景重建和三维检测的进步为自动驾驶和虚拟环境创建等应用提供了更强的能力。

排序理由 该集群包含两篇研究论文,详细介绍了三维场景重建和三维检测方面的新方法。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 8 个来源。 我们如何撰写摘要 →

报道来源 [8]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction

    TriSplat is a feed-forward 3D reconstruction network that uses oriented triangle primitives to directly generate simulation-ready meshes from single images, bypassing expensive post-processing steps.

  2. Hugging Face Daily Papers TIER_1 English(EN) ·

    GenRecon:连接生成先验以实现多视图三维场景重建

    A novel method for 3D scene reconstruction that integrates generative 3D priors with multi-view image conditioning to produce high-fidelity, editable mesh reconstructions of indoor environments.

  3. arXiv cs.CV TIER_1 English(EN) · Wanhee Lee, Klemen Kotar, Rahul Mysore Venkatesh, Jared Watrous, Honglin Chen, Khai Loong Aw, Daniel L. K. Yamins ·

    Unified 3D Scene Understanding Through Physical World Modeling

    arXiv:2605.24321v1 Announce Type: new Abstract: Understanding 3D scenes requires flexible combinations of visual reasoning tasks, including depth estimation, novel view synthesis, and object manipulation, all of which are essential for perception and interaction. Existing approac…

  4. arXiv cs.CV TIER_1 English(EN) · Weijie Wang, Zimu Li, Jinchuan Shi, Zeyu Zhang, Botao Ye, Marc Pollefeys, Donny Y. Chen, Bohan Zhuang ·

    TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction

    arXiv:2605.26115v1 Announce Type: new Abstract: Sparse-view 3D reconstruction is increasingly addressed with feed-forward splatting networks that predict explicit primitives directly from images. Yet most existing methods remain centered on Gaussian primitives and expose surfaces…

  5. arXiv cs.CV TIER_1 English(EN) · Bohan Zhuang ·

    TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction

    Sparse-view 3D reconstruction is increasingly addressed with feed-forward splatting networks that predict explicit primitives directly from images. Yet most existing methods remain centered on Gaussian primitives and expose surfaces only indirectly: extracting a usable mesh for d…

  6. arXiv cs.CV TIER_1 English(EN) · Yang Fu, Yuliang Zou, Hao Xiang, Xin Huang, Yijing Bai, Chen Song, Weijing Shi, Govind Thattai, Dragomir Anguelov, Mingxing Tan, Yingwei Li ·

    场景重建作为3D检测的映射先验

    arXiv:2605.22997v1 Announce Type: new Abstract: In autonomous driving, mapping is critical for motion planning but remains an under-utilized resource for perception tasks such as 3D object detection. Maps can provide robust structural priors of the static environment, helping res…

  7. arXiv cs.CV TIER_1 English(EN) · Katharina Schmid, Nicolas von L\"utzow, Jozef Hladk\'y, Angela Dai, Matthias Nie{\ss}ner ·

    GenRecon:为多视图三维场景重建连接生成先验

    arXiv:2605.23888v1 Announce Type: new Abstract: We introduce a new approach to high-fidelity 3D scene reconstruction from multi-view RGB images that tightly couples reconstruction with a strong generative 3D prior. We cast scene reconstruction as conditional 3D generation over a …

  8. arXiv cs.CV TIER_1 English(EN) · Matthias Nießner ·

    GenRecon:连接生成先验以进行多视图三维场景重建

    We introduce a new approach to high-fidelity 3D scene reconstruction from multi-view RGB images that tightly couples reconstruction with a strong generative 3D prior. We cast scene reconstruction as conditional 3D generation over a set of spatially-localized, overlapping chunks t…