PulseAugur
实时 20:19:57
English(EN) DEGround: An Effective Baseline for Ego-centric 3D Visual Grounding with a Homogeneous Framework

新框架MCM-VG和DEGround推动零样本3D视觉基础研究

研究人员开发了两个新框架DEGround和MCM-VG,以改进以自我为中心的3D视觉基础(ego-centric 3D visual grounding),这是具身智能的关键任务。DEGround利用一个同质化管道,在检测和基础之间共享对象表示,提高了效率和性能。MCM-VG通过建立多个一致的2D-3D映射来实现精确的定位并减少空间冗余,从而解决了零样本3D视觉基础的挑战。这两种方法在各种基准测试中都取得了最先进的结果,显著优于以前的方法。 AI

影响 3D视觉基础的进步可能会加速更强大的具身AI代理和机器人的开发。

排序理由 两篇新的学术论文介绍了用于3D视觉基础任务的新颖框架。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。 我们如何撰写摘要 →

新框架MCM-VG和DEGround推动零样本3D视觉基础研究

报道来源 [3]

  1. arXiv cs.CV TIER_1 English(EN) · Yufei Yin, Jie Zheng, Qianke Meng, Zhou Yu, Minghao Chen, Jiajun Ding, Min Tan, Yuling Xi, Zhiwen Chen, Chengfei Lv ·

    多重一致性二维-三维映射实现鲁棒的零样本三维视觉定位

    arXiv:2604.26261v1 Announce Type: new Abstract: Zero-shot 3D Visual Grounding (3DVG) is a critical capability for open-world embodied AI. However, existing methods are fundamentally bottlenecked by the poor quality of open-vocabulary 3D proposals, suffering from inaccurate catego…

  2. arXiv cs.CV TIER_1 English(EN) · Yani Zhang, Dongming Wu, Hao Shi, Yingfei Liu, Tiancai Wang, Xingping Dong ·

    DEGround:一种使用同质化框架的有效视锥3D视觉基础基线

    arXiv:2506.05199v3 Announce Type: replace Abstract: A core task in embodied intelligence is ego-centric 3D visual grounding. Existing methods typically adopt two-stage, heterogeneous pipelines that pair a detector with a separate grounding model. Incompatible decoders and box hea…

  3. arXiv cs.CV TIER_1 English(EN) · Chengfei Lv ·

    多重一致性二维-三维映射实现鲁棒的零样本三维视觉定位

    Zero-shot 3D Visual Grounding (3DVG) is a critical capability for open-world embodied AI. However, existing methods are fundamentally bottlenecked by the poor quality of open-vocabulary 3D proposals, suffering from inaccurate categories and imprecise geometries, as well as the sp…