GemDepth框架通过几何嵌入增强3D视频深度估计

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-11 13:11

研究人员开发了GemDepth，一个旨在改进3D一致视频深度估计的新框架。与以往常常模糊精细细节或出现时间不一致的方法不同，GemDepth明确地整合了相机运动和全局3D结构。其几何嵌入模块预测帧间相机姿态，以创建隐式几何嵌入，增强模型对3D的感知和对齐能力。这种方法实现了更精确的空间细节和严格的时间一致性，在各种数据集上取得了最先进的成果。 AI

影响引入了一种新颖的3D一致视频深度估计方法，可能改进AR/VR和机器人领域的应用。

排序理由该集群包含一篇详细介绍视频深度估计新框架的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Xin Yang · 2026-05-11 13:11

GemDepth: Geometry-Embedded Features for 3D-Consistent Video Depth

Video depth estimation extends monocular prediction into the temporal domain to ensure coherence. However, existing methods often suffer from spatial blurring in fine-detail regions and temporal inconsistencies. We argue that current approaches, which primarily rely on temporal s…

报道来源 [1]

GemDepth: Geometry-Embedded Features for 3D-Consistent Video Depth

相关实体

相关话题