新框架通过手部交互视频重建三维物体

作者 PulseAugur 编辑部 · [3 个来源] · 2026-06-02 04:00

两篇新的研究论文介绍了一种从以自我为中心的视频中重建三维物体的新颖框架，重点关注手部交互。第一个框架ROHIT使用约束优化和传播（COP）框架来模拟稳定抓握期间的物体姿态。第二个框架AGILE采用由视觉语言模型指导的代理生成方法来创建水密网格，绕过了传统的运动恢复结构方法。 AI

影响这些方法可以通过实现对真实世界交互更准确的三维物体重建，从而改进机器人和VR的数字孪生。

排序理由 arXiv上发表了两篇学术论文，提出了新的三维物体重建方法。

AI 生成摘要 · Google Gemini · 来自 3 个来源。我们如何撰写摘要 →

报道来源 [3]

arXiv cs.CV TIER_1 English(EN) · Dingbang Huang, Etienne Vouga, Qixing Huang, Georgios Pavlakos · 2026-06-05 04:00

从单目视频中恢复物理上可行的人与物体交互

arXiv:2606.05359v1 Announce Type: new Abstract: In this paper, we propose RePHO, a method to reconstruct physically plausible human-object interactions (HOI) from monocular videos. While existing kinematic-based approaches produce visually plausible motion, they often result in p…
arXiv cs.CV TIER_1 English(EN) · Zhifan Zhu, Siddhant Bansal, Shashank Tripathi, Dima Damen · 2026-06-03 04:00

从单目视频中重建手部交互时间线上的物体

arXiv:2512.07394v2 Announce Type: replace Abstract: We introduce the task of Reconstructing Objects along Hand Interaction Timelines (ROHIT). We first define the Hand Interaction Timeline (HIT) from a rigid object's perspective. In a HIT, an object is first static relative to the…
arXiv cs.CV TIER_1 English(EN) · Jin-Chuan Shi, Binhong Ye, Tao Liu, Junzhe He, Yangjinhui Xu, Xiaoyang Liu, Zeju Li, Hao Chen, Chunhua Shen · 2026-06-02 04:00

AGILE：通过代理生成从视频进行手部-物体交互重建

arXiv:2602.04672v4 Announce Type: replace Abstract: Reconstructing dynamic hand-object interactions from monocular videos is critical for dexterous manipulation data collection and creating realistic digital twins for robotics and VR. However, current methods face two prohibitive…