EgoEMG: A Multimodal Egocentric Dataset with Bilateral EMG and Vision for Hand Pose Estimation

New model and dataset advance egocentric hand pose forecasting

By the PulseAugur editorial team · [2 sources] · 2026-05-08 04:00

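Researchers have introduced EggHand, a new multimodal foundation model for forecasting egocentric hand pose from video. The model combines semantic reasoning with dynamic motion modeling, using a vision-language-action decoder and an egocentric video-text encoder to understand intent and context without external tracking. Separately, the EgoEMG dataset and benchmark have been released to advance multimodal hand pose estimation by combining electromyography (EMG) with egocentric vision data. EgoEMG contains synchronized bilateral EMG, IMU, and multiple video streams, providing a comprehensive resource for developing and evaluating fusion models.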

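Impact: These advances in egocentric hand pose forecasting and multimodal fusion could enable more intuitive human-computer interaction in AR/VR and robotics.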

Ranking rationale: This cluster contains two research papers that introduce a new model and a new dataset for hand pose estimation.


AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

arXiv cs.CV TIER_1 English(EN) · Daehee Park · 2026-05-08 12:09

EggHand: A Multimodal Foundation Model for Egocentric Hand Pose Forecasting

Forecasting future 3D hand pose sequences from egocentric video is essential for understanding human intention and enabling embodied applications such as AR/VR assistance and human-robot interaction. However, this task remains a highly challenging problem because egocentric hand …

arXiv cs.CV TIER_1 English(EN) · Ziheng Xi, Jiayi Yu, Yitao Wang, Yanbo Duan, Jianjiang Feng, Jie Zhou · 2026-05-08 04:00

EgoEMG: A Multimodal Egocentric Dataset with Bilateral EMG and Vision for Hand Pose Estimation

arXiv:2605.05712v1. Abstract: Surface electromyography (sEMG) records muscle activity during hand movement and can be decoded to recover detailed hand articulation. EMG and egocentric vision are complementary for hand sensing: EMG captures fine-grained finger ar…
