English(EN) Rethinking Graph Convolution for 2D-to-3D Hand Pose Lifting

自注意力机制在3D手部姿态提升方面优于图卷积

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-13 14:39

研究人员重新评估了图卷积网络（GCNs）在2D到3D手部姿态估计中的应用，发现标准的Multi-head自注意力模型表现更好。通过在FPHA基准上的对照实验，自注意力机制将平均每关节位置误差（MPJPE）从12.36毫米降低到10.09毫米，优于GCNs。研究表明，自适应空间注意力比固定的图卷积更适合这项任务，而手部拓扑结构在作为软结构先验时最有益。 AI

影响引入了一种更有效的三维手部姿态估计方法，有望改进机器人和增强现实领域的应用。

排序理由该集群包含一篇详细介绍计算机视觉新研究发现的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Youngjoong Kwon · 2026-05-13 14:39

Rethinking Graph Convolution for 2D-to-3D Hand Pose Lifting

Graph convolutional networks (GCNs) are widely used for 3D hand pose estimation, where the hand skeleton is encoded as a fixed adjacency graph. We revisit whether this is the most effective way to incorporate hand topology in 2D-to-3D lifting. In this paper, we perform controlled…

报道来源 [1]

Rethinking Graph Convolution for 2D-to-3D Hand Pose Lifting

相关实体

相关话题