Română(RO) Multi-scale Object-Aware Gaze Estimation via Geometric Reasoning

新框架通过物体感知几何推理增强注视估计

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-30 04:00

研究人员开发了一种新颖的两阶段注视目标估计框架，该框架明确地融入了物体语义。这种方法超越了传统的像素级回归，首先编码物体级别的表示，将图像特征与不同的语义实体对齐。然后，该方法利用多尺度特征融合以及来自头部姿势和注视方向的几何约束，以实现更稳定和语义一致的预测，尤其是在复杂场景中。在 GazeFollow 和 GOO-Real 等多个基准测试上的实验表明，该模型在模型尺寸紧凑的情况下取得了具有竞争力的性能。 AI

影响这项研究通过提高注视跟踪系统的准确性和稳定性，可能带来更直观的人机交互。

排序理由这是一篇详细介绍注视估计新方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 Română(RO) · Jiajie Mi, Xinyu Liu, Mengke Song, Chenglizhao Chen · 2026-06-30 04:00

Multi-scale Object-Aware Gaze Estimation via Geometric Reasoning

arXiv:2606.29334v1 Announce Type: new Abstract: Gaze target estimation aims to predict the semantic object an observer fixates upon within an image, a task deeply rooted in the object-oriented nature of human gaze. Observers tend to select a specific semantic entity as the attent…

报道来源 [1]

Multi-scale Object-Aware Gaze Estimation via Geometric Reasoning

相关话题