English(EN) FROST-STA: Frozen Dense Features for the Ego4D Short-Term Object Interaction Anticipation

FROST-STA 系统预测自我中心视频中的目标交互

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-02 04:00

研究人员开发了 FROST-STA，一个用于自我中心视频短期预测的系统，旨在预测目标交互。该模型使用来自 ViT-G 主干的冻结密集特征，提取视频和图像 token，然后进行融合和解码以预测目标框、标签和接触时间。FROST-STA 在 Ego4D 短期目标交互预测挑战赛中获得第二名，证明了预训练特征在交互预测中的有效性。 AI

影响展示了一种新颖的自我中心视频分析方法，可能改进人机交互和自主系统。

排序理由该集群包含一篇详细介绍新模型及其在特定挑战中表现的研究论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Chaoyang Wang, Lexuan Xu · 2026-06-02 04:00

FROST-STA: 冻结的稠密特征用于 Ego4D 短期目标交互预测

arXiv:2606.00694v1 Announce Type: new Abstract: Short-term anticipation in egocentric video requires more than recognizing the current scene: a system must infer which object the camera wearer will contact, which action will follow, and how soon the contact will happen. This repo…

报道来源 [1]

FROST-STA: 冻结的稠密特征用于 Ego4D 短期目标交互预测

相关实体

相关话题