English(EN) Where Do We (Not) Need Temporal Context in Low-Resource Video Task Adaptation?

研究探讨低资源视频模型适应中的时间上下文

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-02 16:19

研究人员对用于视频理解任务的基础模型适应进行了系统性研究，特别是在低资源场景下。该研究调查了参数高效微调（PEFT）和探测方法，并比较了适应预训练图像模型的方法与直接适应视频表示的方法。主要发现强调了在数据有限的情况下，策略性地将时间上下文分布在不同模型组件中的重要性，以实现有效的视频适应。 AI

影响为在数据有限的情况下优化视频模型适应提供了见解，有可能提高视频理解应用的效率。

排序理由这是一篇发表在arXiv上的研究论文，详细介绍了模型适应策略在视频理解方面的系统性研究。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CV TIER_1 English(EN) · Luc P. J. Str\"ater, Hazel Doughty · 2026-06-03 04:00

Where Do We (Not) Need Temporal Context in Low-Resource Video Task Adaptation?

arXiv:2606.03837v1 Announce Type: new Abstract: Parameter-efficient fine-tuning (PEFT) and probing enable adaptation of foundation models using only a small number of trainable parameters, making it attractive for video understanding where annotation and computation are expensive…
arXiv cs.CV TIER_1 English(EN) · Hazel Doughty · 2026-06-02 16:19

Where Do We (Not) Need Temporal Context in Low-Resource Video Task Adaptation?

Parameter-efficient fine-tuning (PEFT) and probing enable adaptation of foundation models using only a small number of trainable parameters, making it attractive for video understanding where annotation and computation are expensive. However, video PEFT has focused on adapting im…