English(EN) SF20K Competition 2025: Summary and findings

SF20K 竞赛表明叙事理解而非模型规模是视频问答的关键

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-05 04:00

首届 Short-Films 20K (SF20K) 竞赛在 ICCV 2025 期间举行，专注于通过开放式问答任务推进故事层面的视频理解。该竞赛使用业余短片作为基准，并由 GPT-4.1-nano 进行评估，共有 22 个团队提交了参赛作品。结果分析表明，叙事感知、镜头级处理和多阶段流水线比简单的帧采样更有效，并且字幕质量对性能有显著影响。 AI

影响强调信息选择和推理结构，而非原始模型容量，是长视频问答中的关键挑战。

排序理由这是学术论文中详细介绍的竞赛发现的总结。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Ridouane Ghermi, Xi Wang, Vicky Kalogeiton, Ivan Laptev · 2026-05-05 04:00

SF20K Competition 2025: Summary and findings

arXiv:2605.01496v1 Announce Type: new Abstract: This report presents the results and findings of the first edition of the Short-Films 20K (SF20K) Competition, held in conjunction with the SLoMO Workshop at ICCV 2025. The competition is designed to advance story-level video unders…

报道来源 [1]

SF20K Competition 2025: Summary and findings

相关实体

相关话题