English(EN) QEVA: A Reference-Free Evaluation Metric for Narrative Video Summarization with Multimodal Question Answering

新的QEVA指标提供无参考视频摘要评估

作者 PulseAugur 编辑部 · [3 个来源] · 2026-04-27 05:18

研究人员推出了一种新颖的无参考指标QEVA，用于评估叙事视频摘要。与依赖人工编写摘要的先前方法不同，QEVA通过多模态问答直接将摘要与源视频进行比较来评估摘要。该新指标评估摘要的覆盖度、事实性和时序性，并附带了一个名为MLVU(VS)-Eval的新基准数据集。 AI

影响引入了一个新的视频摘要评估框架，可能改进多模态AI系统的开发。

排序理由该集群描述了一篇介绍视频摘要新评估指标的新学术论文。

AI 生成摘要 · Google Gemini · 来自 3 个来源。我们如何撰写摘要 →

报道来源 [3]

Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-27 05:18

QEVA：一种用于多模态问答叙事视频摘要的无参考评估指标

Video-to-text summarization remains underexplored in terms of comprehensive evaluation methods. Traditional n-gram overlap-based metrics and recent large language model (LLM)-based approaches depend heavily on human-written reference summaries, limiting their practicality and sen…
arXiv cs.CV TIER_1 English(EN) · Woojun Jung, Junyeong Kim · 2026-04-28 04:00

QEVA：一种用于多模态问答叙事视频摘要的无参考评估指标

arXiv:2604.24052v1 Announce Type: new Abstract: Video-to-text summarization remains underexplored in terms of comprehensive evaluation methods. Traditional n-gram overlap-based metrics and recent large language model (LLM)-based approaches depend heavily on human-written referenc…
arXiv cs.CV TIER_1 English(EN) · Junyeong Kim · 2026-04-27 05:18

QEVA：一种用于多模态问答叙事视频摘要的无参考评估指标

Video-to-text summarization remains underexplored in terms of comprehensive evaluation methods. Traditional n-gram overlap-based metrics and recent large language model (LLM)-based approaches depend heavily on human-written reference summaries, limiting their practicality and sen…