PulseAugur
实时 19:03:24
English(EN) QEVA: A Reference-Free Evaluation Metric for Narrative Video Summarization with Multimodal Question Answering

新的QEVA指标提供无参考视频摘要评估

研究人员推出了一种新颖的无参考指标QEVA,用于评估叙事视频摘要。与依赖人工编写摘要的先前方法不同,QEVA通过多模态问答直接将摘要与源视频进行比较来评估摘要。该新指标评估摘要的覆盖度、事实性和时序性,并附带了一个名为MLVU(VS)-Eval的新基准数据集。 AI

影响 引入了一个新的视频摘要评估框架,可能改进多模态AI系统的开发。

排序理由 该集群描述了一篇介绍视频摘要新评估指标的新学术论文。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。 我们如何撰写摘要 →

新的QEVA指标提供无参考视频摘要评估

报道来源 [3]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    QEVA: A Reference-Free Evaluation Metric for Narrative Video Summarization with Multimodal Question Answering

    Video-to-text summarization remains underexplored in terms of comprehensive evaluation methods. Traditional n-gram overlap-based metrics and recent large language model (LLM)-based approaches depend heavily on human-written reference summaries, limiting their practicality and sen…

  2. arXiv cs.CV TIER_1 English(EN) · Woojun Jung, Junyeong Kim ·

    QEVA: A Reference-Free Evaluation Metric for Narrative Video Summarization with Multimodal Question Answering

    arXiv:2604.24052v1 Announce Type: new Abstract: Video-to-text summarization remains underexplored in terms of comprehensive evaluation methods. Traditional n-gram overlap-based metrics and recent large language model (LLM)-based approaches depend heavily on human-written referenc…

  3. arXiv cs.CV TIER_1 English(EN) · Junyeong Kim ·

    QEVA: A Reference-Free Evaluation Metric for Narrative Video Summarization with Multimodal Question Answering

    Video-to-text summarization remains underexplored in terms of comprehensive evaluation methods. Traditional n-gram overlap-based metrics and recent large language model (LLM)-based approaches depend heavily on human-written reference summaries, limiting their practicality and sen…