New AVBench benchmark evaluates audio-video generation models

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-26 04:00

Researchers have introduced AVBench, a new automated benchmark designed to evaluate audio-video generative models, particularly those focused on human-centric scenarios. The benchmark incorporates fine-grained metrics across visual quality, audio quality, and cross-modal consistency, aiming to capture details often missed by existing evaluations. AVBench utilizes specialized evaluators trained through preference learning on a large dataset, deriving continuous scores from binary decisions to better align with human judgment and serve as a reward signal for RLHF. AI

影响 Provides a more accurate and automated method for assessing the capabilities of audio-video generative models.

排序理由 The cluster contains a research paper introducing a new benchmark for evaluating AI models. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Jialiang Yang, Bin Xia, Ruihang Chu, Dingdong Wang, Wanke Xia, Zhun Mou, Tianyang Zhong, Yiting Zhao, Wenming Yang · 2026-05-26 04:00

AVBench: Human-Aligned and Automated Evaluation Benchmark for Audio-Video Generative Models

arXiv:2605.24652v1 Announce Type: new Abstract: Rapid advances in audio-video (AV) generation have enabled high-fidelity synthesis with synchronized sound, particularly for human-related scenarios involving speech and interactions. Yet evaluation for AV generation remains at an e…

报道来源 [1]

AVBench: Human-Aligned and Automated Evaluation Benchmark for Audio-Video Generative Models

相关实体

相关话题