PulseAugur
EN
LIVE 09:30:48

New DPC-VQA framework uses MLLMs for efficient video quality assessment

Researchers have developed DPC-VQA, a new framework for video quality assessment that leverages multimodal large language models (MLLMs). This approach decouples the perceptual capabilities of a frozen MLLM from a lightweight calibration branch, allowing for efficient adaptation to new scenarios without extensive retraining. DPC-VQA demonstrates competitive performance on both user-generated and AI-generated content benchmarks while significantly reducing trainable parameters and the need for MOS labels. AI

RANK_REASON The cluster contains an academic paper detailing a new research framework for video quality assessment. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CV TIER_1 English(EN) · Xinyue Li, Shubo Xu, Zhichao Zhang, Zhaolin Cai, Yitong Chen, Guangtao Zhai ·

    DPC-VQA: Decoupling Quality Perception and Residual Calibration for Video Quality Assessment

    arXiv:2604.12813v2 Announce Type: replace Abstract: Recent multimodal large language models (MLLMs) have shown promising performance on video quality assessment (VQA) tasks. However, adapting them to new scenarios remains expensive due to large-scale retraining and costly mean op…