HuM-Eval framework improves human-centric video evaluation with coarse-to-fine approach

By PulseAugur Editorial · [1 sources] · 2026-04-28 08:27

Researchers have introduced HuM-Eval, a new framework designed to better evaluate the quality of human motion in generated videos. Current metrics often miss fine-grained human details, leading to evaluations that don't align with human preferences. HuM-Eval employs a coarse-to-fine approach, first using a Vision Language Model for a general assessment and then analyzing 2D pose for anatomical correctness and 3D motion for stability. This method achieved a 58.2% correlation with human judgment, surpassing existing benchmarks. AI

IMPACT Introduces a more accurate method for evaluating human motion in generated videos, potentially guiding future improvements in text-to-video models.

RANK_REASON Academic paper introducing a new evaluation framework for video generation models.

Read on Hugging Face Daily Papers →

paper
other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

HuM-Eval framework improves human-centric video evaluation with coarse-to-fine approach

COVERAGE [1]

Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-28 08:27

HuM-Eval: A Coarse-to-Fine Framework for Human-Centric Video Evaluation

Video generation models have developed rapidly in recent years, where generating natural human motion plays a pivotal role. However, accurately evaluating the quality of generated human motion video remains a significant challenge. Existing evaluation metrics primarily focus on g…

COVERAGE [1]

HuM-Eval: A Coarse-to-Fine Framework for Human-Centric Video Evaluation

RELATED TOPICS