MSD-Score metric improves image caption evaluation without references

By PulseAugur Editorial · [2 sources] · 2026-05-07 12:02

Researchers have developed MSD-Score, a novel method for evaluating image captions without needing reference captions. This approach models image patch and text token embeddings as distributions, enabling a more nuanced assessment of semantic discrepancies. MSD-Score achieves state-of-the-art correlation with human judgments and offers transparent diagnostics for local grounding errors. AI

IMPACT Introduces a new reference-free metric for image caption evaluation that correlates highly with human judgment.

RANK_REASON The cluster contains an academic paper detailing a new evaluation metric for image captions.

Read on arXiv cs.CV →

paper
other

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

MSD-Score metric improves image caption evaluation without references

COVERAGE [2]

arXiv cs.CV TIER_1 English(EN) · Shichao Kan, Xuyang Zhang, Haojie Zhang, Zhe Zhu, Yigang Cen, Yixiong Liang, Lianlei Shan, Linna Zhang, Zhe Qu, Jiazhi Xia · 2026-05-08 04:00

MSD-Score: Multi-Scale Distributional Scoring for Reference-Free Image Caption Evaluation

arXiv:2605.06080v1 Announce Type: new Abstract: Evaluating image captions without references remains challenging because global embedding similarity often misses fine-grained mismatches such as hallucinated objects, missing attributes, or incorrect relations. We propose MSD-Score…
arXiv cs.CV TIER_1 English(EN) · Jiazhi Xia · 2026-05-07 12:02

MSD-Score: Multi-Scale Distributional Scoring for Reference-Free Image Caption Evaluation

Evaluating image captions without references remains challenging because global embedding similarity often misses fine-grained mismatches such as hallucinated objects, missing attributes, or incorrect relations. We propose MSD-Score, a reference-free metric that models image patc…

COVERAGE [2]

MSD-Score: Multi-Scale Distributional Scoring for Reference-Free Image Caption Evaluation

MSD-Score: Multi-Scale Distributional Scoring for Reference-Free Image Caption Evaluation

RELATED ENTITIES

RELATED TOPICS