New benchmark PPaint fuses preference and rating data for aesthetic scoring

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have introduced PPaint, a matched dual-protocol benchmark for image aesthetic assessment (IAA) that collects both pairwise preferences and pointwise ratings from experts. Comparing the two protocols shows that preferences yield more consistent rankings, while ratings anchor the absolute score scale. Fusing the two signals produces a unified expert ground truth, and the authors extend the same principle to training vision-language models (VLMs) without labels. Their self-distillation method improved an open-source VLM's aesthetic scoring enough to match a closed-source model's performance at lower inference cost.
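To make the "preferences order, ratings anchor" idea concrete, here is a minimal Python sketch. It is not the paper's fusion procedure, which this summary does not describe. It assumes one simple scheme: rank images by pairwise win rate, then hand out the sorted mean ratings in that rank order, so the ordering comes from preferences and the score scale comes from ratings. The function names `win_rates` and `fuse_scores` and the toy data are hypothetical.

```python
def win_rates(n_items, comparisons):
    """Fraction of comparisons each item won.

    comparisons is a list of (winner, loser) index pairs.
    Items that were never compared get a neutral 0.5.
    """
    wins = [0] * n_items
    games = [0] * n_items
    for winner, loser in comparisons:
        wins[winner] += 1
        games[winner] += 1
        games[loser] += 1
    return [w / g if g else 0.5 for w, g in zip(wins, games)]


def fuse_scores(comparisons, ratings):
    """Illustrative fusion (an assumption, not the paper's method).

    ratings is a list with one list of pointwise scores per item.
    The returned scores follow the preference ordering but take their
    values from the sorted mean ratings.
    """
    n_items = len(ratings)
    mean_ratings = [sum(r) / len(r) for r in ratings]
    pref = win_rates(n_items, comparisons)
    # Order items from worst to best by win rate; break ties by mean rating.
    order = sorted(range(n_items), key=lambda i: (pref[i], mean_ratings[i]))
    # The k-th ranked item receives the k-th smallest mean rating.
    anchors = sorted(mean_ratings)
    fused = [0.0] * n_items
    for rank, item in enumerate(order):
        fused[item] = anchors[rank]
    return fused


# Toy data: preferences put image 0 above image 1, ratings say the opposite.
comparisons = [(0, 1), (0, 2), (1, 2), (0, 1)]
ratings = [[6, 7], [7, 8], [3, 4]]
print(fuse_scores(comparisons, ratings))  # [7.5, 6.5, 3.5]
```

In the toy data the raters score image 1 higher on average, but image 0 wins every head-to-head comparison, so the fused scores place image 0 first while staying on the original rating scale.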


IMPACT Introduces a new benchmark and training method that significantly improves VLM aesthetic scoring, potentially impacting content generation and curation tools.

RANK_REASON The cluster describes a new academic paper introducing a novel benchmark and training methodology for image aesthetic assessment.


COVERAGE [1]

arXiv cs.CV TIER_1 · Tangjie Lv · 2026-05-19 12:44

Preferences Order, Ratings Anchor: From Fused Expert Aesthetic Ground Truth to Self-Distillation

Pairwise preferences and pointwise ratings are the two dominant annotation protocols in image aesthetic assessment (IAA), yet existing benchmarks adopt only one, leaving their complementarity unmeasured under controlled conditions. We introduce PPaint, a matched dual-protocol ben…
