New framework uses AI to guide human comparisons for efficient ranking

By PulseAugur Editorial · [1 sources] · 2026-06-16 04:00

Researchers have developed a novel human-in-the-loop ranking framework called Surprise-Guided MergeSort (SGS). This system uses a Vision-Language Model (VLM) to identify comparisons that genuinely require human judgment, rather than replacing human annotators entirely. SGS integrates a MergeSort scheduler, a surprise scorer that combines VLM confidence, Elo gap, and vote entropy, and an adaptive budget allocator to route high-surprise pairs to humans while automating low-surprise pairs. Tested on six benchmarks for text similarity and image quality assessment, SGS successfully skipped up to 535 non-informative comparisons per session, outperforming Active Elo by 6-12% in Kendall's tau under the same budget. AI

IMPACT This approach could significantly reduce the cost and time required for subjective ranking tasks by optimizing human annotation efforts.

RANK_REASON The cluster contains an academic paper detailing a new algorithm and framework for human-in-the-loop ranking. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

paper
other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Yujin Park, Haejun Chung, Ikbeom Jang · 2026-06-16 04:00

Surprise-Guided MergeSort: Budget-Efficient Human-in-the-Loop Ranking via Adaptive Comparison Scheduling

arXiv:2606.15623v1 Announce Type: cross Abstract: Pairwise comparison is the gold standard for subjective ranking tasks; however, exhaustive annotation requires a massive number of human comparisons ($O(n^2)$). While sorting-based methods have reduced this burden to $O(n\log n)$,…

COVERAGE [1]

Surprise-Guided MergeSort: Budget-Efficient Human-in-the-Loop Ranking via Adaptive Comparison Scheduling

RELATED ENTITIES

RELATED TOPICS