PulseAugur
EN
LIVE 09:30:47

New method predicts and mitigates order sensitivity in AI adjudication

Researchers have developed a new method called Quantified Martingale Violation (QMV) to address order sensitivity in transformer models used for evidence-based decision-making. This approach aims to reduce unreliable answers by formalizing an expectation-realization gap, where training minimizes expected description length across evidence permutations while a fixed ordering remains position-sensitive. The method introduces metrics like Bits-to-Trust (B2T) and Risk-of-Hallucination (RoH) to help determine when a model should provide an answer or abstain, showing promising results on several datasets. AI

IMPACT Introduces a framework to improve reliability in AI systems that make decisions based on evidence, potentially reducing hallucinations.

RANK_REASON The cluster contains a new academic paper detailing a novel research method. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv stat.ML →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv stat.ML TIER_1 English(EN) · Leon Chlon, Ahmed Karim, Maggie Chlon, MarcAntonio Awada ·

    Predictable Compression Failures: Order Sensitivity and Information Budgeting for Evidence-Grounded Binary Adjudication

    arXiv:2509.11208v3 Announce Type: replace Abstract: Transformers used for evidence-grounded binary adjudication (e.g., support/refute, yes/no, or verifier-backed pass/fail decisions) can be sensitive to the order in which exchangeable evidence is presented, producing dispersion a…