PulseAugur
EN
LIVE 12:47:54

New framework tackles LLM data contamination using uncertainty

Researchers have introduced Uncertainty-Based Debiasing and Unlearning (UBD), a novel framework for evaluating and mitigating data contamination in large language models (LLMs). Unlike previous methods that rely solely on aggregate accuracy, UBD employs a sample-level evaluation using distributional distance metrics. This approach leverages deep ensembles of the contaminated model to estimate per-sample memorization and uses ensemble uncertainty to construct a debiased target distribution. Experiments on MMLU-Pro and MATH-MCQA benchmarks show that UBD effectively reduces inflated performance metrics caused by contamination, while preserving model performance on uncontaminated data. AI

IMPACT Provides a more robust method for evaluating LLM performance by addressing data contamination, leading to more reliable benchmarks.

RANK_REASON Academic paper introducing a new methodology for LLM evaluation. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New framework tackles LLM data contamination using uncertainty

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Mark Gales ·

    Uncertainty-based Debiasing and Unlearning for Decontamination

    Benchmark-based evaluation is the dominant paradigm for assessing large language model (LLM) capabilities, yet data contamination inflates reported performance and undermines fair comparison. Existing decontamination methods are evaluated solely through aggregate accuracy, which …