New methods tackle noisy labels in AI datasets

By PulseAugur Editorial · [2 sources] · 2026-05-18 03:19

Researchers have developed a new method called Standardized Loss Aggregation (SLA) to detect noisy labels in large datasets, particularly in medical imaging. SLA quantifies label reliability by analyzing standardized losses from cross-validation runs, offering a more continuous and informative measure than simple hard-counting methods. Experiments show SLA is more effective and faster at identifying ambiguous or mislabeled samples, which can help improve dataset quality for classification tasks. Another study highlights a problem called "uncertainty collapse" where models trained on noisy labels achieve high accuracy but fail to reliably distinguish out-of-distribution data from misclassified in-distribution data. AI

IMPACT New techniques for handling noisy labels can improve the reliability and robustness of AI models, especially in critical domains like medical imaging.

RANK_REASON The cluster contains two academic papers detailing new methods for handling noisy labels in machine learning.

Read on Hugging Face Daily Papers →

paper
safety

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

New methods tackle noisy labels in AI datasets

COVERAGE [2]

arXiv cs.AI TIER_1 English(EN) · Inhyuk Park, Doohyun Park · 2026-05-22 04:00

Task-Agnostic Noisy Label Detection via Standardized Loss Aggregation

arXiv:2605.10165v2 Announce Type: replace-cross Abstract: Noisy labels are common in large-scale medical imaging datasets due to inter-observer variability and ambiguous cases. We propose a statistically grounded and task-agnostic framework, Standardized Loss Aggregation (SLA), f…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-18 03:19

When Accuracy Is Not Enough: Uncertainty Collapse between Noisy Label Learning and Out-of-Distribution Detection

Learning with noisy labels (LNL) is typically benchmarked by closed-set classification accuracy, yet deployment often requires classifiers to reject out-of-distribution (OOD) inputs. We present a learner-agnostic ACC-OOD benchmark that freezes LNL checkpoints and evaluates them w…

COVERAGE [2]

Task-Agnostic Noisy Label Detection via Standardized Loss Aggregation

When Accuracy Is Not Enough: Uncertainty Collapse between Noisy Label Learning and Out-of-Distribution Detection

RELATED TOPICS