Brief · PulseAugur

TOOL · arXiv cs.CL English(EN) · 4d

Spiking the training data to correct for test set contamination

Researchers have proposed a method called "spiking" to correct for test set contamination in machine learning evaluations. The technique deliberately introduces known levels of contamination into the training data, which makes it possible to calibrate memorization predictors against ground truth. The calibrated predictors can then be used to statistically correct inflated test scores, giving a principled basis for more accurate assessments of model performance.
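To make the idea concrete, here is a minimal toy sketch of how a calibrate-then-correct step could look. It is not the paper's procedure: the brief does not describe the estimator, so this sketch swaps in a standard prevalence correction (Rogan-Gladen style) as a stand-in. All names (`Calibration`, `calibrate`, `corrected_accuracy`), the threshold-based memorization detector, and the numbers are hypothetical. It also assumes that genuinely contaminated test items behave like the spiked ones.

```python
from dataclasses import dataclass
from typing import Iterable, Sequence


@dataclass
class Calibration:
    tpr: float               # share of spiked items the detector flags
    fpr: float               # share of clean control items the detector flags
    acc_contaminated: float  # model accuracy on spiked (known-contaminated) items


def _mean(values: Iterable[float]) -> float:
    values = list(values)
    if not values:
        raise ValueError("cannot average an empty group")
    return sum(values) / len(values)


def calibrate(scores: Sequence[float], is_spiked: Sequence[bool],
              correct: Sequence[bool], threshold: float) -> Calibration:
    """Measure the detector on items whose contamination status is known."""
    spiked_flags = [s >= threshold for s, c in zip(scores, is_spiked) if c]
    clean_flags = [s >= threshold for s, c in zip(scores, is_spiked) if not c]
    spiked_correct = [ok for ok, c in zip(correct, is_spiked) if c]
    return Calibration(_mean(spiked_flags), _mean(clean_flags), _mean(spiked_correct))


def corrected_accuracy(cal: Calibration, test_scores: Sequence[float],
                       test_correct: Sequence[bool], threshold: float) -> float:
    """Estimate accuracy on the uncontaminated part of a test set."""
    if cal.tpr <= cal.fpr:
        raise ValueError("detector is uninformative (tpr <= fpr)")
    flagged = _mean(s >= threshold for s in test_scores)
    # Flag rate = pi * tpr + (1 - pi) * fpr, solved for the contamination rate pi.
    pi = (flagged - cal.fpr) / (cal.tpr - cal.fpr)
    pi = min(max(pi, 0.0), 1.0)
    if pi >= 1.0:
        raise ValueError("estimated contamination is 100%; nothing to correct to")
    observed = _mean(test_correct)
    # Observed accuracy = pi * acc_contaminated + (1 - pi) * acc_clean.
    return (observed - pi * cal.acc_contaminated) / (1.0 - pi)


# Toy calibration set: four spiked items followed by four clean controls.
cal_scores = [0.9, 0.8, 0.7, 0.2, 0.1, 0.3, 0.4, 0.6]
cal_spiked = [True] * 4 + [False] * 4
cal_correct = [True, True, True, True, True, False, True, False]
cal = calibrate(cal_scores, cal_spiked, cal_correct, threshold=0.5)

# Toy test set with unknown contamination: observed accuracy is 0.8.
test_scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.1, 0.2, 0.3, 0.4, 0.45]
test_correct = [True] * 8 + [False] * 2
estimate = corrected_accuracy(cal, test_scores, test_correct, threshold=0.5)
print(round(estimate, 3))
```

With these toy numbers the detector flags half the test set, the estimated contamination rate is 0.5, and the observed accuracy of 0.8 is adjusted down to roughly 0.6. On real data the estimate is noisy and can fall outside [0, 1], so a practical version would report it with uncertainty rather than as a single number.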

IMPACT Offers a statistical method for more reliable evaluation of ML models by correcting scores inflated by contaminated test data.

arXiv
Hubble models