Researchers have developed a new method to identify optimal subsets of training data, particularly when dealing with label noise. This approach leverages data symmetries and invariance properties to improve the accuracy of k-nearest neighbors (k-NN) in selecting low-noise samples. The findings suggest that exploiting these underlying symmetries can lead to performance comparable to training on noise-free datasets, even in high-dimensional settings. AI
影响 Improves robustness of models trained on potentially noisy real-world datasets.
排序理由 Academic paper detailing a novel method for data selection in machine learning. [lever_c_demoted from research: ic=1 ai=1.0]
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →