PulseAugur
实时 13:29:01
English(EN) Noise-Aware Framework for Correcting Corrupted Labels

新框架解决机器学习数据集中的损坏标签问题

两篇新的研究论文介绍了一种用于识别和纠正机器学习数据集中损坏标签的框架。CANOLARelabeler 都旨在通过精炼噪声数据来提高模型性能,其中 CANOLA 专注于噪声感知学习和迭代软标签精炼,而 Relabeler 使用局部和全局数据关系进行检测和纠正。两种方法在实验中都显示出比现有技术显著的改进,从而提高了下游任务的性能。 AI

影响 这些框架提高的数据质量可能带来更强大、更准确的各种应用中的人工智能模型。

排序理由 两篇在 arXiv 上发表的学术论文介绍了纠正机器学习数据集中损坏标签的新方法。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Ha-Linh Nguyen, Hong-Anh Nguyen, Minh-Duc La, Phong Lam, Thu-Trang Nguyen, Son Nguyen, Hieu Dinh Vo ·

    Noise-Aware Framework for Correcting Corrupted Labels

    arXiv:2606.11695v1 Announce Type: cross Abstract: High-quality labeled data is essential for training reliable ML/DL models. However, real-world datasets often contain a considerable proportion of corrupted labels, which can severely degrade model performance. To address this pro…

  2. arXiv cs.LG TIER_1 English(EN) · Ha-Linh Nguyen, Hong-Anh Nguyen, Minh-Duc La, Thu-Trang Nguyen, Son Nguyen, Hieu Dinh Vo ·

    A Data-Centric Framework for Detecting and Correcting Corrupted Labels

    arXiv:2606.11699v1 Announce Type: new Abstract: The performance of machine learning and deep learning models largely depends on the quality of the training data. However, the quality of the real-world datasets is often compromised by noisy labels, which can substantially degrade …