PulseAugur
实时 09:23:54
English(EN) When Does Synthetic Data Augmentation Improve Score-Based Imbalanced Classification?

新研究分析合成数据增强在类别不平衡分类中的应用

一篇新论文探讨了合成数据增强对基于分数的分类的理论影响,特别是在类别不平衡的情况下。该研究提出了一个框架,用于确定何时此类增强可以提高 AUROCAUPRC 和 F1 分数等指标。研究结果表明,在理想条件下,增强除了方差减少外,改进效果甚微,但当分数模型被错误指定时,通过调整类别平衡和纠正排名错误,它可以带来益处。 AI

影响 为使用类别不平衡数据集改进分类模型提供了理论见解,可能指导未来的数据增强策略。

排序理由 该集群包含一篇发表在 arXiv 上的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv stat.ML 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。 我们如何撰写摘要 →

新研究分析合成数据增强在类别不平衡分类中的应用

报道来源 [3]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    When Does Synthetic Data Augmentation Improve Score-Based Imbalanced Classification?

    Synthetic data augmentation is widely used to mitigate class imbalance, but its theoretical effects on score-based classification remain poorly understood. This paper develops a framework for characterizing when synthetic minority augmentation can improve threshold-integrated and…

  2. arXiv stat.ML TIER_1 English(EN) · Zhengchi Ma, Pengfei Lyu, Anru R. Zhang ·

    When Does Synthetic Data Augmentation Improve Score-Based Imbalanced Classification?

    arXiv:2606.26053v1 Announce Type: new Abstract: Synthetic data augmentation is widely used to mitigate class imbalance, but its theoretical effects on score-based classification remain poorly understood. This paper develops a framework for characterizing when synthetic minority a…

  3. arXiv stat.ML TIER_1 English(EN) · Anru R. Zhang ·

    When Does Synthetic Data Augmentation Improve Score-Based Imbalanced Classification?

    Synthetic data augmentation is widely used to mitigate class imbalance, but its theoretical effects on score-based classification remain poorly understood. This paper develops a framework for characterizing when synthetic minority augmentation can improve threshold-integrated and…