Data Balancing Strategies: A Systematic Survey of Resampling and Augmentation Methods

By PulseAugur Editorial · Summary by None from 1 source

This paper presents a systematic review of data balancing strategies for machine learning, covering resampling and augmentation techniques. It categorizes methods from foundational approaches like SMOTE to advanced deep generative models and ensemble strategies. The review highlights that optimal method selection is highly dependent on dataset characteristics and evaluation metrics, and it identifies future research directions such as adapting foundation models to skewed distributions. AI

Summary written by None from 1 source. How we write summaries →

IMPACT Provides a comprehensive overview of techniques to improve model performance on imbalanced datasets, crucial for many real-world applications.

RANK_REASON This is a systematic review paper published on arXiv.

Read on arXiv stat.ML →

paper
other

COVERAGE [1]

arXiv stat.ML TIER_1 · Behnam Yousefimehr, Mehdi Ghatee, Javad Fazli, Shervin Ghaffari, Zahra Rafei, Mohammad Amin Seifi, Sajed Tavakoli, Abolfazl Nikahd, Mahdi Razi Gandomani, Alireza Orouji, Ramtin Mahmoudi Kashani, Sarina Heshmati, Negin Sadat Mousavi · 2026-04-30 04:00

Data Balancing Strategies: A Systematic Survey of Resampling and Augmentation Methods

arXiv:2505.13518v2 Announce Type: replace Abstract: Imbalanced datasets, where one class significantly outnumbers others, remain a persistent challenge in machine learning, often biasing predictions toward the majority class and degrading classifier performance. This paper provid…

COVERAGE [1]

Data Balancing Strategies: A Systematic Survey of Resampling and Augmentation Methods

RELATED ENTITIES

RELATED TOPICS