This paper presents a systematic review of data balancing strategies for machine learning, covering resampling and augmentation techniques. It categorizes methods from foundational approaches like SMOTE to advanced deep generative models and ensemble strategies. The review highlights that optimal method selection is highly dependent on dataset characteristics and evaluation metrics, and it identifies future research directions such as adapting foundation models to skewed distributions. AI
IMPACT Provides a comprehensive overview of techniques to improve model performance on imbalanced datasets, crucial for many real-world applications.
RANK_REASON This is a systematic review paper published on arXiv.
- AMDO
- Balanced Random Forest
- GANs
- MWMOTE
- One-Sided Selection
- RUSBoost
- SMOTEBoost
- SMOTE-ENN
- SMOTE+OCSVM
- SMOTE-Tomek
- Tomek Links
- Diffusion Models
- NearMiss
- SMOTE
- Variational Autoencoders
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →