PulseAugur
EN
LIVE 03:16:23

New GAN model tackles class imbalance in tabular data

Researchers have developed ctdGAN, a novel conditional Generative Adversarial Network designed to address class imbalance in tabular datasets. This new model partitions input samples into clusters and employs a probabilistic sampling strategy to generate synthetic data within these identified subspaces. The method also incorporates a cluster-wise scaling technique to capture multiple feature modes and a loss function that penalizes mis-predictions at both the cluster and class levels. Evaluations on 14 imbalanced datasets showed ctdGAN's effectiveness in producing high-fidelity samples and improving classification accuracy. AI

IMPACT This research offers a new method for improving the performance of machine learning models on imbalanced tabular datasets.

RANK_REASON The cluster contains a research paper detailing a new model and methodology for tabular data generation. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New GAN model tackles class imbalance in tabular data

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Leonidas Akritidis, Panayiotis Bozanis ·

    A Conditional GAN for Tabular Data Generation with Probabilistic Sampling of Latent Subspaces

    arXiv:2508.00472v2 Announce Type: replace Abstract: The tabular form constitutes the standard way of representing data in relational database systems and spreadsheets. But, similarly to other forms, tabular data suffers from class imbalance, a problem that causes serious performa…