Brief

last 24h

[3/3] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.LG English(EN) · 7h

Benchmarking Instance-Dependent Label Noise with Controlled Corruptions

Researchers have developed a new framework called CILN for generating synthetic instance-dependent label noise (IDN) benchmarks. Unlike previous methods that implicitly generated noise, CILN uses controlled input corruptions and a diverse voter pool to create benchmarks where the source and severity of ambiguity are explicit. This approach, tested on CIFAR10, MNIST, and Adult datasets, generates benchmarks that exhibit genuine instance-dependent noise and can reveal failure modes in existing noisy-label learning methods like Co-Teaching and DivideMix. AI
RESEARCH · arXiv cs.LG English(EN) · 3w · [3 sources]

PATE-TabTransGAN: Differentially Private Synthetic Tabular Data Generation via Transformer-Based Student Discrimination

Researchers have developed PATE-TabTransGAN, a novel framework for generating synthetic tabular data that adheres to formal differential privacy guarantees. This method combines the Private Aggregation of Teacher Ensembles (PATE) mechanism with a Transformer-based student discriminator to effectively model inter-feature dependencies while ensuring strong privacy. Experiments on four benchmark datasets demonstrated that PATE-TabTransGAN achieves competitive or superior performance in terms of AUROC and AUCPR compared to existing state-of-the-art differentially private synthesis methods. AI

IMPACT This research advances the state-of-the-art in privacy-preserving synthetic data generation, potentially enabling more secure use of sensitive tabular datasets.
TOOL · arXiv cs.AI English(EN) · 2w

TIMEGATE: Sustainable Time-Boxed Promotion Gates for Continual ML Adaptation Under Resource Constraints

Researchers have developed TIMEGATE, a novel policy layer designed to manage the continuous adaptation of machine learning systems while minimizing resource consumption. This system budgets time, labeling, training, and evaluation, emitting a metric-availability signal (M) to guide decisions between partial and full evaluations. Experiments show TIMEGATE can achieve significant computational and energy savings, with a 10% slice evaluation on LLaMA using 89% less wall-clock time and energy on an H200 GPU, without compromising accuracy or leading to silent mis-promotions. AI

IMPACT Offers a framework to reduce the computational and energy costs associated with continuous ML model updates.
- LLaMA-3.1-8B
- QLoRA
- H200
- Abhijit Chakraborty
- Adult

Brief

Benchmarking Instance-Dependent Label Noise with Controlled Corruptions

PATE-TabTransGAN: Differentially Private Synthetic Tabular Data Generation via Transformer-Based Student Discrimination

TIMEGATE: Sustainable Time-Boxed Promotion Gates for Continual ML Adaptation Under Resource Constraints