New PUMA method accelerates masked diffusion model training

By PulseAugur Editorial · [1 sources] · 2026-06-05 04:00

Researchers have introduced Progressive UnMAsking (PUMA), a novel method to accelerate the training of Masked Diffusion Models (MDMs). PUMA aligns the masking patterns used during training with those employed during inference, thereby focusing optimization on more relevant masks. This approach has demonstrated a significant speed-up in pretraining, achieving approximately a 2.5x faster training time at the 125 million parameter scale. AI

IMPACT Accelerates training for diffusion models, potentially enabling faster iteration and development of generative AI.

RANK_REASON The cluster contains an academic paper detailing a new method for training diffusion models. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.LG TIER_1 English(EN) · Jaeyeon Kim, Jonathan Geuter, David Alvarez-Melis, Sham Kakade, Sitan Chen · 2026-06-05 04:00

Stop Training for the Worst: Progressive Unmasking Accelerates Masked Diffusion Training

arXiv:2602.10314v2 Announce Type: replace Abstract: Masked Diffusion Models (MDMs) have emerged as a promising approach for generative modeling in discrete spaces. By generating sequences in any order and allowing for parallel decoding, they enable fast inference and strong perfo…

COVERAGE [1]

Stop Training for the Worst: Progressive Unmasking Accelerates Masked Diffusion Training

RELATED ENTITIES

RELATED TOPICS