Masked Diffusion Models
PulseAugur coverage of Masked Diffusion Models — every cluster mentioning Masked Diffusion Models across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
New PUMA method accelerates masked diffusion model training
Researchers have introduced Progressive UnMAsking (PUMA), a novel method to accelerate the training of Masked Diffusion Models (MDMs). PUMA aligns the masking patterns used during training with those employed during inf…
-
BlockGen model explores blockwise sequence generation with hybrid samplers
Researchers have introduced BlockGen, a novel blockwise sequence modeling approach that utilizes hybrid samplers for discrete diffusion. This method explores the effectiveness of uniform-state diffusion models (USDMs) c…
-
New research tackles diffusion language model limitations
Researchers are exploring new methods to improve diffusion language models (DLMs), which offer faster inference than autoregressive models. Several recent papers introduce techniques to enhance DLM performance, includin…
-
New LLM training methods boost efficiency and error recovery
Researchers have developed new techniques for improving the efficiency of training large language models (LLMs). One method, Step Rejection Fine-Tuning (SRFT), leverages unsuccessful training trajectories by assessing t…