Brief · PulseAugur

RESEARCH · Hugging Face Daily Papers English(EN) · 6d · [3 sources]

Live Music Diffusion Models: Efficient Fine-Tuning and Post-Training of Interactive Diffusion Music Generators

Researchers have developed Live Music Diffusion Models (LMDMs), a novel approach to adapt audio diffusion models for real-time, interactive music generation on consumer hardware. These models address inefficiencies in current diffusion pipelines, achieving better computational performance than existing discrete-AR models through block-wise KV caching. LMDMs also introduce ARC-Forcing for stable post-training alignment without RL, enabling applications like text-conditioned generation, sketch-based synthesis, and live artist-AI collaboration. AI

IMPACT Enables interactive AI music generation on consumer hardware, potentially transforming live performance and co-creation.

Live Music Diffusion Models
ARC-Forcing
KV Caching