Lip Forcing: Few-Step Autoregressive Diffusion for Real-time Lip Synchronization
Researchers have developed "Lip Forcing," a novel autoregressive diffusion method for real-time video-to-video lip synchronization. This technique distills a large 14B parameter model into smaller, faster student models that can generate synchronized lip movements in just two denoising steps. The 1.3B parameter student model achieves real-time performance at 31 FPS, significantly outperforming previous diffusion models in speed while maintaining visual quality. AI
IMPACT Enables real-time, high-quality lip synchronization for video applications, potentially impacting content creation and virtual communication.