Researchers have developed AdaCluster, a new framework designed to significantly speed up video diffusion transformers (DiTs). This method addresses the slow inference times caused by the quadratic complexity of attention mechanisms in these models. AdaCluster employs adaptive clustering techniques for both query and key vectors to compress attention, achieving up to a 4.31x speedup on various video generation models with minimal loss in quality. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
RANK_REASON Academic paper detailing a new method for improving AI model efficiency.