PulseAugur
Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

New Causal-rCM recipe accelerates autoregressive video diffusion

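Researchers have introduced Causal-rCM, a new open recipe for distilling autoregressive video diffusion models. The framework unifies the teacher-forcing and self-forcing paradigms to improve streaming video generation and interactive world models. Using continuous-time consistency models and a custom FlashAttention-2 kernel, Causal-rCM converges 10x faster than prior methods. The method demonstrates state-of-the-art video generation performance: a distilled 2-step causal Wan2.1-1.3B model scores 84.63 on the VBench-T2V benchmark while using a minimal number of sampling steps.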

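Impact: The framework could significantly improve the efficiency and performance of real-time video generation and interactive AI systems.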

Ranking rationale: This cluster describes a recent research paper on a new algorithm and framework for video generation.

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 3 sources.

Sources [3]

  1. arXiv cs.LG TIER_1 English(EN) · Kaiwen Zheng, Guande He, Min Zhao, Jintao Zhang, Huayu Chen, Jianfei Chen, Chen-Hsuan Lin, Ming-Yu Liu, Jun Zhu, Qianli Ma ·

    Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

    arXiv:2606.25473v1 Announce Type: cross Abstract: Autoregressive video diffusion with causal diffusion transformers has emerged as a major paradigm for real-time streaming video generation and action-conditioned interactive world models. In this work, we extend rCM, an advanced d…

  2. arXiv cs.LG TIER_1 English(EN) · Qianli Ma ·

    Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

    Autoregressive video diffusion with causal diffusion transformers has emerged as a major paradigm for real-time streaming video generation and action-conditioned interactive world models. In this work, we extend rCM, an advanced diffusion distillation framework, to autoregressive…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

    Autoregressive video diffusion extends diffusion distillation frameworks to real-time streaming generation through causal training paradigms, achieving state-of-the-art performance with fast convergence and interactive world modeling capabilities.