Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

Mutual Forcing framework delivers fast, synchronized audio-video generation

By the PulseAugur editorial team · [3 sources] · 2026-04-28 16:28

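Researchers have introduced Mutual Forcing, a framework for efficient audio-video character generation. The method combines a two-stage training strategy with a dual-mode generation process to address two challenges: joint audio-video modeling and fast autoregressive output. Unlike prior approaches, Mutual Forcing lets a single weight-shared model perform both few-step and multi-step generation, which enables self-distillation and improves training-inference consistency without a separate teacher model. Experiments show that Mutual Forcing matches or exceeds the baselines while using significantly fewer sampling steps, with notable gains in both speed and quality.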
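The idea of one set of weights serving both a few-step and a multi-step sampler can be illustrated with a toy sketch. Everything here is a hypothetical stand-in rather than the paper's implementation: `velocity` is a one-parameter placeholder for the shared model, `sample` is a plain Euler integrator, and the step counts 4 and 50 are arbitrary illustration values. The sketch only shows that the multi-step output of the same weights can act as a target for the few-step output, so no separate teacher is needed.

```python
def velocity(x, t, w):
    """Toy shared-weight 'model' (hypothetical): velocity field v = w * x.

    The time argument t is accepted for interface realism but unused here.
    """
    return w * x


def sample(x0, w, num_steps):
    """Integrate dx/dt = velocity(x, t, w) from t=0 to t=1 with Euler steps."""
    x = x0
    dt = 1.0 / num_steps
    for i in range(num_steps):
        x = x + dt * velocity(x, i * dt, w)
    return x


def self_distill_gap(x0, w, few_steps=4, many_steps=50):
    """Squared gap between few-step and multi-step outputs of the SAME weights.

    In a real training loop the multi-step output would be treated as a fixed
    target and the gap minimized with respect to the weights (assumption).
    """
    student = sample(x0, w, few_steps)   # fast mode
    teacher = sample(x0, w, many_steps)  # slow mode, same weights
    return (student - teacher) ** 2


if __name__ == "__main__":
    for k in (2, 4, 8, 25):
        print(k, self_distill_gap(1.0, -1.0, few_steps=k))
```

Running the script prints a gap that shrinks as the few-step budget approaches the multi-step one, which is the quantity a self-distillation objective of this kind would try to drive down at a fixed small budget.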

Impact: Introduces a more efficient approach to audio-video generation that could speed up content creation workflows.

Ranking rationale: This is a research paper describing a new framework for audio-video generation.


AI-generated summary · Google Gemini · from 3 sources.

Sources [3]

Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-28 16:28

Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

In this work, we propose Mutual Forcing, a framework for fast autoregressive audio-video generation with long-horizon audio-video synchronization. Our approach addresses two key challenges: joint audio-video modeling and fast autoregressive generation. To ease joint audio-video o…

arXiv cs.CV TIER_1 English(EN) · Yupeng Zhou, Lianghua Huang, Zhifan Wu, Jiabao Wang, Yupeng Shi, Biao Jiang, Daquan Zhou, Yu Liu, Ming-Ming Cheng, Qibin Hou · 2026-04-29 04:00

Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

arXiv:2604.25819v1 Announce Type: new Abstract: In this work, we propose Mutual Forcing, a framework for fast autoregressive audio-video generation with long-horizon audio-video synchronization. Our approach addresses two key challenges: joint audio-video modeling and fast autore…

arXiv cs.CV TIER_1 English(EN) · Qibin Hou · 2026-04-28 16:28

Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

In this work, we propose Mutual Forcing, a framework for fast autoregressive audio-video generation with long-horizon audio-video synchronization. Our approach addresses two key challenges: joint audio-video modeling and fast autoregressive generation. To ease joint audio-video o…
