English(EN) Mamba-Enhanced Implicit Motion Learning for Audio-Driven Portrait Animation

新的Mamba增强模型可根据图像和音频生成逼真动画

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-02 09:43

研究人员开发了一个新的框架，可以根据单个图像和音频输入生成逼真的人物动画。该方法采用两阶段流程，首先通过整合外观先验和深度线索来建模潜在运动特征，然后采用Mamba增强的扩散模型从音频和源图像预测这些特征。该方法在一个大型数据集上进行了训练，据报道在诸如说话人头合成等应用中的准确性、自然度和时间连贯性方面设定了新的最先进水平。 AI

影响这项研究推动了AI在根据有限输入生成逼真人物动画方面的能力，可能对虚拟化身和内容创作等领域产生影响。

排序理由该集群包含一篇详细介绍AI驱动动画新方法的论文。

在 arXiv cs.CV 阅读 →

arXiv
Mamba

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CV TIER_1 English(EN) · Xuan Wei, Jiahui Chen, Kaiheng Li, Mingyu Shao, Qingqi Hong · 2026-06-03 04:00

Mamba增强的隐式运动学习用于音频驱动的肖像动画

arXiv:2606.03402v1 Announce Type: new Abstract: Audio-driven human motion video generation aims to synthesize realistic and temporally coherent human animations from a single static image, with applications in talking-head synthesis, co-speech gesture generation, and dynamic pres…
arXiv cs.CV TIER_1 English(EN) · Qingqi Hong · 2026-06-02 09:43

Mamba增强的隐式运动学习用于音频驱动的肖像动画

Audio-driven human motion video generation aims to synthesize realistic and temporally coherent human animations from a single static image, with applications in talking-head synthesis, co-speech gesture generation, and dynamic presentations. Moving beyond conventional keypoint-b…

报道来源 [2]

Mamba增强的隐式运动学习用于音频驱动的肖像动画

Mamba增强的隐式运动学习用于音频驱动的肖像动画

相关实体

相关话题