PulseAugur
实时 11:46:51
English(EN) GaussianEmoTalker: Real-Time Emotional Talking Head Synthesis with Audio-Driven and Blendshape-Based 3D Gaussian Splatting

GaussianEmoTalker 实现实时情感说话人头合成

研究人员开发了GaussianEmoTalker,一种使用3D高斯泼溅进行实时情感说话人头合成的新颖框架。该方法将情感动画视为残差变形问题,解决了生成具有可控情感强度的富有表现力的虚拟形象的挑战。GaussianEmoTalker构建了一个特定身份的中性说话空间,然后预测条件情感残差变形,实现了具有竞争力的视频质量、准确的唇形同步和实时渲染。 AI

影响 这项研究可能为虚拟现实和远程会议等应用带来更具表现力和可控性的虚拟形象。

排序理由 该集群描述了一篇关于新颖AI驱动合成方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

GaussianEmoTalker 实现实时情感说话人头合成

报道来源 [2]

  1. arXiv cs.CV TIER_1 English(EN) · Haijie Yang, Zhenyu Zhang, Yixuan Dong, Jianjun Qian, Jian Yang ·

    GaussianEmoTalker: Real-Time Emotional Talking Head Synthesis with Audio-Driven and Blendshape-Based 3D Gaussian Splatting

    arXiv:2607.00959v1 Announce Type: new Abstract: Audio-driven talking head synthesis has achieved impressive progress in lip synchronization and visual quality, yet generating expressive emotional avatars with controllable intensity remains challenging, especially under real-time …

  2. arXiv cs.CV TIER_1 English(EN) · Jian Yang ·

    GaussianEmoTalker:基于音频驱动和混合形状的3D高斯泼溅的实时情感说话头合成

    Audio-driven talking head synthesis has achieved impressive progress in lip synchronization and visual quality, yet generating expressive emotional avatars with controllable intensity remains challenging, especially under real-time constraints. In this paper, we present GaussianE…