English(EN) The Impact of VAE Design on Latent Pose Representations for Diffusion-based Sign Language Production

VAE设计对生成手语模型至关重要

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-22 07:38

研究人员探讨了变分自编码器（VAE）的设计如何影响基于扩散模型的手语生成中的潜在姿态表示。他们发现，VAE中的架构和训练目标选择显著影响潜在空间的结构。这种影响反过来会影响下游文本到手语生成模型的性能，有时比单独的传统VAE重建精度更重要，这在Phoenix14T数据集上得到了证明。 AI

影响研究了VAE设计选择如何影响潜在空间结构，进而影响文本到手语生成性能。

排序理由该集群包含一篇详细介绍模型架构和数据集研究结果的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

Hugging Face Daily Papers TIER_1 English(EN) · 2026-06-22 07:38

VAE设计对基于扩散模型的手语生成中的潜在姿态表示的影响

Latent diffusion approaches to sign language production (SLP) rely on an initial stage that learns an encoding of sign pose sequences, enabling generative modeling in the resulting latent space. The autoencoder used in this stage is typically evaluated in terms of reconstruction …