ECAPA-TDNN
PulseAugur coverage of ECAPA-TDNN — every cluster mentioning ECAPA-TDNN across labs, papers, and developer communities, ranked by signal.
1 天有情绪数据
-
New method offers adaptive control over deep neural network sparsity
Researchers have developed an adaptive regularization method to better control sparsity in deep neural networks, addressing the challenge where traditional $\ell_1$ penalties indirectly influence sparsity rates. This ne…
-
Researchers develop new spoken language ID method using pre-trained models and margin loss
Researchers have developed a new method for spoken language identification using pre-trained models and margin-based losses. This approach enhances the ability of language representations to distinguish between language…
-
LASE模型通过使嵌入信息语言无关来改进跨脚本语音克隆
研究人员开发了LASE(语言对抗说话人编码器),以改进多语言语音克隆。标准的编码器在不同脚本之间保持说话人身份时会遇到困难,特别是在将非印度语语音映射到印度语时。LASE采用了一种新颖的训练方法,结合了监督对比损失和梯度反转交叉熵目标,以创建语言信息无关但说话人信息相关的嵌入。该方法显著减小了跨脚本的身份差距,并以显著减少的训练数据增强了跨脚本说话人召回率。