English(EN)Robust Spoofed Speech Detection via Temporal Pyramid Modeling
新研究利用先进的AI模型解决欺骗语音检测问题
作者PulseAugur 编辑部·[5 个来源]·
研究人员正在开发先进的方法来检测欺骗语音,由于逼真的合成和语音转换技术,这是一个日益严峻的挑战。一种方法是时间金字塔适配器(Temporal Pyramid Adapter),它使用具有不同感受野的并行时间卷积来捕获多尺度欺骗线索,并整合XLS-R等自监督表示。另一项研究推出了ArFake,这是第一个多方言阿拉伯语欺骗语音数据集,以解决该领域有限的研究。第三篇论文将自监督语音模型转换为专家混合(Mixture-of-Experts)架构,以增强对未见合成方法的泛化能力和鲁棒性,在错误减少方面显示出显著的相对改进。
AI
arXiv:2606.16837v1 Announce Type: cross Abstract: Spoofed speech detection is increasingly challenged by realistic synthesis, voice conversion, and replay attacks, with cross-dataset generalization remaining a major limitation. This work we propose a Temporal Pyramid Adapter that…
arXiv:2509.22808v2 Announce Type: replace Abstract: With the rise of generative text-to-speech models, distinguishing between real and synthetic speech has become challenging, especially for Arabic that have received limited research attention. Most spoof detection efforts have f…
arXiv:2606.14639v1 Announce Type: cross Abstract: Recent advances in speech generation have significantly improved the naturalness of synthetic speech, making spoofing detection increasingly challenging. A key limitation of current anti-spoofing systems is their limited robustnes…
Recent advances in speech generation have significantly improved the naturalness of synthetic speech, making spoofing detection increasingly challenging. A key limitation of current anti-spoofing systems is their limited robustness to unseen synthesis methods. In this work, we tr…
Spoofed speech detection is increasingly challenged by realistic synthesis, voice conversion, and replay attacks, with cross-dataset generalization remaining a major limitation. This work we propose a Temporal Pyramid Adapter that utilize parallel temporal convolutions with varyi…