PulseAugur
实时 13:20:00
English(EN) Robust Spoofed Speech Detection via Temporal Pyramid Modeling

新研究利用先进的AI模型解决欺骗语音检测问题

研究人员正在开发先进的方法来检测欺骗语音,由于逼真的合成和语音转换技术,这是一个日益严峻的挑战。一种方法是时间金字塔适配器(Temporal Pyramid Adapter),它使用具有不同感受野的并行时间卷积来捕获多尺度欺骗线索,并整合XLS-R等自监督表示。另一项研究推出了ArFake,这是第一个多方言阿拉伯语欺骗语音数据集,以解决该领域有限的研究。第三篇论文将自监督语音模型转换为专家混合(Mixture-of-Experts)架构,以增强对未见合成方法的泛化能力和鲁棒性,在错误减少方面显示出显著的相对改进。 AI

排序理由 多篇在arXiv上发表的研究论文,详细介绍了欺骗语音检测的新方法。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 5 个来源。 我们如何撰写摘要 →

报道来源 [5]

  1. arXiv cs.AI TIER_1 English(EN) · Mahtab Masoudi Nezhad, Nima Karimian ·

    Robust Spoofed Speech Detection via Temporal Pyramid Modeling

    arXiv:2606.16837v1 Announce Type: cross Abstract: Spoofed speech detection is increasingly challenged by realistic synthesis, voice conversion, and replay attacks, with cross-dataset generalization remaining a major limitation. This work we propose a Temporal Pyramid Adapter that…

  2. arXiv cs.CL TIER_1 English(EN) · Mohamed Elsetohy, Alhassan Ehab, Ali Mekky, Besher Hassan, Shady Shehata ·

    ArFake: A Robust Framework for Multi-Dialect Arabic Speech Spoofing Detection Benchmark

    arXiv:2509.22808v2 Announce Type: replace Abstract: With the rise of generative text-to-speech models, distinguishing between real and synthetic speech has become challenging, especially for Arabic that have received limited research attention. Most spoof detection efforts have f…

  3. arXiv cs.AI TIER_1 English(EN) · Hugo Daumain, Driss Matrouf, Khaled Khelif, Mickael Rouvier ·

    From Self-Supervised Speech Models to Mixture-of-Experts for Robust Anti-Spoofing

    arXiv:2606.14639v1 Announce Type: cross Abstract: Recent advances in speech generation have significantly improved the naturalness of synthetic speech, making spoofing detection increasingly challenging. A key limitation of current anti-spoofing systems is their limited robustnes…

  4. arXiv cs.AI TIER_1 English(EN) · Mickael Rouvier ·

    From Self-Supervised Speech Models to Mixture-of-Experts for Robust Anti-Spoofing

    Recent advances in speech generation have significantly improved the naturalness of synthetic speech, making spoofing detection increasingly challenging. A key limitation of current anti-spoofing systems is their limited robustness to unseen synthesis methods. In this work, we tr…

  5. arXiv cs.CV TIER_1 English(EN) · Nima Karimian ·

    Robust Spoofed Speech Detection via Temporal Pyramid Modeling

    Spoofed speech detection is increasingly challenged by realistic synthesis, voice conversion, and replay attacks, with cross-dataset generalization remaining a major limitation. This work we propose a Temporal Pyramid Adapter that utilize parallel temporal convolutions with varyi…