PulseAugur
实时 12:47:59
English(EN) Systematic Study of Dysarthric Speech Recognition: Spectral Features and Acoustic Models

新研究提升构音障碍者语音识别能力

两篇新研究论文探讨了改善构音障碍者自动语音识别(ASR)的方法。构音障碍是一种常由神经系统疾病引起的言语障碍。第一篇论文系统研究了频谱特征和声学模型,发现结合音高特征和使用因子化时延神经网络(F-TDNN)模型可以在单词和句子识别方面带来显著的相对改进。第二篇论文侧重于数据增强技术,特别是语速修改(SRM)和音高修改(PM),并将其应用于Wav2Vec2模型,证明这些方法可以有效提高不同严重程度构音障碍者的ASR性能。 AI

影响 这些进展可能显著改善言语障碍者的沟通工具和可及性。

排序理由 两篇在arXiv上发表的学术论文,详细介绍了改善构音障碍语音识别的新方法。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

新研究提升构音障碍者语音识别能力

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Paban Sapkota, Hemant Kumar Kathania, Mikko Kurimo, Sudarsana Reddy Kadiri, Shrikanth Narayanan ·

    Systematic Study of Dysarthric Speech Recognition: Spectral Features and Acoustic Models

    arXiv:2606.19793v1 Announce Type: cross Abstract: The challenge associated with recognizing dysarthric speech primarily arises from pronounced acoustic variability attributed to impaired articulatory precision. Past research has demonstrated improved recognition through the use o…

  2. arXiv cs.AI TIER_1 English(EN) · Paban Sapkota, Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Shrikanth Narayanan ·

    Improving End-to-End Speech Recognition for Dysarthric Speech through In-Domain Data Augmentation

    arXiv:2606.19797v1 Announce Type: cross Abstract: Dysarthric speech recognition is crucial for facilitating effective communication among individuals with dysarthria. However, accurately recognizing dysarthric speech poses significant challenges due to varying severity levels and…