PulseAugur
实时 14:58:06
English(EN) Learning to Hear Hesitation: Continual Learning for Disfluency-Aware ASR

新的语音识别方法通过持续学习处理语音不流畅

研究人员开发了一种新的方法来改进自动语音识别(ASR)系统,通过整合持续学习技术来更好地处理语音不流畅。该方法包括将明确的不流畅标记引入预训练的ASR模型,然后在多样化的数据集上进行微调。此过程旨在防止通用知识的灾难性遗忘,同时增强模型识别和处理语音不流畅的能力,从而解决当前ASR技术的一个关键挑战。 AI

影响 这项研究可能带来更强大的ASR系统,能够处理自然、非脚本化的语音,改善语音启用应用程序的用户体验。

排序理由 该集群包含一篇详细介绍ASR新研究方法的学术论文。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Henri-Leon Kordt, Theresa Pekarek Rosin, Jae Hee Lee, Stefan Wermter ·

    Learning to Hear Hesitation: Continual Learning for Disfluency-Aware ASR

    arXiv:2606.14391v1 Announce Type: cross Abstract: Despite advances in large-scale Automatic Speech Recognition (ASR), disfluent speech remains challenging, as state-of-the-art systems are often optimized to omit disfluencies, leading to information loss and hallucinations. Prior …

  2. arXiv cs.AI TIER_1 English(EN) · Stefan Wermter ·

    学习识别犹豫:面向含不流畅语音识别的持续学习

    Despite advances in large-scale Automatic Speech Recognition (ASR), disfluent speech remains challenging, as state-of-the-art systems are often optimized to omit disfluencies, leading to information loss and hallucinations. Prior work has focused on verbatim transcription and the…