Researchers have developed a novel data augmentation technique to improve automatic speech recognition (ASR) for elderly individuals. This method utilizes large language models to paraphrase existing transcripts, generating elderly-contextual variations. These paraphrased texts are then converted into synthetic speech using text-to-speech synthesis with elderly reference speakers. Experiments demonstrated a significant reduction in word error rate, with up to a 58.2% improvement compared to baseline models. AI
影响 Enhances ASR performance for specific demographics, potentially improving accessibility of voice technologies for the elderly.
排序理由 Academic paper detailing a new method for data augmentation in ASR.
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →