PulseAugur
实时 12:29:24

Researchers enhance elderly ASR with LLM paraphrasing and speech synthesis

Researchers have developed a novel data augmentation technique to improve automatic speech recognition (ASR) for elderly individuals. This method utilizes large language models to paraphrase existing transcripts, generating elderly-contextual variations. These paraphrased texts are then converted into synthetic speech using text-to-speech synthesis with elderly reference speakers. Experiments demonstrated a significant reduction in word error rate, with up to a 58.2% improvement compared to baseline models. AI

影响 Enhances ASR performance for specific demographics, potentially improving accessibility of voice technologies for the elderly.

排序理由 Academic paper detailing a new method for data augmentation in ASR.

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

Researchers enhance elderly ASR with LLM paraphrasing and speech synthesis

报道来源 [1]

  1. arXiv cs.CL TIER_1 English(EN) · Minsik Lee, Seoi Hong, Chongmin Lee, Sieun Choi, Jian Kim, Jua Han, Jihie Kim ·

    Elderly-Contextual Data Augmentation via Speech Synthesis for Elderly ASR

    arXiv:2604.24770v1 Announce Type: new Abstract: Despite recent progress in automatic speech recognition (ASR), elderly ASR (EASR) remains challenging due to limited training data and the distinct acoustic and linguistic characteristics of elderly speech. In this work, we address …