新MoDiCoL数据集旨在应对真实世界变化下的ASR鲁棒性

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-12 13:50

研究人员推出MoDiCoL，一个旨在提高自动语音识别（ASR）系统鲁棒性的新数据集。与现有数据集孤立地考虑口音或噪声等因素不同，MoDiCoL允许对语言内容、说话人特征和声学环境进行受控分析，包括它们的共现。该数据集配有持续学习课程，以模拟真实世界的增量更新，并研究ASR模型在不断变化的情况下如何获取、迁移和遗忘鲁棒性。 AI

影响该数据集旨在通过解决分布偏移来弥合ASR在基准测试和实际应用之间的性能差距。

排序理由该集群包含一篇学术论文，详细介绍了用于ASR研究的新数据集和方法。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Theresa Pekarek Rosin, Matthias Kerzel, Stefan Wermter · 2026-06-15 04:00

MoDiCoL: A Modular Diagnostic Continual Learning Dataset for Robust Speech Recognition

arXiv:2606.14459v1 Announce Type: cross Abstract: Modern Automatic Speech Recognition (ASR) systems have made remarkable progress on standard benchmarks, yet performance gaps have emerged under real-world distribution shifts, caused by recording conditions, accents, speech impair…
arXiv cs.AI TIER_1 English(EN) · Stefan Wermter · 2026-06-12 13:50

MoDiCoL：用于鲁棒语音识别的模块化诊断持续学习数据集

Modern Automatic Speech Recognition (ASR) systems have made remarkable progress on standard benchmarks, yet performance gaps have emerged under real-world distribution shifts, caused by recording conditions, accents, speech impairments, and noise. Existing datasets and benchmarks…

报道来源 [2]

MoDiCoL: A Modular Diagnostic Continual Learning Dataset for Robust Speech Recognition

MoDiCoL：用于鲁棒语音识别的模块化诊断持续学习数据集

相关实体

相关话题