Brief

last 24h

[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

RESEARCH · arXiv cs.AI English(EN) · 3d · [2 sources]

MoDiCoL: A Modular Diagnostic Continual Learning Dataset for Robust Speech Recognition

Researchers have introduced MoDiCoL, a new dataset designed to improve the robustness of Automatic Speech Recognition (ASR) systems. Unlike existing datasets that isolate factors like accents or noise, MoDiCoL allows for the controlled analysis of linguistic content, speaker characteristics, and acoustic environments, including their co-occurrence. The dataset is paired with a continual learning curriculum to simulate real-world incremental updates and study how ASR models acquire, transfer, and forget robustness under evolving conditions. AI

IMPACT This dataset aims to bridge the gap between ASR performance on benchmarks and real-world applications by addressing distribution shifts.
- MoDiCoL
- Theresa Pekarek Rosin
RESEARCH · arXiv cs.AI English(EN) · 4d · [2 sources]

Learning to Hear Hesitation: Continual Learning for Disfluency-Aware ASR

Researchers have developed a new approach to improve Automatic Speech Recognition (ASR) systems by incorporating continual learning techniques to better handle disfluent speech. The method involves introducing explicit disfluency tokens into pre-trained ASR models and then fine-tuning them on diverse datasets. This process aims to prevent catastrophic forgetting of general knowledge while enhancing the model's ability to recognize and process speech disfluencies, addressing a key challenge in current ASR technology. AI

IMPACT This research could lead to more robust ASR systems capable of handling natural, unscripted speech, improving user experience in voice-enabled applications.
- arXiv
- Theresa Pekarek Rosin

Brief

MoDiCoL: A Modular Diagnostic Continual Learning Dataset for Robust Speech Recognition

Learning to Hear Hesitation: Continual Learning for Disfluency-Aware ASR