tool · [1 source] · 2026-05-22 04:00

ASR systems benchmarked on code-switching speech

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

A new benchmark study evaluated five commercial automatic speech recognition (ASR) systems on code-switching speech, specifically focusing on Arabic, Persian, and German mixed with English. The research introduced a novel pipeline using GPT-4o and Gemini 1.5 Pro to score transcripts, reducing LLM costs by 91% and employing BERTScore as a more reliable metric than traditional Word Error Rate (WER) for certain language pairs. ElevenLabs Scribe v2 emerged as the top performer, achieving the lowest WER and highest BERTScore across all tested language pairs. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT This research highlights the challenges in ASR for code-switching and introduces a more robust evaluation method, potentially guiding future development of multilingual speech technologies.

RANK_REASON The cluster contains an academic paper detailing a new benchmark and evaluation methodology for ASR systems. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

paper
other

COVERAGE [1]

arXiv cs.CL TIER_1 · Sajjad Abdoli, Ghassan Al-Sumaidaee, Clayton W. Taylor, Ahmad ElShiekh, Ahmed Rashad · 2026-05-22 04:00

Benchmarking Commercial ASR Systems on Code-Switching Speech: Arabic, Persian, and German

arXiv:2605.19069v2 Announce Type: replace Abstract: Code-switching -- the natural alternation between two languages within a single utterance -- represents one of the most challenging and under-studied conditions for automatic speech recognition (ASR). Existing commercial ASR ben…

COVERAGE [1]

Benchmarking Commercial ASR Systems on Code-Switching Speech: Arabic, Persian, and German

RELATED ENTITIES

RELATED TOPICS