PulseAugur
EN
LIVE 14:54:43

Korean spoken QA research highlights ASR error impact on LLMs

A new research paper analyzes how errors in Korean speech recognition impact the performance of large language models (LLMs) in spoken question answering (SQA). The study found that the degradation caused by speech recognition errors is consistent across different LLMs, suggesting that the information loss at the speech recognition stage is the primary driver of performance decline. The research also identified single-character errors in Korean transcriptions as a unique vulnerability that can alter the intended question and degrade QA accuracy. An auxiliary comparison indicated that large audio language models may offer a more robust solution by directly processing audio input, potentially mitigating issues caused by transcription errors. AI

IMPACT Highlights potential for direct audio input models to improve spoken language understanding in noisy conditions.

RANK_REASON Research paper published on arXiv detailing analysis of ASR-LLM cascades. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Korean spoken QA research highlights ASR error impact on LLMs

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Donghyuk Jung, Youngwon Choi ·

    Analyzing Error Propagation in Korean Spoken QA with ASR-LLM Cascades

    arXiv:2605.17443v2 Announce Type: replace Abstract: We analyze how automatic speech recognition (ASR) errors propagate through ASR-LLM cascades in Korean spoken question answering (SQA), focusing on downstream semantic failures that conventional ASR metrics cannot fully capture. …