Brief

last 24h

[2/2] 223 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.AI English(EN) · 7h

Training-Free Intelligibility-Guided Observation Addition for Noisy ASR

Researchers have developed a new method for improving automatic speech recognition (ASR) in noisy environments. This technique, called intelligibility-guided observation addition (OA), fuses noisy speech with enhanced speech to boost recognition accuracy. Unlike previous methods that required training, this new approach is training-free, deriving fusion weights directly from the ASR system's intelligibility estimates, which enhances its generalization capabilities. AI

IMPACT This new training-free method could improve the robustness of speech recognition systems in real-world noisy conditions.
- arXiv
- Haoyang Li
TOOL · arXiv cs.AI English(EN) · 7h

GenTSE: Enhancing Target Speaker Extraction via a Coarse-to-Fine Generative Language Model

Researchers have developed GenTSE, a novel two-stage generative language model designed to enhance target speaker extraction (TSE). This model first predicts coarse semantic tokens and then refines them into fine acoustic tokens, a separation that improves accuracy and speech quality. GenTSE utilizes continuous embeddings and a Frozen-LM Conditioning training strategy to mitigate exposure bias, outperforming previous language model-based systems in experiments. AI

IMPACT Introduces a new method for improving speech processing tasks like speaker extraction.
- Haoyang Li