PulseAugur

Automatic Speech Recognition

PulseAugur coverage of Automatic Speech Recognition: every cluster that mentions it across labs, papers, and developer communities, ranked by signal.

Total · 30d: 15 (15 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 10 (10 over 90d)
SENTIMENT · 30D: 6 days with sentiment data

RECENT · PAGE 1/1 · 15 TOTAL
  1. TOOL · CL_78238

    ASR fine-tuned for Indian banking calls after 3-week effort

    This article details the process of fine-tuning an Automatic Speech Recognition (ASR) system specifically for the unique challenges of Indian banking calls. The author spent three weeks experimenting with multiple model…

  2. RESEARCH · CL_68139

    LLMs generate synthetic conversations to boost ASR training

    Researchers have developed a novel method to enhance Automatic Speech Recognition (ASR) training for low-resource languages by generating synthetic conversational data. This pipeline uses LLMs to create dialogues, maps …

  3. RESEARCH · CL_65569

    New ASR methods tackle compute scaling and multilingual evaluation

    Researchers are developing new methods to improve automatic speech recognition (ASR) systems. One approach, LARM, uses a depth-conditioned looped Transformer to allow for adjustable test-time computation, achieving perf…

  4. TOOL · CL_54716

    Noisekit CLI generates realistic degraded audio for ASR benchmarking

    A new command-line tool called noisekit has been released to help benchmark automatic speech recognition (ASR) systems. It generates realistic degraded audio datasets by applying various noise and distortion conditions …

  5. TOOL · CL_51864

    Intel NPU accelerates smart home ASR, outperforming CPU on speed and energy

    A user has successfully utilized their Intel Arrow Lake NPU for Automatic Speech Recognition (ASR) in a smart home setup, achieving significant performance gains. The NPU processed a 10-second audio clip 4.8 times faste…

  6. COMMENTARY · CL_47605

    AI voice assistants in 2026 offer advanced capabilities for personal and business use

    AI voice assistants in 2026 are significantly more advanced, leveraging LLMs, ASR, ML, and NLP to understand natural speech, learn continuously, and personalize responses. These assistants are categorized into personal …

  7. TOOL · CL_32731

    New neural layer nASR enhances EEG artifact removal for BCIs

    Researchers have developed nASR, a novel trainable neural layer designed to improve Electroencephalogram (EEG) signal processing for Brain-Computer Interfaces (BCIs). This new layer addresses limitations in existing Art…

  8. COMMENTARY · CL_23142

    Voice AI paradox: Advanced chat, basic failures

    Voice AI assistants like Yandex's Alisa exhibit a paradox of advanced conversational abilities alongside basic functional failures, stemming from their complex architecture. This hybrid system combines speech recognitio…

  9. RESEARCH · CL_13577

    Sakana AI's KAME architecture injects LLM knowledge into speech AI without latency

    Sakana AI has developed KAME, a novel tandem architecture for speech-to-speech AI that aims to combine the speed of direct systems with the knowledge depth of LLM-based approaches. KAME operates with two asynchronous co…

  10. RESEARCH · CL_09296

    Tamazight single-speaker speech dataset released on Hugging Face

    A new single-speaker speech dataset for the Tamazight language has been released on Hugging Face and the Mozilla Data Collective. This dataset is intended for use in AI applications such as automatic speech recognition …

  11. RESEARCH · CL_08610

    Researchers enhance elderly ASR with LLM paraphrasing and speech synthesis

    Researchers have developed a novel data augmentation technique to improve automatic speech recognition (ASR) for elderly individuals. This method utilizes large language models to paraphrase existing transcripts, genera…

  12. RESEARCH · CL_11761

    New LLMs unify audio and language processing for full-duplex and medical applications

    Researchers have developed UAF, a novel unified audio front-end LLM designed for full-duplex speech interaction. This model integrates diverse audio front-end tasks like voice activity detection and turn-taking into a s…

  13. RESEARCH · CL_06703

    MedSpeak framework improves medical QA by correcting ASR errors with knowledge graphs

    Researchers have developed MedSpeak, a new framework designed to improve the accuracy of spoken question-answering systems in the medical domain. This system utilizes a medical knowledge graph to aid automatic speech re…

  14. RESEARCH · CL_04968

    New framework identifies demographic unfairness in speech recognition models

    A new research paper identifies two types of errors—random variance and systematic bias—that contribute to demographic unfairness in speech recognition models. The study found that while both error types are present, ra…

  15. RESEARCH · CL_02996

    "This Wasn't Made for Me": ASR Bias Hurts Users Emotionally and Cognitively

    A new research paper highlights the emotional and psychological toll of bias in Automatic Speech Recognition (ASR) systems. The study, which involved user experience research in four U.S. locations, found that participa…