ENTITY LibriSpeech

LibriSpeech

PulseAugur coverage of LibriSpeech — every cluster mentioning LibriSpeech across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

14 over 90d

Releases · 30d

0 over 90d

Papers · 30d

13 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

10 day(s) with sentiment data

RECENT · PAGE 1/1 · 14 TOTAL

TOOL · CL_116438 · Jun 29 · 16:20

User seeks help implementing Calm TTS paper, facing voice cloning issues

A user is seeking assistance with implementing the Calm text-to-speech model described in a research paper. They have encountered difficulties in replicating the model's performance, experiencing issues with generating …
TOOL · CL_111729 · Jun 26 · 04:00

New neural diarization model excels on low-resource Nepali-Hindi speech

Researchers have developed a new approach to speaker diarization, the process of identifying who spoke when in an audio recording, specifically for low-resource languages like Nepali-Hindi. They trained two neural netwo…
TOOL · CL_109048 · Jun 24 · 00:00

Hugging Face launches FFASR Leaderboard for real-world ASR benchmarking

Hugging Face and Treble Technologies have launched the FFASR Leaderboard, an open, community-driven benchmark for evaluating Automatic Speech Recognition (ASR) models in realistic far-field acoustic conditions. This new…
RESEARCH · CL_107814 · Jun 23 · 05:09

New ASR method InterAligner improves training stability and reduces errors

Researchers have developed a new method called InterAligner to improve the training stability and performance of Aligner-Encoder based Automatic Speech Recognition (ASR) models. This approach introduces an intermediate …
TOOL · CL_105156 · Jun 22 · 13:21

New research reveals CTC limitations in speech recognition, highlights linguistic model benefits

A new research paper explores the limitations of Connectionist Temporal Classification (CTC) in speech recognition systems. The study found that CTC's internal scoring methods struggle to improve accuracy beyond basic g…
RESEARCH · CL_98162 · Jun 18 · 04:00

New research tackles ASR challenges with synthetic speech, LLM optimization, and failure reduction

Researchers are developing advanced techniques to improve Automatic Speech Recognition (ASR) systems, particularly for challenging scenarios like code-switching and real-time applications. One paper proposes a code-mixi…
RESEARCH · CL_95869 · Jun 16 · 05:28

New NAR-MBR Decoding Boosts Speech Recognition Speed and Accuracy

Researchers have developed a new non-autoregressive decoding framework for speech recognition, termed NAR-MBR decoding. This method aims to improve the speed of speech recognition by generating output tokens in parallel…
RESEARCH · CL_84432 · Jun 10 · 09:16

Speech models compressed using parameter clustering

Researchers have developed a new method for compressing speech foundation models without requiring additional data or retraining. This approach utilizes channelwise clustering with k-means to achieve parameter compressi…
TOOL · CL_82584 · Jun 10 · 04:00

New model uses continuous space for speech recognition and translation

Researchers have introduced ELF-S2T, a novel approach to speech-to-text systems that operates in a continuous latent space rather than discrete text tokens. This model, built on the Embedded Language Flows (ELF) backbon…
RESEARCH · CL_65569 · Jun 1 · 17:49

New ASR methods tackle compute scaling and multilingual evaluation

Researchers are developing new methods to improve automatic speech recognition (ASR) systems. One approach, LARM, uses a depth-conditioned looped Transformer to allow for adjustable test-time computation, achieving perf…
TOOL · CL_65131 · May 31 · 10:15

Neuromorphic Mamba models boost speech recognition efficiency

Researchers have developed new neuromorphic versions of the Mamba model for more efficient automatic speech recognition (ASR). By incorporating spiking and event-driven neural network techniques, they achieved significa…
TOOL · CL_44843 · May 22 · 04:00

Quantization study enables smaller, more accurate Whisper-small ASR

A new study published on arXiv evaluates various post-training quantization (PTQ) techniques for the Whisper-small automatic speech recognition model. The research, which tested libraries like PyTorch, Optimum-Quanto, H…
TOOL · CL_32709 · May 14 · 06:19

New framework uses calculus to optimize ASR vocabulary size

Researchers have developed a calculus-based framework to determine the optimal vocabulary size for end-to-end Automatic Speech Recognition (ASR) systems. Unlike traditional hybrid ASR, end-to-end systems derive their vo…
RESEARCH · CL_09815 · Apr 29 · 10:28

New research explores text-only data for faster encoder-dominated speech recognition models

This paper introduces novel methods for enhancing speech recognition models by leveraging text-only data. The research focuses on encoder-dominated architectures, demonstrating that a larger encoder paired with a smalle…

User seeks help implementing Calm TTS paper, facing voice cloning issues

New neural diarization model excels on low-resource Nepali-Hindi speech

Hugging Face launches FFASR Leaderboard for real-world ASR benchmarking

New ASR method InterAligner improves training stability and reduces errors

New research reveals CTC limitations in speech recognition, highlights linguistic model benefits

New research tackles ASR challenges with synthetic speech, LLM optimization, and failure reduction

New NAR-MBR Decoding Boosts Speech Recognition Speed and Accuracy

Speech models compressed using parameter clustering

New model uses continuous space for speech recognition and translation

New ASR methods tackle compute scaling and multilingual evaluation

Neuromorphic Mamba models boost speech recognition efficiency

Quantization study enables smaller, more accurate Whisper-small ASR

New framework uses calculus to optimize ASR vocabulary size

New research explores text-only data for faster encoder-dominated speech recognition models