Gladia open-sources normalization library to improve STT evaluation accuracy

By PulseAugur Editorial · [1 sources] · 2026-04-23 15:49

A new open-source library, gladia-normalization, has been released to address inconsistencies in evaluating speech-to-text (STT) models. The library standardizes transcripts before calculating Word Error Rate (WER), preventing formatting differences from being incorrectly flagged as errors. This tool offers configurable normalization pipelines defined in YAML, ensuring deterministic and version-controllable evaluation processes. AI

IMPACT Standardizes STT evaluation, improving accuracy and comparability of speech recognition model performance.

RANK_REASON Release of an open-source library for a specific task in AI model evaluation.

Read on r/MachineLearning →

gladia-normalization

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Gladia open-sources normalization library to improve STT evaluation accuracy

COVERAGE [1]

r/MachineLearning TIER_1 English(EN) · /u/Karamouche · 2026-04-23 15:49

Built a normalizer so WER stops penalizing formatting differences in STT evals! [P]

<div class="md"><p>Hey guys! At my company, we've been benchmarking STT engines a lot and kept running into the same issue: WER is penalizing formatting differences that have nothing to do with actual recognition quality. "It's $50" vs "it is fifty d…

COVERAGE [1]

Built a normalizer so WER stops penalizing formatting differences in STT evals! [P]

RELATED ENTITIES

RELATED TOPICS