PulseAugur

New benchmark evaluates Chinese news TTS pronunciation accuracy

Researchers have introduced CN-NewsTTS Bench, a benchmark that evaluates how accurately Chinese news text-to-speech (TTS) systems pronounce raw input text. It targets dense written forms that are common in news content and prone to mispronunciation, such as scores, hyphenated model names, and mixed Chinese-Latin-digit expressions. The benchmark includes development and test sets, auto-evaluable targets, and transcripts from an automatic speech recognition (ASR) ensemble. In initial results, the best-performing system reaches 0.879 accuracy, while the others fall below 0.60.
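To make the idea of target-level accuracy over ASR transcripts concrete, here is a minimal Python sketch. It is not the benchmark's actual scoring method, which the summary does not describe: the substring match, the strict-majority vote across transcripts, and the names `target_hit` and `target_accuracy` are all assumptions made for illustration, and the sample transcripts are invented.

```python
from __future__ import annotations


def target_hit(expected_reading: str, transcripts: list[str]) -> bool:
    """Return True if a strict majority of ASR transcripts contain the expected reading.

    Assumption: a target is scored by substring match plus majority vote.
    The benchmark's real rule is not stated in the summary.
    """
    votes = sum(expected_reading in t for t in transcripts)
    return votes * 2 > len(transcripts)


def target_accuracy(items: list[tuple[str, list[str]]]) -> float:
    """Fraction of targets judged correct; 0.0 for an empty list."""
    if not items:
        return 0.0
    hits = sum(target_hit(expected, transcripts) for expected, transcripts in items)
    return hits / len(items)


# Invented example: each item pairs an expected spoken form with the
# transcripts that three hypothetical ASR systems produced for the TTS audio.
items = [
    # Written "15%" should be read as "百分之十五"; two of three transcripts agree.
    ("百分之十五", ["增长百分之十五", "增长百分之十五", "增长十五百分"]),
    # Written score "3-1" should be read as "三比一"; only one transcript agrees.
    ("三比一", ["以三杠一获胜", "以三比一获胜", "以三减一获胜"]),
]

accuracy = target_accuracy(items)
print(accuracy)
```

Under these assumptions the first target counts as correct and the second does not, so the sketch reports an accuracy of 0.5 for this two-item sample.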

IMPACT The benchmark gives developers a way to measure pronunciation accuracy in Chinese news TTS, which could lead to better voice assistants and audio content generation.

RANK_REASON The cluster describes a new benchmark for evaluating TTS systems, which falls under research.

AI-generated summary · Google Gemini · from 3 sources.

COVERAGE [3]

  1. arXiv cs.CL TIER_1 English(EN) · Shijun Luo

    CN-NewsTTS Bench: a target-level automatic benchmark for raw-input Chinese news TTS pronunciation

    arXiv:2606.24714v1. Abstract: Chinese news text contains dense written forms such as scores, hyphenated model names, ranges, unit symbols, percentages, English abbreviations, and mixed Chinese-Latin-digit names. These forms are frequent in real listening workflo…

  2. arXiv cs.CL TIER_1 English(EN) · Shijun Luo

    CN-NewsTTS Bench: a target-level automatic benchmark for raw-input Chinese news TTS pronunciation

    Chinese news text contains dense written forms such as scores, hyphenated model names, ranges, unit symbols, percentages, English abbreviations, and mixed Chinese-Latin-digit names. These forms are frequent in real listening workflows, and a text-to-speech (TTS) system can preser…

  3. Hugging Face Daily Papers TIER_1 English(EN)

    CN-NewsTTS Bench: a target-level automatic benchmark for raw-input Chinese news TTS pronunciation

    Chinese news text contains dense written forms such as scores, hyphenated model names, ranges, unit symbols, percentages, English abbreviations, and mixed Chinese-Latin-digit names. These forms are frequent in real listening workflows, and a text-to-speech (TTS) system can preser…