PulseAugur

New HATS dataset integrates human perception for ASR evaluation

Researchers have introduced HATS, a new French dataset designed to bring human perception into the evaluation of Automatic Speech Recognition (ASR) systems. To build it, 143 participants were shown pairs of transcriptions produced by different ASR systems and asked to select the better one. The goal is to move beyond traditional metrics such as Word Error Rate (WER), which are considered insufficient for judging transcription quality from a human user's perspective.
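For readers unfamiliar with the metric, WER is conventionally computed as the word-level edit distance (substitutions, deletions, insertions) between a reference and a hypothesis, divided by the number of reference words. The sketch below illustrates that standard definition and one possible way to check how often WER sides with a human choice between two hypotheses. The tuple layout `(reference, hyp_a, hyp_b, human_choice)` and the function `metric_human_agreement` are assumptions made for illustration; they are not taken from the HATS release or the paper.

```python
from typing import Iterable, Sequence, Tuple


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Word-level Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution, or a match when cost is 0
            ))
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Standard WER: edit distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref, hyp) / len(ref)


# Hypothetical record layout (an assumption, not the HATS file format):
# (reference, hypothesis_a, hypothesis_b, human_choice) with choice "a" or "b".
Item = Tuple[str, str, str, str]


def metric_human_agreement(items: Iterable[Item]) -> float:
    """Share of non-tied pairs where the lower-WER hypothesis is the human pick."""
    agree = 0
    decided = 0
    for reference, hyp_a, hyp_b, human_choice in items:
        wer_a = wer(reference, hyp_a)
        wer_b = wer(reference, hyp_b)
        if wer_a == wer_b:
            continue  # WER cannot separate the two hypotheses; skip the pair
        decided += 1
        metric_choice = "a" if wer_a < wer_b else "b"
        if metric_choice == human_choice:
            agree += 1
    if decided == 0:
        raise ValueError("no pair on which the metric expresses a preference")
    return agree / decided
```

Skipping tied pairs is a design choice of this sketch: two hypotheses with very different errors can receive the same WER, which is one reason a word-count metric may diverge from what listeners prefer.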

Impact: Introduces a new dataset for evaluating ASR systems, potentially leading to transcription quality assessments that align more closely with human judgment.

Ranking rationale: The cluster describes a new academic paper introducing a novel dataset for ASR evaluation.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · From 2 sources. How we write summaries →

Sources [2]

  1. arXiv cs.CL TIER_1 English(EN) · Thibault Bañeras Roux, Jane Wottawa, Mickael Rouvier, Teva Merlin, Richard Dufour

    HATS: An Open data set Integrating Human Perception Applied to the Evaluation of Automatic Speech Recognition Metrics

    arXiv:2604.27542v1. Abstract: Conventionally, Automatic Speech Recognition (ASR) systems are evaluated on their ability to correctly recognize each word contained in a speech signal. In this context, the word error rate (WER) metric is the reference for evaluati…

  2. arXiv cs.CL TIER_1 English(EN) · Richard Dufour

    HATS: An Open data set Integrating Human Perception Applied to the Evaluation of Automatic Speech Recognition Metrics

    Conventionally, Automatic Speech Recognition (ASR) systems are evaluated on their ability to correctly recognize each word contained in a speech signal. In this context, the word error rate (WER) metric is the reference for evaluating speech transcripts. Several studies have show…