English(EN) A Calculus-Based Framework for Determining Vocabulary Size in End-to-End ASR

新框架利用微积分优化自动语音识别词汇量大小

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-14 06:19

研究人员开发了一个基于微积分的框架，用于确定端到端自动语音识别（ASR）系统的最佳词汇量大小。与传统的混合ASR不同，端到端系统从训练数据中派生其词汇量，使得词汇量大小成为一个关键的超参数。这种新方法利用曲线拟合和微积分原理来正式估计最佳词汇量大小，从而提高了ASR在Librispeech等标准数据集上的性能。 AI

影响正式化了一种优化端到端ASR词汇量大小的方法，有望提高模型性能和训练效率。

排序理由学术论文，详细介绍了一种优化ASR系统超参数的新方法。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Sunil Kumar Kopparapu · 2026-05-14 06:19

A Calculus-Based Framework for Determining Vocabulary Size in End-to-End ASR

In hybrid automatic speech recognition (ASR) systems, the vocabulary size is unambiguous, typically determined by the number of phones, bi-phones, or tri-phones present in the language. In contrast, end-to-end ASR systems derive their vocabulary, often referred to as tokens from …

报道来源 [1]

A Calculus-Based Framework for Determining Vocabulary Size in End-to-End ASR

相关实体

相关话题