English(EN) Pretrained self-supervised speech models can recognize unseen consonants

语音模型泛化识别稀有舌侧塞音

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-10 01:07

研究人员调查了自监督语音模型是否能准确识别不常见的语音声音，特别是科伊桑语系中发现的舌侧塞音。通过在 G|ui 和 West !Xoon 的数据上微调 Wav2Vec2 和 HuBERT 等模型，他们发现这些模型确实比非舌侧塞音更能有效地识别舌侧塞音。这表明自监督学习使这些模型能够跨越更广泛的人类音素进行泛化，即使是那些在典型训练数据中很少遇到的音素。 AI

影响证明了自监督模型可以泛化到稀有音素，有可能改善低资源语言的自动语音识别。

排序理由该集群包含一篇详细介绍语音模型研究结果的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Chihiro Taguchi, \'Eric Le Ferrand, Hirosi Nakagawa, Hitomi Ono, Kanji Kato, Emily Prud'hommeaux, David Chiang · 2026-06-11 04:00

Pretrained self-supervised speech models can recognize unseen consonants

arXiv:2606.11542v1 Announce Type: cross Abstract: Modern pretrained self-supervised automatic speech recognition models are trained on large-scale audio data to encode speech into contextualized representations. However, their training data are heavily skewed toward high-resource…
arXiv cs.CL TIER_1 English(EN) · David Chiang · 2026-06-10 01:07

预训练的自监督语音模型可以识别未见的辅音

Modern pretrained self-supervised automatic speech recognition models are trained on large-scale audio data to encode speech into contextualized representations. However, their training data are heavily skewed toward high-resource languages with little data from low-resource lang…

报道来源 [2]

Pretrained self-supervised speech models can recognize unseen consonants

预训练的自监督语音模型可以识别未见的辅音

相关实体

相关话题