PulseAugur
Connecting Speech to Words through Images

New method links speech to images without text supervision

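Researchers have developed a method for building a vocabulary of spoken words without relying on explicit textual supervision. The method uses images and their spoken descriptions to construct a written vocabulary, then aligns the written words with the corresponding audio segments. The system uses unsupervised word discovery to link spoken segments to their written counterparts, and it proves effective on spoken retrieval and keyword spotting tasks.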

Impact: Supports development for low-resource languages and improves the interpretability of speech-to-text systems.

Ranking rationale: This cluster contains an academic paper published on arXiv that details a new research method.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

  1. arXiv cs.CL · Gabriel Pirlogeanu, Dan Oneata, Horia Cucu, Herman Kamper

    Connecting Speech to Words through Images

    arXiv:2606.16807v1. Abstract: How can we learn the mapping between written words and their spoken counterparts in the absence of explicit textual supervision? We present a visually grounded method for building a vocabulary of spoken words using only images and t…

  2. arXiv cs.CL · Herman Kamper

    Connecting Speech to Words through Images

    How can we learn the mapping between written words and their spoken counterparts in the absence of explicit textual supervision? We present a visually grounded method for building a vocabulary of spoken words using only images and their spoken descriptions. First, image captionin…