PulseAugur
实时 08:47:14
English(EN) In Silico Modeling of the RAMPHO Buffer: Dissociating Informational and Energetic Masking via Phonetic Entropy in Deep Neural Networks

新的模拟模型揭示语音理解的认知极限

研究人员开发了 RAMPHO 缓冲区的计算机模拟,这是多说话者聆听环境中的认知瓶颈。该模拟使用 wav2vec 2.0 声学模型的语音熵来区分信息掩蔽和能量掩蔽。研究揭示了一种权衡:在高信噪比下,去除干扰项的语义内容有助于聆听,但在较低信噪比下会损害时间线索感知。 AI

影响 引入了一种理解语音处理中认知局限性的新颖模拟,可能指导未来听觉感知领域的人工智能发展。

排序理由 该集群包含一篇详细介绍新模拟模型的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.CL TIER_1 English(EN) · Stefan Bleeck ·

    In Silico Modeling of the RAMPHO Buffer: Dissociating Informational and Energetic Masking via Phonetic Entropy in Deep Neural Networks

    arXiv:2605.22465v1 Announce Type: new Abstract: The fundamental challenge of listening in multi-talker environments is a cognitive bottleneck, defined by the Ease of Language Understanding (ELU) model as a failure within the RAMPHO episodic buffer. Current deep neural networks fo…

  2. arXiv cs.CL TIER_1 English(EN) · Stefan Bleeck ·

    In Silico Modeling of the RAMPHO Buffer: Dissociating Informational and Energetic Masking via Phonetic Entropy in Deep Neural Networks

    The fundamental challenge of listening in multi-talker environments is a cognitive bottleneck, defined by the Ease of Language Understanding (ELU) model as a failure within the RAMPHO episodic buffer. Current deep neural networks for speech enhancement optimize purely for physica…