English(EN) In Silico Modeling of the RAMPHO Buffer: Dissociating Informational and Energetic Masking via Phonetic Entropy in Deep Neural Networks

新的模拟模型揭示语音理解的认知极限

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-21 13:25

研究人员开发了 RAMPHO 缓冲区的计算机模拟，这是多说话者聆听环境中的认知瓶颈。该模拟使用 wav2vec 2.0 声学模型的语音熵来区分信息掩蔽和能量掩蔽。研究揭示了一种权衡：在高信噪比下，去除干扰项的语义内容有助于聆听，但在较低信噪比下会损害时间线索感知。 AI

影响引入了一种理解语音处理中认知局限性的新颖模拟，可能指导未来听觉感知领域的人工智能发展。

排序理由该集群包含一篇详细介绍新模拟模型的学术论文。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Stefan Bleeck · 2026-05-22 04:00

深度神经网络中基于语音熵的RAMPHO缓冲区的计算机模拟：区分信息掩蔽和能量掩蔽

arXiv:2605.22465v1 Announce Type: new Abstract: The fundamental challenge of listening in multi-talker environments is a cognitive bottleneck, defined by the Ease of Language Understanding (ELU) model as a failure within the RAMPHO episodic buffer. Current deep neural networks fo…
arXiv cs.CL TIER_1 English(EN) · Stefan Bleeck · 2026-05-21 13:25

深度神经网络中基于语音熵的RAMPHO缓冲区的计算机模拟：区分信息掩蔽和能量掩蔽

The fundamental challenge of listening in multi-talker environments is a cognitive bottleneck, defined by the Ease of Language Understanding (ELU) model as a failure within the RAMPHO episodic buffer. Current deep neural networks for speech enhancement optimize purely for physica…