English(EN) Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis

语言模型和人类在句子惊讶度上存在差异

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-14 21:44

研究人员调查了语言模型在处理歧义句时为何比人类表现出更少的惊讶。他们检验了语言模型可以同时考虑比人类更多的解释的假说。通过调整循环神经网络语法中使用的解析数量，他们发现减少同时解析会增加预测的“花园路径”效应，但不足以匹配人类的阅读时间。这表明解析多重性上的差异本身并不能解释惊讶程度上的差异。 AI

影响探讨了语言模型和人类在处理语言歧义方面的根本差异，可能为未来的模型设计提供信息。

排序理由在arXiv上发表的学术论文，详细介绍了关于语言模型行为的假说和实验结果。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Tal Linzen · 2026-05-14 21:44

Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis

Surprisal theory posits that the processing difficulty of a word is determined by its predictability in context, offering a potential link between human sentence processing and next-word predictions from language models. While language model (LM) surprisals successfully predict r…

报道来源 [1]

Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis

相关实体

相关话题