Language models and humans differ in sentence surprise

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have investigated why language models exhibit less surprise than humans when processing ambiguous sentences. They tested the hypothesis that language models can consider more interpretations simultaneously than humans. By adjusting the number of parses used in recurrent neural network grammars, they found that reducing simultaneous parses increased predicted garden path effects, but not enough to match human reading times. This suggests that the difference in parse multiplicity alone does not explain the discrepancy in surprise levels. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Investigates fundamental differences in how language models and humans process linguistic ambiguity, potentially informing future model design.

RANK_REASON Academic paper published on arXiv detailing a hypothesis and experimental results regarding language model behavior. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

paper
other

COVERAGE [1]

arXiv cs.CL TIER_1 · Tal Linzen · 2026-05-14 21:44

Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis

Surprisal theory posits that the processing difficulty of a word is determined by its predictability in context, offering a potential link between human sentence processing and next-word predictions from language models. While language model (LM) surprisals successfully predict r…

COVERAGE [1]

Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis

RELATED ENTITIES

RELATED TOPICS