Researchers have investigated why language models exhibit less surprise than humans when processing ambiguous sentences. They tested the hypothesis that language models can consider more interpretations simultaneously than humans. By adjusting the number of parses used in recurrent neural network grammars, they found that reducing simultaneous parses increased predicted garden path effects, but not enough to match human reading times. This suggests that the difference in parse multiplicity alone does not explain the discrepancy in surprise levels. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT Investigates fundamental differences in how language models and humans process linguistic ambiguity, potentially informing future model design.
RANK_REASON Academic paper published on arXiv detailing a hypothesis and experimental results regarding language model behavior. [lever_c_demoted from research: ic=1 ai=1.0]