Researchers have investigated whether language model representations encode signals related to human reading times. Using regularized linear regression on eye-tracking data across five languages, they compared representations from each model layer against baseline predictors such as surprisal. The study found that early model layers better predicted initial reading measures, suggesting that low-level representations capture human-like processing signatures. For later reading measures, however, surprisal remained the stronger predictor, and the optimal predictor varied by language and eye-tracking metric.
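A minimal sketch of the general approach described above, not the authors' actual pipeline: fit a regularized (ridge) regression from a layer's hidden states to per-word reading times and compare its cross-validated fit against a surprisal-only baseline. All data and names here (layer_features, surprisal, reading_times) are synthetic and purely illustrative.

```python
# Hypothetical sketch: layer representations vs. a surprisal baseline as
# predictors of reading times, scored by cross-validated ridge regression.
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
n_words, hidden_dim = 2000, 64

layer_features = rng.normal(size=(n_words, hidden_dim))    # one layer's hidden states (synthetic)
surprisal = rng.exponential(scale=2.0, size=(n_words, 1))  # -log p(word | context) (synthetic)
reading_times = (0.3 * surprisal[:, 0]
                 + layer_features @ rng.normal(scale=0.05, size=hidden_dim)
                 + rng.normal(scale=0.5, size=n_words))    # e.g. a first-pass reading measure

def cv_r2(features, target):
    """Cross-validated R^2 of a ridge regression from features to reading times."""
    return cross_val_score(Ridge(alpha=1.0), features, target,
                           scoring="r2", cv=5).mean()

print("layer representations R^2:", cv_r2(layer_features, reading_times))
print("surprisal baseline    R^2:", cv_r2(surprisal, reading_times))
```

Repeating this comparison per layer and per eye-tracking measure would yield the kind of layer-by-measure pattern the summary describes.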