Trajectory Dynamics in Language Model Hidden States Predict Human Processing Costs Beyond Surprisal

arXiv cs.CL·Elan Barenholtz

2d ago

·~2 min·6/5/2026·en·1

Quick Answer

This study introduces trajectory extrapolation error as a predictor of human processing costs in language comprehension, revealing that it operates independently of surprisal.

Quick Take

This study introduces trajectory extrapolation error as a predictor of human processing costs in language comprehension, revealing that it operates independently of surprisal. Analyzing transformer models like GPT-2, the research shows that this measure correlates with reading times, particularly in complex sentences, suggesting a nuanced understanding of language processing dynamics.

Key Points

Trajectory extrapolation error predicts reading times independently of surprisal.
The effect is stronger in garden-path sentences and scales with model size.
Findings suggest two components of processing cost: surprisal and trajectory error.
Research utilized the Natural Stories corpus for empirical validation.
Results replicate across different transformer architectures and positional encodings.

Article Content

From source RSS / original summary

arXiv:2606. 05346v1 Announce Type: new Abstract: Human language comprehension unfolds sequentially: each word is processed in the context of those that came before, and the interpretation builds incrementally over time. Surprisal, the negative log probability of a word given its context, has been the dominant predictor of incremental processing cost.

But surprisal reduces rich sequential representations to a single scalar at each word, discarding information about the direction in which the interpretation has been evolving. Dynamical-systems approaches suggest that the trajectory of the evolving interpretive state, not just its position at each moment,should shape processing, and language itself may have local momentum, since speakers plan utterances a few words at a time.

We introduce trajectory extrapolation error: at each word, we fit a linear trajectory to the preceding hidden states of a transformer language model and measure deviation from the extrapolated path. On the Natural Stories corpus, this measure is nearly orthogonal to surprisal (r = . 044) and independently predicts self-paced reading times.

The effect is especially pronounced in garden-path sentences, strengthens with model scale (GPT-2 Small to Large), and replicates across architectures with different positional encoding schemes (GPT-2 vs. Pythia/RoPE). A displacement control shows the effect is not reducible to representational change magnitude: displacement and extrapolation error predict in opposite directions.

These findings reveal two dissociable components of processing cost: word-level prediction error (surprisal) and sensitivity to the local momentum of the unfolding interpretation (trajectory extrapolation error).

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CL

See more →

arXiv cs.CL·Leyao Wang, Yanan He, Peng Chen, Asaf Yehudai, Yixin Liu, Rex Ying, Michal Shmueli-Scheuer, Arman Cohan

2w ago

FeaturedOriginal

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

AI Summary

The REFLECT benchmark reveals that current LLM judges are unreliable, achieving below 55% accuracy in evaluating reasoning and evidence use, highlighting the need for improved evaluation methods for deep research agents.

#LLM #Agent #Inference #Policy