Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Eyettention II: A Dual-Sequence Architecture for Modeling Fixation Location, Within-Word Landing Position, and Fixation Duration in Reading

About

The way our eyes move while reading provides valuable insights into both the reader's cognitive processes and the properties of the text. In particular, eye-tracking-while-reading data has shown to be highly beneficial in various technological applications, such as enhancing and interpreting language models and inferring a reader's characteristics. However, these applications often rely on large-scale, data-driven models, which demand extensive eye-tracking datasets that are challenging to obtain due to the resource-intensive nature of data collection. To address the challenge of data scarcity, we develop Eyettention II, an end-to-end trained deep-learning model capable of generating realistic scanpaths consisting of a complete set of fixation attributes in chronological order, including fixation location, within-word landing position, and fixation duration. Our model is lightweight, efficiently trainable on limited GPU resources, and closely aligned with cognitive theories. We demonstrate that Eyettention II surpasses state-of-the-art models in scanpath prediction and mirrors human-like gaze behavior by capturing key psycholinguistic phenomena. With its robust performance, Eyettention II holds the potential to drive advancements in natural language processing, facilitate piloting the materials of psycholinguistic experiments, and uncover new insights beyond what is explicitly encoded in theoretical cognitive models.

Shuwen Deng, Cui Ding, David R. Reich, Paul Prasse, Lena A. J\"ager• 2026

Related benchmarks

TaskDatasetResultRank
Scanpath PredictionCELER English L1 (New Sentence Split)
MultiMatch Score (Vector)0.978
6
Scanpath PredictionCELER English L1 (New Reader Split)
MultiMatch Vector Score97.9
6
Scanpath PredictionCELER English L1 (New Sentence / New Reader Split)
MultiMatch Vector Score98
6
Scanpath PredictionBSC (Chinese) (New Sentence Split)
MultiMatch Vector99.3
5
Scanpath PredictionBSC Chinese (New Reader Split)
MultiMatch (Vector)0.993
5
Scanpath PredictionBSC Chinese (New Sentence New Reader Split)
MultiMatch Vector0.993
5
Showing 6 of 6 rows

Other info

Follow for update