Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorPoesio, Massimo
dc.contributor.authorMouratidi, Maria
dc.date.accessioned2025-10-15T23:01:45Z
dc.date.available2025-10-15T23:01:45Z
dc.date.issued2025
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/50536
dc.description.abstractAs transformers become increasingly prevalent in NLP research, evaluating their cognitive alignment with human language processing has become essential for validating them as models of human language. This study compares eye-gaze patterns in human reading with transformer attention mechanisms to examine whether they can plausibly represent human attention during reading tasks. Our analysis validates previous findings with encoder models while extending the analysis to decoder architectures. We employ both statistical correlation analysis and predictive modeling using PCA-reduced representations of eye-tracking features across two reading tasks. We also examine the effect of different attention explanation methods (raw attention, attention flow, and gradient-based saliency) on the results. The findings reveal lower correlations and predictive capacity for the decoder model compared to the encoder model, with implications on the gap between behavioral performance and cognitive plausibility of different transformer designs.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectThis thesis compares human eye-gaze patterns in reading with transformer attention to assess cognitive alignment in language processing. It extends prior work by analyzing a decoder-only model, using correlation and predictive modeling of PCA-reduced eye-tracking features, and examining the impact of various attention explanation methods on the results.
dc.titleComparing Eye-gaze and Transformer Attention Mechanisms in Reading Tasks
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordslarge language models; attention; eye-movements; reading; interpretability;
dc.subject.courseuuArtificial Intelligence
dc.thesis.id54600


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record