Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorGatt, A.
dc.contributor.authorRoijen, Andrea
dc.date.accessioned2024-12-17T00:01:20Z
dc.date.available2024-12-17T00:01:20Z
dc.date.issued2024
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/48248
dc.description.abstractA longstanding debate surrounds modelling English past tense inflection. In this thesis, we investigated neural models, specifically Encoder-Decoders, as cognitive models of English past tense inflection. While recent studies showed that Encoder-Decoders achieve strong improvements over the initial connectionist model of Rumelhart and McClelland (1986), it has also been reported that they still failed to accurately capture human speaker inflections of novel forms. Our results reveal that data representativeness and model configuration choices influence model performance on real and nonce verbs. Importantly, we found improved correlations with human nonce verb inflections when the problem of overfitting on training data was mitigated, for instance, by using fewer training epochs. However, this also resulted in lower accuracies on real irregular verbs. A key finding is that this problem could be overcome by augmenting the training data with token frequency. This led to near-perfect performance on training verbs, including high accuracies on the irregular class, while obtaining almost equally strong correlations with human data. This highlights the relevance of token frequency, challenging previous assumptions. Additionally, we investigated a multi- task training setup, wherein the model also classifies verbs as regular or irregular. This task aligns with the dual-route view of Pinker and Prince (1988). However, this setup led to similar or slightly worse performance, leaving the cognitive validity of the discrete distinction between regular and irregular verbs open to further investigation. We emphasise the value of future research on using neural models to investigate cognitive processes such as morphological inflection.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectI investigated the potential of encoder-decoder neural models as cognitive model of English past tense inflection. I did this by investigating model performance on real and nonce verbs, as a result of: token frequency instead of type frequency in the training data, a multi-task training set-up where the model also classifies verbs as regular or irregular, the influence of different data and model configurations, and the influence of the distribution of verbs over a training and development set.
dc.titleThe Tense Debate: Cognitive Modelling English Past Tense Inflection with Encoder-Decoder Neural Models
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsEncoder-Decoder models; cognitive modelling; English past tense inflection; token frequency; Multi-task learning; morphological inflection; neural models
dc.subject.courseuuArtificial Intelligence
dc.thesis.id41807


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record