Show simple item record

dc.rights.license: CC-BY-NC-ND
dc.contributor.advisor: Paperno, Denis
dc.contributor.author: Grashoff, Daan
dc.date.accessioned: 2022-03-01T00:00:29Z
dc.date.available: 2022-03-01T00:00:29Z
dc.date.issued: 2022
dc.identifier.uri: https://studenttheses.uu.nl/handle/20.500.12932/533
dc.description.abstract: In this thesis, we studied whether self-attention networks can learn compositional semantics using an arithmetic language. The goal of the language is to evaluate the meaning of nested expressions. We find that self-attention networks can learn to evaluate these nested expressions, taking shortcuts on less complex expressions and utilizing deeper layers on complex expressions as the nesting depth grows. The complexity lies in whether expressions are left-branching (easy) or right-branching (hard) and, in the case of right-branching expressions, whether plus (easy) or minus (hard) operators are used. We find that increasing the number of heads does not always help with more complex expressions, whereas increasing the number of layers consistently helps the networks generalize to deeper expressions. Finally, to better understand what the self-attention networks are doing, we analyzed the attention scores and found interesting patterns, such as numbers attending to the preceding operators and nested sub-expressions attending to preceding operators. These patterns may explain why the self-attention networks can take shortcuts on less complex expressions but not on more complex ones, given the way they try to solve them.
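The left- vs right-branching distinction in the abstract can be made concrete with a small sketch. The exact grammar of the thesis's arithmetic language is not given here, so the expression format below (bracketed integer expressions with `+` and `-`) is an assumption for illustration only; the helper names `left_branching` and `right_branching` are hypothetical.

```python
# Hypothetical illustration of the two expression shapes the abstract
# contrasts: left-branching (nesting grows on the left) vs
# right-branching (nesting grows on the right).

def left_branching(numbers, ops):
    """((n1 op1 n2) op2 n3) ...: each new operand wraps the running expression."""
    expr = str(numbers[0])
    for op, n in zip(ops, numbers[1:]):
        expr = f"( {expr} {op} {n} )"
    return expr

def right_branching(numbers, ops):
    """n1 op1 (n2 op2 (n3 ...)): the deepest bracket sits at the right end."""
    expr = str(numbers[-1])
    for op, n in zip(reversed(ops), reversed(numbers[:-1])):
        expr = f"( {n} {op} {expr} )"
    return expr

left = left_branching([1, 2, 3], ["+", "-"])
right = right_branching([1, 2, 3], ["+", "-"])
print(left)   # ( ( 1 + 2 ) - 3 )
print(right)  # ( 1 + ( 2 - 3 ) )
```

A left-branching expression can be evaluated incrementally with a running total, which is the kind of "shortcut" the abstract mentions; a right-branching expression with minus operators cannot, since the sign of each operand depends on brackets that close only at the end.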
dc.description.sponsorship: Utrecht University
dc.language.iso: EN
dc.subject: In this work, we studied whether self-attention networks can learn compositionality using an arithmetic language, a simple language that allowed us to construct precise compositional structures and test the networks on these.
dc.title: On how transformers learn to understand and evaluate nested arithmetic expressions
dc.type.content: Master Thesis
dc.rights.accessrights: Open Access
dc.subject.keywords: Transformer; interpretability of neural networks; compositionality; embedding depth
dc.subject.courseuu: Artificial Intelligence
dc.thesis.id: 1870

