dc.rights.license: CC-BY-NC-ND
dc.contributor.advisor: Nazarov, Aleksei
dc.contributor.author: Kooij, G.
dc.date.accessioned: 2021-08-09T18:00:31Z
dc.date.available: 2021-08-09T18:00:31Z
dc.date.issued: 2021
dc.identifier.uri: https://studenttheses.uu.nl/handle/20.500.12932/40696
dc.description.abstract: How learners, such as children, acquire knowledge of a language is a long-standing question in linguistics. Maximum entropy models are often used to model this process. Nazarov (2016) and Mayer (2018) showed that phonotactic phenomena can be learned without a priori knowledge. These studies, however, did not consider the effect of clustering on accuracy. This research measures that effect. The main question is whether clustering improves the performance of a phonotactic maximum entropy model that learns a language, compared to a model without clustering. To answer this question, a model was built. It first creates constraints using clusters of classes of phones. Next, a maximum entropy model weights these constraints. Finally, using these weights, the model predicts the probability of words in the language; the predicted probabilities are compared to the actual probabilities and evaluated. Model variants with and without clustering were then compared. The variants with clustering performed better than those without, which suggests that clustering improves the performance of the model. The model was run on a made-up 'toy' language. Follow-up research could examine a real language with a larger dataset.
dc.description.sponsorship: Utrecht University
dc.format.extent: 170307
dc.format.extent: 43479
dc.format.extent: 8339
dc.format.extent: 19474
dc.format.mimetype: application/pdf
dc.format.mimetype: text/plain
dc.format.mimetype: text/plain
dc.format.mimetype: text/plain
dc.language.iso: en
dc.title: Clustering in a Phonotactic Maximum Entropy Model
dc.type.content: Bachelor Thesis
dc.rights.accessrights: Open Access
dc.subject.keywords: Clustering, Maximum Entropy Model, MaxEnt, Phonology, Phonotactics, Language Constraints
dc.subject.courseuu: Kunstmatige Intelligentie

