Comparing core concept categorisation models in geo-analytic questions

Wiersma, Zef

dc.rights.license	CC-BY-NC-ND
dc.contributor.advisor	Scheider, Simon
dc.contributor.author	Wiersma, Zef
dc.date.accessioned	2022-09-09T00:02:39Z
dc.date.available	2022-09-09T00:02:39Z
dc.date.issued	2022
dc.identifier.uri	https://studenttheses.uu.nl/handle/20.500.12932/42403
dc.description.abstract	Current question answering (QA) systems lack the ability to provide answers to geo-analytical questions. Geo-analytical questions must be interpreted to know what relevant data and geographical tools require to be used to provide an answer. This study focused on core concept categorisation, which is the first step in developing the aforementioned system. Named-entity recognition, in combination with transformer-based models BERT and RoBERTa, is applied to categorise core concepts in geo-analytical questions. Synonym replacement, a simple data augmentation technique, is applied to overcome data scarcity and results of both models are compared. RoBERTa has a better performance on the original data set and BERT has a better performance on the augmented data set. Both models presented significant improvements when applying synonym replacement. Results of this study can be applied to further develop a geo-analytical QA system.
dc.description.sponsorship	Utrecht University
dc.language.iso	EN
dc.subject	Comparing core concept categorisation models in geo-analytic questions
dc.title	Comparing core concept categorisation models in geo-analytic questions
dc.type.content	Master Thesis
dc.rights.accessrights	Open Access
dc.subject.keywords	named-entity recognition;geo-analytical questions;core concepts categorisation;deep learning models; BERT;RoBERTa;data augmentation
dc.subject.courseuu	Applied Data Science
dc.thesis.id	8864

Files in this item

Name:: NER thesis - Zef Wiersma.pdf
Size:: 357.9Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Theses

Show simple item record