Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorLissa, C.J. van
dc.contributor.authorKoutsoukou Prelorentzou, Rania
dc.date.accessioned2024-07-11T00:03:24Z
dc.date.available2024-07-11T00:03:24Z
dc.date.issued2024
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/46667
dc.description.abstractText mining is considered an effective approach for the identification of relevant phenomena in systematic reviews. Topic models have shown to be a promising unsupervised technique to reveal common topics in text data. This research used three topic modeling text mining algorithms, LDA, Top2Vec, and BERTopic, to identify the relevant phenomena in two datasets from published literature text data. The first dataset contains bibliographic data of articles about adolescents’ emotional regulation, and the second, bibliographic data of articles about cooperation in prisoner’s dilemma, where each of the datasets is divided to abstracts and keywords. The goal of this thesis is to select the optimal number of topics/phenomena and then map them to a network. Comparing the performance of the three algorithms with regards to topic quality and network representation of the topics, it is concluded that BERTopic produced more meaningful topics than Top2Vec and LDA.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectDeveloping text-mining methods to review the published literature
dc.titleDeveloping text-mining methods to review the published literature
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordstext mining systematic review, phenomena identification, LDA, Top2Vec, BERTopic, topic quality
dc.subject.courseuuApplied Data Science
dc.thesis.id9163


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record