Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorVreeswijk, G.A.W.
dc.contributor.authorDiederiks, Carlijn
dc.date.accessioned2023-08-11T00:02:12Z
dc.date.available2023-08-11T00:02:12Z
dc.date.issued2023
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/44628
dc.description.abstractOne of the key challenges in analyzing open-ended answers is the labor-intensive nature, which typically requires significant effort. To address this challenge, this research investigated the potential of NLP techniques, specifically topic modeling, to automate the discovery of topics in unstructured text answers. The study explored two different topic modeling methods, namely LDA and BTM, to assess their effectiveness in uncovering latent topics. By employing these methods, the research aimed to automate the extraction of meaningful themes from the open-ended responses. The data underwent preprocessing, and the models were fine-tuned with optimized parameters. The LDA model failed to provide meaningful insights into the underlying topics. However, the results obtained from the BTM model proved to be highly valuable in extracting latent topics from unstructured and unlabeled text data. The BTM model, which employs biterms to address sparse word co-occurrence in short texts, successfully generated topics with interpretable sets of top words. With some manual adjustments and labeling of these topics, the outcomes can be effectively applied in the analysis of open-ended responses, for example when combined with topic classification and sentiment analysis techniques. This research contributes to the field of topic modeling by highlighting the effectiveness of the BTM model in analyzing unstructured text data from employee engagement surveys. The BTM model is a promising model to uncover topics in data with different text lengths and structures.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectOne of the key challenges in analyzing open-ended answers is the labor-intensive nature, which typically requires significant effort. To address this challenge, this research investigated the potential of NLP techniques, specifically topic modeling, to automate the discovery of topics in unstructured text answers. This study explored two different topic modeling methods, namely LDA and BTM, to assess their effectiveness in uncovering latent topics.
dc.titleEvaluating the Effectiveness of the Topic Models LDA and BTM for Uncovering Topics in Open-Ended Employee Engagement Survey Responses
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsLDA; BTM; Topic Modeling; open-ended answers
dc.subject.courseuuApplied Data Science
dc.thesis.id21630


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record