Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorNguyen, D.
dc.contributor.authorSanting, L.B.
dc.date.accessioned2021-08-09T18:00:16Z
dc.date.available2021-08-09T18:00:16Z
dc.date.issued2021
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/40651
dc.description.abstractThe use of data augmentation techniques in NLP for the creation of more robust models has increased in recent years. Easy Data Augmentation (EDA) techniques by Wei & Zou (2019) proposed a simple method to augment small datasets for text classification that showed promising results. While most research in the topic of data augmentation for NLP has been focused on deep learning models and not traditional machine learning models, these models are still commonly used for text classification. On three text classification tasks, this research tests the application of EDA on the performance of three traditional machine learning models: logistic regression, naïve bayes and decision tree. Results show that EDA marginally improves performance for these classifiers on small and large datasets.
dc.description.sponsorshipUtrecht University
dc.format.extent400770
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.titleEasy Data Augmentation Techniques for Traditional Machine Learning Models on Text Classification Tasks
dc.type.contentBachelor Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsartificial intelligence, natural language processing, text classification, machine learning, data augmentation, easy data augmentation, logistic regression, naive bayes, decision tree, classifier, AI, NLP, ML, EDA
dc.subject.courseuuKunstmatige Intelligentie


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record