Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributorShihan Wang, Injy Sharhan, Pablo Mosteiro Romero
dc.contributor.advisorWang, Shihan
dc.contributor.authorTóth, Bendegúz
dc.date.accessioned2024-03-31T23:01:36Z
dc.date.available2024-03-31T23:01:36Z
dc.date.issued2024
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/46226
dc.description.abstractThis thesis investigates the application of multi-critic reinforcement learning to taxonomy construction. Using multiple critics in complicated environments has been shown to improve overall performance on a plethora of reinforcement learning tasks. Guiding the learning process of the critic by explicitly constraining it to a small subset of the task is an effective way to speed up learning and ensure stability during training. Combining several of these constrained critics into a centralized critic outperforms single critic methods on a variety of different tasks. Multi-critic algorithms are especially effective when there is an underlying structure of the task with clearly defined sub-tasks that can be evaluated independently. In this work, we introduce a multi-critic algorithm for taxonomy construction, where we use two critics to evaluate the choice of the parent and child words at each step. Through a series of experiments, we demonstrate that the critics have learned to effectively identify the source of error in incorrect actions, which was not possible with previous methods. We also demonstrate the robustness of our model by analyzing the consistency of the structure of its generated taxonomies.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectIn this thesis we investigate the use of multi-critic reinforcement learning for taxonomy construction tasks.
dc.titleTaxonomy Construction with Multi-Critic Reinforcement Learning
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsmachine learning; reinforcement learning; taxonomy; taxonomy construction; multi-critic; credit assignment
dc.subject.courseuuArtificial Intelligence
dc.thesis.id20052


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record