dc.rights.license | CC-BY-NC-ND | |
dc.contributor | Shihan Wang, Injy Sharhan, Pablo Mosteiro Romero | |
dc.contributor.advisor | Wang, Shihan | |
dc.contributor.author | Tóth, Bendegúz | |
dc.date.accessioned | 2024-03-31T23:01:36Z | |
dc.date.available | 2024-03-31T23:01:36Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/46226 | |
dc.description.abstract | This thesis investigates the application of multi-critic reinforcement learning to taxonomy construction. Using multiple critics in complicated environments has been shown to improve overall performance on a plethora of reinforcement learning tasks. Guiding the learning process of the critic by explicitly constraining it to a small subset of the task is an effective way to speed up learning and ensure stability during training. Combining several of these constrained critics into a centralized critic outperforms single critic methods on a variety of different tasks. Multi-critic algorithms are especially effective when there is an underlying structure of the task with clearly defined sub-tasks that can be evaluated independently. In this work, we introduce a multi-critic algorithm for taxonomy construction, where we use two critics to evaluate the choice of the parent and child words at each step. Through a series of experiments, we demonstrate that the critics have learned to effectively identify the source of error in incorrect actions, which was not possible with previous methods. We also demonstrate the robustness of our model by analyzing the consistency of the structure of its generated taxonomies. | |
dc.description.sponsorship | Utrecht University | |
dc.language.iso | EN | |
dc.subject | In this thesis we investigate the use of multi-critic reinforcement learning for taxonomy construction tasks. | |
dc.title | Taxonomy Construction with Multi-Critic Reinforcement Learning | |
dc.type.content | Master Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | machine learning; reinforcement learning; taxonomy; taxonomy construction; multi-critic; credit assignment | |
dc.subject.courseuu | Artificial Intelligence | |
dc.thesis.id | 20052 | |