dc.rights.license | CC-BY-NC-ND | |
dc.contributor.advisor | Logan, B.S. | |
dc.contributor.author | Carras, Alexis | |
dc.date.accessioned | 2023-06-08T23:00:39Z | |
dc.date.available | 2023-06-08T23:00:39Z | |
dc.date.issued | 2023 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/43975 | |
dc.description.abstract | Agent Based Modelling (ABM) is a powerful tool for modelling social systems. Generative runs simulate micro-level behaviours that give rise to emergent macro-level outcomes. To ensure the accuracy of those outcomes to the modelled process, behavioural rules are carefully implemented and their parameters calibrated. Recently, methods for the inverse generation of ABMs - from outcomes to behavioural rules - have received much attention. Most approaches aim at constructing parts of the ABM or require high-resolution data. In this thesis, we use Reinforcement Learning (RL) to learn the individual policies of a school choice model using only summary statistics of the reference process. A Deep Q-Network is used to learn and encode the recovered policy, which can then be used in simulations. We demonstrate the robustness of our method for the recovery of different latent behavioural rules using different reward functions. We find that our method is not very robust, although it shows signs of learning. In subsequent experiments, we show that the recovered policies generalise better than a baseline random agent, but the learned behaviour only partially matches the reference. We speculate on two critical obstacles to the performance that future research should address. | |
dc.description.sponsorship | Utrecht University | |
dc.language.iso | EN | |
dc.subject | Recovering latent school-choice policies from summary statistics using RL for ABMs | |
dc.title | Agent Based Model Discovery with Reinforcement Learning | |
dc.type.content | Master Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | Reinforcement Learning; Agent-Based Models; Inverse Generative Social Science; School-choice models | |
dc.subject.courseuu | Artificial Intelligence | |
dc.thesis.id | 17285 | |