Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorOberski, Daniel
dc.contributor.authorVerdonk, Raf
dc.date.accessioned2025-08-28T00:02:47Z
dc.date.available2025-08-28T00:02:47Z
dc.date.issued2025
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/50050
dc.description.abstractInternational surveys and assessments can measure educational efficiency, but often fail to find policies or external factors that cause changes in efficiency. Large-scale data on policy reforms is necessary to understand the impact of reforms and identifying reforms that positively benefit education. The World Education Reform Database (WERD) is a comprehensive, manually coded database on educational policy reforms. However due to the large scale and manual nature in the construction of this database, WERD can introduce human errors and limits scalability and update-ability. In this paper, we propose EduPoliRAG, a large language model-based Retrieval-Augmented Generation (RAG) system designed to semi-automate the large scale detection and extraction of educational policies. EduPoliRAG is capable of dealing with many large highly in-depth policy documents in multiple languages simultaneously. Built upon GPT-4o, EduPoliRAG integrates a corpus of international and national policy reports on the Dutch education system in order to generate tabular data on policy reforms. Evaluation is conducted through a manual comparison of EduPoliRAG's outputs against the WERD database. While EduPoliRAG successfully generates structured tabular data on policy reforms across various education sectors and addresses some of the problems of WERD, further refinement is needed to improve the completeness and semantic coverage.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectCreation of a RAG model which is to an extent able to detect educational policy reforms from documents. Through the RAG model a database is created and compared to an exisiting educational policy database (WERD).
dc.titleDetecting educational policy reforms at scale using Retrieval Augmented Generation
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsPolicy; RAG; LLM; Education; Netherlands
dc.subject.courseuuApplied Data Science
dc.thesis.id52757


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record