View Item 
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        JavaScript is disabled for your browser. Some features of this site may not work without it.

        Browse

        All of UU Student Theses RepositoryBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

        Connecting Software Specifications and Payroll Rules: Evaluating Sparse and BERT-Based Retrieval

        Thumbnail
        View/Open
        04_07_2025_final_Thesis_Tristan.pdf (1.318Mb)
        Publication date
        2025
        Author
        Stuffers, tristan
        Metadata
        Show full item record
        Summary
        Each organization in the Netherlands that withholds taxes is legally required to file tax returns with the Dutch Tax Agency, which in turn sends these data to the Employee Insurance Agency (UWV). A tax return must comply with a set of rules. The technical specifications, intended for payrollsoftware developers, are contained in one document. Examples and explanations, intended for payroll administrators, are contained in another. When analysts or legal experts need to trace an informative rule back to a technical rule, they must manually locate relevant paragraphs as there is no tool to assist their search. We present a framework to automatically retrieve relevant paragraphs. As a baseline, TF–IDF achieved lower Recall@ 10 than BM25 across all metrics. BM25 reached Recall@10 = 0.55 on our manually annotated test set. A BERT bi-encoder, when applied as a reranker over BM25 results, achieved R@10 = 0.42. After domain-adaptive and denoising pre-training, the same bi-encoder reached R@10 = 0.44, yet both scores still trail the BM25 baseline of 0.55.
        URI
        https://studenttheses.uu.nl/handle/20.500.12932/50040
        Collections
        • Theses
        Utrecht university logo