dc.rights.license: CC-BY-NC-ND
dc.contributor.advisor: Dirksen, S
dc.contributor.author: Essaijan, Alex
dc.date.accessioned: 2023-09-06T09:40:20Z
dc.date.available: 2023-09-06T09:40:20Z
dc.date.issued: 2023
dc.identifier.uri: https://studenttheses.uu.nl/handle/20.500.12932/44950
dc.description.abstract: In machine learning, models are typically evaluated by training them on a specific dataset and assessing their performance on a separate test set. Assessing their performance in real-world environments, however, can be challenging, especially when labeled data is scarce. This study focuses on estimating the performance of machine learning classifiers in financial audits, specifically on unseen accounting data. Using the Confidence Based Probability Estimation methodology, performance metrics can be estimated accurately from both the predicted labels and the predicted probabilities. These estimates rely on three assumptions: there is no concept drift, the model is well calibrated, and it performs consistently across all classes. The findings have practical implications for auditors, offering insight into the feasibility and usability of integrating machine learning models into audit procedures and enabling informed decisions about adopting these models. This research also contributes to the field by emphasizing the importance of considering class discrepancies and by promoting a data-driven approach that improves sampling methods beyond traditional random sampling. Future research could address challenges such as multiclass calibration, class imbalance, threshold selection methods, and real-time monitoring of model performance, which would enhance the robustness and applicability of machine learning models in production settings.
dc.description.sponsorship: Utrecht University
dc.language.iso: EN
dc.subject: The estimation of model performance on unseen financial data. The results of this study gave the ADR guidelines on how it could apply machine learning in the auditing process.
dc.title: The estimation of model performance on unseen data
dc.type.content: Master Thesis
dc.rights.accessrights: Open Access
dc.subject.keywords: Classification, Confidence based estimation, Probability Calibration, Financial Audits, Financial Transactions
dc.subject.courseuu: Applied Data Science
dc.thesis.id: 23507

