Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorRico Cuevas, Ramon
dc.contributor.authorTilma, Allard
dc.date.accessioned2024-08-29T00:02:36Z
dc.date.available2024-08-29T00:02:36Z
dc.date.issued2024
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/47459
dc.description.abstractCorrect estimation of probability of default (PD) for credit loans is an essential task for BridgeFund, an online loan broker operating in the Dutch Small and Medium-Sized Enterprizes (SME) market . Advanced machine learning techniques are increasingly being explored to enhance prediction accuracy. Traditional models like logistic regression offer clear interpretability but often lack predictive power compared to more complex algorithms. Ensemble methods and deep learning techniques show potential for significant performance improvements in PD quantification. This study compares XGBoost, Random Forest, Feedforward Neural Networks (FNN) and Tabular Networks (TabNet) against logistic regression to determine their efficacy. The results show that XGBoost outperforms logistic regression and all other models, in all evaluation metrics for PD scoring. However, the "black box" nature of XGBoost raises concerns about model transparency and stakeholder trust, necessitating careful implementation. Developing techniques to demystify XGBoost’s decision-making process such as calculation of SHAP values will enhance the model’s interpretability and, therefore, applicability.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectDeploying Deep Learning Techniques to Estimate Probability of Default: Way to Data Driven Credit Risk Modeling
dc.titleDeploying Deep Learning Techniques to Estimate Probability of Default: Way to Data Driven Credit Risk Modeling
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.courseuuApplied Data Science
dc.thesis.id38103


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record