Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorFeelders, A.J.
dc.contributor.authorLi, A.
dc.date.accessioned2017-11-27T18:02:12Z
dc.date.available2017-11-27T18:02:12Z
dc.date.issued2017
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/28090
dc.description.abstractIn the process of renting a house, payment arrears may happen to some tenants. Normally, the housing corporation can only take actions after the problems occurred. In this thesis, several machine learning and subgroup discovery algorithms are used to detect in advance people who are more likely to cause payment problems. The chosen machine leaning algorithms include logistic regression, random forests, k nearest neighbors, naive bayes and neural networks using model averaging, while the PRIM algorithm is selected for subgroup discovery. Because the skewed distribution of classes in datasets, we utilize the synthetic minority over-sampling technique (SMOTE) to generate more reasonable results. Additionally, feature selection and several ensemble methods are leveraged as well to improve the model performance, such as averaging, majority voting and stacking. By all these approaches, finally, we are able to get a few models that are significantly b etter than the preliminary one. However, since the available data is limited and incomplete, and important time-based information is missing, we can’t obtain a model which is good enough.
dc.description.sponsorshipUtrecht University
dc.format.extent713955
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.titlePredictive machine learning for a housing corporation
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsmachine learning, subgroup discovery, SMOTE, ensemble, payment problems, housing corporation
dc.subject.courseuuComputing Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record