Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorKarnstedt-Hulpus, I.R.
dc.contributor.authorAndrosiuk, Jan
dc.date.accessioned2022-09-09T00:02:43Z
dc.date.available2022-09-09T00:02:43Z
dc.date.issued2022
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/42406
dc.description.abstractAlthough the number of transaction fraud events grows slower than the number of transactions in total, it is still a problem for many institutions. Detecting fraudulent transactions is challenging for multiple reasons, including a general lack of labels, class imbalance, and hidden and evolving fraud patterns. Even more difficulties emerge while modeling public transaction datasets, namely feature anonymization, missing information, and data aggregation. This work suggests a pipeline of modeling fraudulent transactions, which accounts for most of those concerns based on other researchers’ experience. From the modeling approaches, one can distinguish those based on transaction features and those using graph anomaly detection methods. This research combines both methods and presents cross-validation results over two datasets. Performance scores did not indicate the superior predictive power of any presented approach. Nevertheless, the addition of graph features in the case of the second dataset significantly improved validation scores and therefore indicated the direction for further research.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectFraud detection in transaction datasets
dc.titleFraud detection in transaction datasets
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsfraud detection, Random Forest, graph theory, transaction data
dc.subject.courseuuApplied Data Science
dc.thesis.id8868


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record