Binary Classification on a Highly Imbalanced Dataset
dc.rights.license | CC-BY-NC-ND | |
dc.contributor.advisor | Siebes, A.P.J.M. | |
dc.contributor.advisor | Dajani, K. | |
dc.contributor.author | Peters, T.R. | |
dc.date.accessioned | 2018-08-24T17:00:41Z | |
dc.date.available | 2018-08-24T17:00:41Z | |
dc.date.issued | 2018 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/30529 | |
dc.description.abstract | Credit card fraud is a growing field of crime. Data-drive detection of fraudulent transactions can be viewed as a binary classification problem, where the two outcome classes are highly imbalanced. To overcome the difficulties that arise from this imbalance, multiple solution are described and explored. Furthermore, accompanied statistical arguments, a novel method using subgroup discovery is introduced. Finally, all methods are empirically tested on an actual credit card transaction dataset. | |
dc.description.sponsorship | Utrecht University | |
dc.format.extent | 2614935 | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | |
dc.title | Binary Classification on a Highly Imbalanced Dataset | |
dc.type.content | Master Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | Classification; Imbalanced Data; Fraud; Bump Hunting; | |
dc.subject.courseuu | Mathematical Sciences |