Binary Classification on a Highly Imbalanced Dataset

Peters, T.R.

View/Open

Thesis_Tom_Peters.pdf (2.493Mb)

Publication date

2018

Author

Peters, T.R.

Metadata

Show full item record

Summary

Credit card fraud is a growing field of crime. Data-drive detection of fraudulent transactions can be viewed as a binary classification problem, where the two outcome classes are highly imbalanced. To overcome the difficulties that arise from this imbalance, multiple solution are described and explored. Furthermore, accompanied statistical arguments, a novel method using subgroup discovery is introduced. Finally, all methods are empirically tested on an actual credit card transaction dataset.

URI

https://studenttheses.uu.nl/handle/20.500.12932/30529

Collections

Theses