Fairness in Machine Learning: Ensuring Fairness in Datasets for Classification Problems

Hattatoğlu, B.

dc.rights.license	CC-BY-NC-ND
dc.contributor.advisor	Qahtan, A.A.A.
dc.contributor.author	Hattatoğlu, B.
dc.date.accessioned	2021-07-27T18:01:04Z
dc.date.available	2021-07-27T18:01:04Z
dc.date.issued	2021
dc.identifier.uri	https://studenttheses.uu.nl/handle/20.500.12932/40070
dc.description.abstract	Machine Learning (ML) algorithms are used in a wide range of applications, which affected societies either directly or indirectly in daily life. ML algorithms are preferred for many tasks that require complex computations with a massive volume of data due to the better performance compared to humans. Moreover, people have subjective opinions and points of view, which can lead to bias in their decisions. Unfortunately, ML algorithms are not always objective either. Using ML algorithms in several decision-making systems and other services may cause serious discrimination among some groups of people in society. One of the most significant reasons behind the biased predictions of the algorithms for different demographic groups is the imbalanced representation of each demographic subgroup in the population. In this master's thesis, COSCFair (Clustering and OverSampling for Fair Classification) is proposed, which is a framework to ensure fairness among all the subgroups that exist in a dataset without changing the original class labels of the samples. COSCFair consists of clustering, oversampling, and classification components, where the classification component considers the outcomes of the clustering algorithm. The classification component contains an ensembled technique of class label prediction. The experimental results over different datasets that are widely used as benchmarks to evaluate algorithmic fairness show that the COSCFair framework yields consistent improvements in fairness while causing a minimal loss in predictive performance compared to a set of baseline methods.
dc.description.sponsorship	Utrecht University
dc.format.extent	1342831
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.title	Fairness in Machine Learning: Ensuring Fairness in Datasets for Classification Problems
dc.type.content	Master Thesis
dc.rights.accessrights	Open Access
dc.subject.keywords	Machine Learning, Fairness, Algorithmic Fairness, Clustering, Oversampling, Classification
dc.subject.courseuu	Business Informatics

Files in this item

Name:: MBI_Graduation_Thesis_Begum_Ha ...
Size:: 1.280Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Theses

Show simple item record