Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorRuigrok, Ynte
dc.contributor.authorEdwards, Laurens
dc.date.accessioned2022-02-17T00:00:27Z
dc.date.available2022-02-17T00:00:27Z
dc.date.issued2022
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/499
dc.description.abstractImbalanced data which is the occurrence of one a minority class in a data set, often causes hardship for machine learning algorithms. A pipeline was built to preprocess the data and apply machine learning algorithms specifically built for imbalanced data sets. Different resulting metrics for model performance were considered (AUROC, AUPCR, precision, recall, accuracy, F-1 and F-beta). The pipeline was applied to the UK Biobank, a large-scale prospective cohort study that allowed to identify hypothesis free, new risk factors for aneurysmal subarachnoid hemorrhage (aSAH).
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectUsing machine learning algorithms to find new potential risk factors for subarachnoid hemorrhage.
dc.titleWays to deal with imbalanced data sets for machine-learning using the identification of potential new risk factors for aneurysmal subarachnoid hemorrhage from the UK Biobank as an example.
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordssubarachnoid hemorrhage, stroke, risk factors, aneurysm, machine learning, imbalanced
dc.subject.courseuuBioinformatics and Biocomplexity
dc.thesis.id2311


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record