Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorVink, Gerko
dc.contributor.authorKlomp, Tinke
dc.date.accessioned2022-09-09T02:01:00Z
dc.date.available2022-09-09T02:01:00Z
dc.date.issued2022
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/42511
dc.description.abstractThis study evaluates whether the Python package IterativeImputer can yield valid estimates through iterative imputation of missing data. The performance was analyzed by means of a simulation study and compared to the benchmark methods of iterative imputation with mice in R and complete case analysis. With each simulation repetition data was generated, amputed with varying conditions (e.g. missing data mechanisms and missing data proportions), handled by the three missingness techniques and multiple regression models were estimated. Estimates were evaluated on bias, coverage rate and confidence interval width were pooled and obtained. IterativeImputer generated results that were relatively low in bias. However, the produced coverage rates were found to be below nominal coverage. This may be explained by the confidence interval widths, as they were generally too small to contain the true value of the data. The Python package doesn’t operate as adequately as mice and doesn’t outperform complete case analysis. Therefore, IterativeImputer isn’t suitable as a imputation tool for drawing inferences.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectThe performance of the Python package IterativeImputer is studied through bias, coverage rate and confidence interval width.
dc.titleIterative Imputation in Python: A study on the performance of the package IterativeImputer
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.courseuuApplied Data Science
dc.thesis.id9557


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record