View Item 
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        JavaScript is disabled for your browser. Some features of this site may not work without it.

        Browse

        All of UU Student Theses RepositoryBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

        Can a 𝙿𝚢𝚝𝚑𝚘𝚗 (package) do what 𝚖𝚒𝚌𝚎 can?

        Thumbnail
        View/Open
        Elviss_Dvinskis_2459302_ADS_Thesis.pdf (1.825Mb)
        Publication date
        2022
        Author
        Dvinskis, Elviss
        Metadata
        Show full item record
        Summary
        Missing data frequently complicate data analysis. Multiple imputation is a well known and robust technique for addressing missing data. In R, multiple imputation is commonly implemented through the mice package which utilizes the MICE algorithm. However, such a standard choice is not yet established for Python. This study addresses four imputation methods that are implemented in Python to assess if they can yield unbiased and confidence valid estimates. A model-based simulation study is carried out to evaluate the performance of KNNImputer, IterativeImputer, miceforest and MIDASpy. The obtained results demonstrate that while under certain conditions IterativeImputer can show comparable performance to the conventional R imputation method mice, the other methods (KNNImputer, miceforest and MIDASpy) underperform under most conditions specified in this simulation study. This study suggests that it would be unwise to recommend these Python approaches as a general imputation strategy without a detailed comprehension of each of the method’s proper application settings and fine-tuning.
        URI
        https://studenttheses.uu.nl/handle/20.500.12932/42449
        Collections
        • Theses
        Utrecht university logo