View Item 
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        JavaScript is disabled for your browser. Some features of this site may not work without it.

        Browse

        All of UU Student Theses RepositoryBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

        Exploring the value of the Bregman Block Average Co-clustering algorithm for missing value imputation in geo-referenced time series

        Thumbnail
        View/Open
        Final_thesis_joris_timmermans_4140214_29052019.pdf (16.83Mb)
        Publication date
        2019
        Author
        Timmermans, J.M.
        Metadata
        Show full item record
        Summary
        Introduction Missing values frequently introduce loss of information in spatial analysis. A common approach to manage missing values is to impute missing values. This is often done by using spatial interpolation models, and more recently machine learning methods. The Bregman Block Average Co-clustering algorithm with I-Divergence (BBAC-I) has recently been applied to explore spatial patterns. Among other things, the original authors of this algorithm used it for missing value imputation. This thesis explored the value of the BBAC-I algorithm in missing value imputation of Geo-referenced time series. Methods This model comparison study compared the imputation value of a selection of machine learning and spatial interpolation models to the BBAC-I models on four data sets with distinctly different spatial characteristics. Three objectives were set to explore the BBAC-I algorithm in this context: (1) Compare the prediction accuracy, (2) compare the computational run time, (3) analyse the spatial properties of the prediction residuals. Results and Conclusion BBAC-I produced less accurate results than the selection of Machine learning models, but produced more accurate than spatial interpolation methods. The BBAC-I run time was faster than any other model, especially for larger data sets. However, it did consistently produce positively spatially correlated residuals. The value of BBAC-I for missing value imputation lies in a limited selection of data sets that are very large, and for which limiting computational requirements is more important than accuracy. Future research should continue to address the value of recently developed non spatial models in the spatial domain.
        URI
        https://studenttheses.uu.nl/handle/20.500.12932/35535
        Collections
        • Theses
        Utrecht university logo