View Item 
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        JavaScript is disabled for your browser. Some features of this site may not work without it.

        Browse

        All of UU Student Theses RepositoryBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

        Optimal choice of sampling location for mapping with machine learning on a fixed budget.

        Thumbnail
        View/Open
        Thesis_Last.pdf (1.318Mb)
        Publication date
        2024
        Author
        Noti, Thodoris
        Metadata
        Show full item record
        Summary
        This study explores optimal sampling strategies for soil mapping with a constrained budget, focusing on predicting soil clay content using Digital Soil Mapping (DSM) techniques. Soil mapping is crucial for sustainable land management, impacting agriculture, environmental monitoring, and land use planning. Advances in remote sensing, GIS, and machine learning (ML) have improved the efficiency and accuracy of soil mapping. This research employs Random Forest (RF) models to compare the efficacy of Simple Random Sampling (SRS) and Conditioned Latin Hypercube Sampling (cLHS). Using a dataset of 3,670 geo-referenced soil samples from Ebergötzen, Germany, the RF models were trained and validated, with key predictors identified. Results indicate that SRS generally offers lower Root Mean Square Error (RMSE) values and higher predictive accuracy compared to cLHS. The study also evaluates the impact of measurement errors and different sampling strategies. A significant finding is that a mixed-method approach, combining 25% high-cost, high-accuracy sampling (Method A) with 75% low-cost, lower-accuracy sampling (Method B), provides the optimal balance between accuracy and costefficiency. This approach achieved the lowest median RMSE, demonstrating the highest accuracy among the tested scenarios. The findings suggest that integrating diverse sampling methods can enhance the reliability and cost-effectiveness of soil property predictions, offering practical guidelines for improving DSM and land management practices.
        URI
        https://studenttheses.uu.nl/handle/20.500.12932/47797
        Collections
        • Theses

        Related items

        Showing items related by title, author, creator and subject.

        • Boltzmann Generators for sampling many-body systems in the isobaric-isothermal ensemble 

          Leeuwen, Steyn van (2023)
          Boltzmann generators (BGs) are exact-likelihood generative models that can be used to sample equilibrium states of many-body systems from the canonical ensemble and to compute the associated Helmholtz free energy. Moreover, ...
        • Measurement of bone quality using multispectral X-ray imaging with Medipix3 detectors 

          Garcia-Tejedor Bilbao-Goyoaga, Andrea (2023)
        • Improving surrogate models through exploration and exploitation sampling 

          Demmers, David (2024)
        Utrecht university logo