Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorNussbaum, Madlene
dc.contributor.authorNoti, Thodoris
dc.date.accessioned2024-09-16T23:03:20Z
dc.date.available2024-09-16T23:03:20Z
dc.date.issued2024
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/47797
dc.description.abstractThis study explores optimal sampling strategies for soil mapping with a constrained budget, focusing on predicting soil clay content using Digital Soil Mapping (DSM) techniques. Soil mapping is crucial for sustainable land management, impacting agriculture, environmental monitoring, and land use planning. Advances in remote sensing, GIS, and machine learning (ML) have improved the efficiency and accuracy of soil mapping. This research employs Random Forest (RF) models to compare the efficacy of Simple Random Sampling (SRS) and Conditioned Latin Hypercube Sampling (cLHS). Using a dataset of 3,670 geo-referenced soil samples from Ebergötzen, Germany, the RF models were trained and validated, with key predictors identified. Results indicate that SRS generally offers lower Root Mean Square Error (RMSE) values and higher predictive accuracy compared to cLHS. The study also evaluates the impact of measurement errors and different sampling strategies. A significant finding is that a mixed-method approach, combining 25% high-cost, high-accuracy sampling (Method A) with 75% low-cost, lower-accuracy sampling (Method B), provides the optimal balance between accuracy and costefficiency. This approach achieved the lowest median RMSE, demonstrating the highest accuracy among the tested scenarios. The findings suggest that integrating diverse sampling methods can enhance the reliability and cost-effectiveness of soil property predictions, offering practical guidelines for improving DSM and land management practices.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectThis study investigates optimal soil sampling strategies for predicting clay content using Digital Soil Mapping techniques, comparing Simple Random Sampling (SRS) and Conditioned Latin Hypercube Sampling (cLHS) with Random Forest models. It finds that SRS, combined with a mixed-method approach of high-cost and low-cost sampling, provides the best balance between accuracy and cost-efficiency
dc.titleOptimal choice of sampling location for mapping with machine learning on a fixed budget.
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsDigital Soil Mapping (DSM) Random Forest (RF) Soil Clay Content Conditioned Latin Hypercube Sampling (cLHS) Simple Random Sampling (SRS) Root Mean Square Error (RMSE) Geometric Transformations Predictive Modeling Environmental Covariates Sampling Strategies
dc.subject.courseuuApplied Data Science
dc.thesis.id39401


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record