Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorKarssenberg, Derek
dc.contributor.authorVelde, Ewout van der
dc.date.accessioned2023-07-25T00:02:12Z
dc.date.available2023-07-25T00:02:12Z
dc.date.issued2023
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/44308
dc.description.abstractCost distance tool are embedded in various geographic information systems (GIS) giving insights into spatial relationships. Most GIS software use a serial cost distance algorithm. Cost distance calculations have a strong sequential nature due to the order we access cells in the raster. This limits the development of a parallel algorithm, hindering the usage of multiple processes for faster computation. This paper proposes a parallelization framework, accounting for the sequential nature while running in parallel. By dynamically distributing partitions to different processes, we ensure full workload distribution and minimising idle times for processes. Relative strong and weak scaling efficiencies drop below 80% when run with more than 3 and 2 workers respectively. We notice that this is partly caused by the fact that the size of the input data and the amount of work a worker has to perform, do not scale linearly. When scaling to more workers, we expect to run into a performance bottleneck caused by input output operations of the root node. Recommendations are made for future research to limit the amount of input output operations by statically assigning partitions to workers.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectParallelization of the cost-distance algorithm
dc.titleParallelization of the cost-distance algorithm
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsCost; Distance; Parallelization; Framework; Multiprocessing;
dc.subject.courseuuApplied Data Science
dc.thesis.id20038


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record