Pursuit-evasion game with SARSA learned pursuer

Mutsaers, T.

dc.rights.license	CC-BY-NC-ND
dc.contributor.advisor	Alechina, Natasha
dc.contributor.author	Mutsaers, T.
dc.date.accessioned	2021-01-07T19:00:41Z
dc.date.available	2021-01-07T19:00:41Z
dc.date.issued	2020
dc.identifier.uri	https://studenttheses.uu.nl/handle/20.500.12932/38492
dc.description.abstract	Pursuit-evasion algorithms have become more important in robotics domains such as surveillance with robots. Most of these domains lack a complete model or enough data to work with, so reinforcement learning could be a good solution. This thesis shows that "a pursuer with a Sarsa algorithm is able to catch an evader in a discrete grid environment." First, pursuit-evasion games and Sarsa algorithms are explained. The algorithm used in this thesis implements a simple two-dimensional environment with obstacles in which a pursuer agent tries to catch an evader agent. In the experiments, the pursuer uses a Sarsa algorithm with different parameter settings against an evader that uses two different behaviours. This algorithm is described further in the method section. The pursuer manages to learn to catch the evader in each tested scenario. There is no very noticeable difference between the different parameter values used for Sarsa, but there is a difference between the two behaviours of the evader. Finally, directions for future research are discussed. The code used for this thesis can be accessed here: https://github.com/tts118/sarsa-pursuit-evasion
dc.description.sponsorship	Utrecht University
dc.format.extent	928500
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.title	Pursuit-evasion game with SARSA learned pursuer
dc.type.content	Bachelor Thesis
dc.rights.accessrights	Open Access
dc.subject.keywords	pursuit-evasion; sarsa; machine learning; artificial intelligence;
dc.subject.courseuu	Kunstmatige Intelligentie
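
As an illustration of the Sarsa learning rule named in the abstract, the following is a minimal Python sketch of a tabular Sarsa update with epsilon-greedy action selection. It is not taken from the thesis or its repository: the action names, the encoding of states as grid coordinates, and the parameter names alpha, gamma, and epsilon are assumptions chosen for this example.

```python
import random
from collections import defaultdict

# Hypothetical action set for a pursuer on a discrete grid (an assumption).
ACTIONS = ["up", "down", "left", "right"]


def epsilon_greedy(q, state, epsilon, rng):
    """With probability epsilon pick a random action, otherwise a greedy one."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def sarsa_update(q, state, action, reward, next_state, next_action, alpha, gamma):
    """On-policy TD update: Q(s,a) += alpha * (r + gamma * Q(s',a') - Q(s,a))."""
    td_target = reward + gamma * q[(next_state, next_action)]
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Q-table mapping (state, action) pairs to values, defaulting to 0.0.
q = defaultdict(float)
rng = random.Random(0)

# One illustrative transition: the pursuer moves right from cell (0, 0),
# receives a reward of 1.0, and then selects its next action in cell (1, 0).
first_action = "right"
next_action = epsilon_greedy(q, (1, 0), epsilon=0.1, rng=rng)
sarsa_update(q, (0, 0), first_action, 1.0, (1, 0), next_action, alpha=0.5, gamma=0.9)
```

Unlike Q-learning, the update uses the value of the action the agent actually selects in the next state, which is what makes Sarsa an on-policy method.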

Files in this item

Name: Theses AI.pdf
Size: 906.7Kb
Format: PDF
