Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorFrank, Jason
dc.contributor.authorSmit, Iris
dc.date.accessioned2022-06-16T00:00:29Z
dc.date.available2022-06-16T00:00:29Z
dc.date.issued2022
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/41650
dc.description.abstractReinforcement learning is an upcoming area in machine learning with many applications. This thesis covers the basics of reinforcement learning: reward functions, value and policy iterations, and their algorithms. A value iteration algorithm for the game tic-tac-toe is given along with the results of a policy learning from itself. When the reward function is not straightforward to define, a surrogate reward function might be helpful. A surrogate reward function is defined by using the Fiedler vector of the Laplacian of the graph defined by the game. Laplacians based on weighted graphs in four different ways are defined and used to make different surrogate reward functions for a walking game. Finally, the surrogate reward functions are used in a value iterations algorithm and compared to the exact value function of the walking game.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectThe basics of reinforcement learning: reward functions, value and policy iterations, and their algorithms. A value iteration algorithm for the game tic-tac-toe is given along with the results of a policy learning from itself. When the reward function is not straightforward to define, a surrogate reward function might be helpful. A surrogate reward function is defined by using the Fiedler vector of the Laplacian of the graph defined by the game.
dc.titleReinforcement Learning and surrogate reward functions based on graph Laplacians
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsreinforcement, learning, graph, Laplacian, Fiedler, vector, reward, value, policy, tic-tac-toe,
dc.subject.courseuuMathematical Sciences
dc.thesis.id4483


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record