View Item 
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        JavaScript is disabled for your browser. Some features of this site may not work without it.

        Browse

        All of UU Student Theses RepositoryBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

        Reinforcement Learning and surrogate reward functions based on graph Laplacians

        Thumbnail
        View/Open
        Master_thesis_Iris_Smit.pdf (1.272Mb)
        Publication date
        2022
        Author
        Smit, Iris
        Metadata
        Show full item record
        Summary
        Reinforcement learning is an upcoming area in machine learning with many applications. This thesis covers the basics of reinforcement learning: reward functions, value and policy iterations, and their algorithms. A value iteration algorithm for the game tic-tac-toe is given along with the results of a policy learning from itself. When the reward function is not straightforward to define, a surrogate reward function might be helpful. A surrogate reward function is defined by using the Fiedler vector of the Laplacian of the graph defined by the game. Laplacians based on weighted graphs in four different ways are defined and used to make different surrogate reward functions for a walking game. Finally, the surrogate reward functions are used in a value iterations algorithm and compared to the exact value function of the walking game.
        URI
        https://studenttheses.uu.nl/handle/20.500.12932/41650
        Collections
        • Theses

        Related items

        Showing items related by title, author, creator and subject.

        • Characterizing the Inputs and Functionality of Stress- and Reward-Encoding Neuronal Ensembles in the Ventral Tegmental Area 

          Danko, Diaz (2024)
          Excessive amounts of stress can contribute to the development and exacerbation of various psychiatric diseases associated with maladaptive reward-driven reinforcement. The ventral tegmental area (VTA) is a mesolimbic brain ...
        • Rewards at Work: The Relationship between Rewards and Work Outcomes, and the Importance attached to Rewards 

          Sares, S.M. (2020)
          The types of rewards one receives from work can have a huge impact on how employees see the organization they are working in. Some research has indicated that additionally the importance one places on these rewards might ...
        • The role of the orbitofrontal cortex in reward related behavior in addiction and alcoholism in particular 

          Steketee, R.M.E. (2009)
          Addiction can be conceptualized as a learning disorder as addicts are unable to regulate behavior associated with drug reward. Reward related learning has traditionally been associated with dopamine transmission in subcortical ...
        Utrecht university logo