dc.rights.license | CC-BY-NC-ND | |
dc.contributor.advisor | Akker, Marjan van den | |
dc.contributor.author | Prins, Marijn | |
dc.date.accessioned | 2025-08-21T00:05:30Z | |
dc.date.available | 2025-08-21T00:05:30Z | |
dc.date.issued | 2025 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/49884 | |
dc.description.abstract | This thesis investigates Maskable Proximal Policy Optimization (PPO) for Electric Vehicle Charging Scheduling (EVCS) within a non-uniform smart grid, partially integrated with renewable energy, aiming to minimize charging tardiness. Using the Polderwijk simulation, which models real-world grid bottlenecks and partial solar energy deployment, we compare Maskable PPO against traditional dispatching strategies and a heuristic approach. Despite experimentation across varied observation spaces, action spaces, and reward functions, the Maskable PPO agent consistently failed to achieve convergence, being unable to respect grid capacity or minimize delays. Our findings show the challenges of applying model-free reinforcement learning (RL) to EVCS problems with strict hard constraints. This research indicates a need for future work to investigate alternative RL architectures for integrating hard constraints to achieve convergence in complex smart grid environments. | |
dc.description.sponsorship | Utrecht University | |
dc.language.iso | EN | |
dc.subject | This thesis investigates Maskable Proximal Policy Optimization (PPO) for Electric Vehicle Charging Scheduling (EVCS) within a non-uniform smart grid, partially integrated with renewable energy, aiming to minimize charging tardiness. | |
dc.title | Reinforcement Learning Applied to an EV Charging Scheduling
Problem | |
dc.type.content | Master Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | Polderwijk; EV Charging Scheduling problem; Reinforcement Learning; Maskable PPO; Markov Decision Process; Optimalization; Simulation | |
dc.subject.courseuu | Artificial Intelligence | |
dc.thesis.id | 52018 | |