Investigating relaxed probability updating games

Aldaibis, Collin

dc.rights.license	CC-BY-NC-ND
dc.contributor.advisor	Ommen, Thijs van
dc.contributor.author	Aldaibis, Collin
dc.date.accessioned	2022-03-16T00:00:52Z
dc.date.available	2022-03-16T00:00:52Z
dc.date.issued	2022
dc.identifier.uri	https://studenttheses.uu.nl/handle/20.500.12932/610
dc.description.abstract	Digital data is everywhere; it is the backbone of science and our modern society. But data is sometimes incomplete. A complex form of incomplete data is when data is coarse. Many coarse data problems cannot be solved with standard conditioning. The problem can be reformulated as a probability updating game: a zero-sum game between a host and a contestant. An instance of a probability updating game is made from rewriting the Monty Hall problem as a game. It is proved that if the host plays a strategy that satisfies the RCAR condition, it plays worst-case optimally and the probabilities can be updated robustly for the contestant. We study whether RCAR still characterises Nash equilibria when the zero-sum constraint or the one-shot constraint of these games are removed. We found that if RCAR characterises optimality for a zero-sum, one-shot probability updating game, it also characterises optimality for the finitely repeated game. Moreover, we conclude from empirical analysis that if RCAR characterises optimality for a zero-sum probability updating game, it may also characterise optimality for a moderately competitive non-zero-sum game.
dc.description.sponsorship	Utrecht University
dc.language.iso	EN
dc.subject	Many coarse data problems cannot be solved with standard conditioning. A coarse data problem can be rewritten as a probability updating game: a zero-sum game between a host and a contestant. If the RCAR condition is satisfied, probabilities can be updated robustly. We study whether RCAR still characterises Nash equilibria when the zero-sum - or the one-shot constraint are relaxed.
dc.title	Investigating relaxed probability updating games
dc.type.content	Master Thesis
dc.rights.accessrights	Open Access
dc.subject.keywords	probability; updating; game; reinforcement, learning; proximal policy optimization; proximal policy optimisation; dirichlet; repeated game; multi-agent; coarse data; coarse data problem;
dc.subject.courseuu	Computing Science
dc.thesis.id	2890

Files in this item

Name:: Thesis 15-03-2022 - Final.pdf
Size:: 1.431Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Theses

Show simple item record