dc.rights.license | CC-BY-NC-ND | |
dc.contributor.advisor | Ommen, Thijs van | |
dc.contributor.author | Aldaibis, Collin | |
dc.date.accessioned | 2022-03-16T00:00:52Z | |
dc.date.available | 2022-03-16T00:00:52Z | |
dc.date.issued | 2022 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/610 | |
dc.description.abstract | Digital data is everywhere; it is the backbone of science and our modern society. But data is
sometimes incomplete. A complex form of incomplete data is when data is coarse. Many coarse
data problems cannot be solved with standard conditioning. The problem can be reformulated
as a probability updating game: a zero-sum game between a host and a contestant. An instance
of a probability updating game is made from rewriting the Monty Hall problem as a game. It
is proved that if the host plays a strategy that satisfies the RCAR condition, it plays worst-case
optimally and the probabilities can be updated robustly for the contestant. We study whether
RCAR still characterises Nash equilibria when the zero-sum constraint or the one-shot constraint
of these games are removed. We found that if RCAR characterises optimality for a zero-sum,
one-shot probability updating game, it also characterises optimality for the finitely repeated
game. Moreover, we conclude from empirical analysis that if RCAR characterises optimality
for a zero-sum probability updating game, it may also characterise optimality for a moderately
competitive non-zero-sum game. | |
dc.description.sponsorship | Utrecht University | |
dc.language.iso | EN | |
dc.subject | Many coarse data problems cannot be solved with standard conditioning.
A coarse data problem can be rewritten as a probability updating game:
a zero-sum game between a host and a contestant.
If the RCAR condition is satisfied, probabilities can be updated robustly.
We study whether RCAR still characterises Nash equilibria when the zero-sum - or the one-shot constraint are relaxed. | |
dc.title | Investigating relaxed probability updating games | |
dc.type.content | Master Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | probability; updating; game; reinforcement, learning; proximal policy optimization; proximal policy optimisation; dirichlet; repeated game; multi-agent; coarse data; coarse data problem; | |
dc.subject.courseuu | Computing Science | |
dc.thesis.id | 2890 | |