Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorOmmen, Thijs van
dc.contributor.authorAldaibis, Collin
dc.date.accessioned2022-03-16T00:00:52Z
dc.date.available2022-03-16T00:00:52Z
dc.date.issued2022
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/610
dc.description.abstractDigital data is everywhere; it is the backbone of science and our modern society. But data is sometimes incomplete. A complex form of incomplete data is when data is coarse. Many coarse data problems cannot be solved with standard conditioning. The problem can be reformulated as a probability updating game: a zero-sum game between a host and a contestant. An instance of a probability updating game is made from rewriting the Monty Hall problem as a game. It is proved that if the host plays a strategy that satisfies the RCAR condition, it plays worst-case optimally and the probabilities can be updated robustly for the contestant. We study whether RCAR still characterises Nash equilibria when the zero-sum constraint or the one-shot constraint of these games are removed. We found that if RCAR characterises optimality for a zero-sum, one-shot probability updating game, it also characterises optimality for the finitely repeated game. Moreover, we conclude from empirical analysis that if RCAR characterises optimality for a zero-sum probability updating game, it may also characterise optimality for a moderately competitive non-zero-sum game.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectMany coarse data problems cannot be solved with standard conditioning. A coarse data problem can be rewritten as a probability updating game: a zero-sum game between a host and a contestant. If the RCAR condition is satisfied, probabilities can be updated robustly. We study whether RCAR still characterises Nash equilibria when the zero-sum - or the one-shot constraint are relaxed.
dc.titleInvestigating relaxed probability updating games
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsprobability; updating; game; reinforcement, learning; proximal policy optimization; proximal policy optimisation; dirichlet; repeated game; multi-agent; coarse data; coarse data problem;
dc.subject.courseuuComputing Science
dc.thesis.id2890


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record