dc.description.abstract | I define two approaches to rule-based AI Safety: the letter-based approach, which constrains an agent’s behavior to satisfy a set of static conditions, and the spirit-based approach, which has the agent act in accordance with what those rules intended. I explore the conditions under which a letter-based approach is insufficient. I then describe one prominent letter-based approach to AI Safety, describe how it represents rules in STIT logic, and offer a mechanism for inferring from those rules a generalization that aims to approximate their intention, using a version space learning algorithm. I finish with a small experiment. | |
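The abstract names version space learning as the generalization mechanism. As a generic illustration only (not the thesis's actual rule representation or algorithm), a minimal candidate-elimination sketch over conjunctive attribute hypotheses might look like the following; the attribute domains and the '?' wildcard convention are assumptions of this sketch.

```python
def covers(h, x):
    """A hypothesis covers an example if every non-wildcard attribute matches."""
    return all(a == '?' or a == v for a, v in zip(h, x))

def candidate_elimination(examples, domains):
    """Maintain the specific boundary S and general boundary G of the
    version space over labeled examples (x, is_positive)."""
    n = len(domains)
    S = None                               # most specific hypothesis seen so far
    G = [tuple('?' for _ in range(n))]     # most general boundary
    for x, positive in examples:
        if positive:
            # Generalize S just enough to cover x; prune inconsistent g's.
            S = x if S is None else tuple(
                s if s == v else '?' for s, v in zip(S, x))
            G = [g for g in G if covers(g, x)]
        else:
            # Specialize each g in G minimally so it excludes x,
            # while staying at least as general as S.
            new_G = []
            for g in G:
                if not covers(g, x):
                    new_G.append(g)
                    continue
                for i in range(n):
                    if g[i] == '?':
                        for val in domains[i]:
                            if val != x[i] and (S is None or S[i] in ('?', val)):
                                new_G.append(g[:i] + (val,) + g[i + 1:])
            G = new_G
    return S, G

# Toy run with two attributes (Sky, Temp) and hypothetical domains:
domains = [('Sunny', 'Rainy'), ('Warm', 'Cold')]
examples = [(('Sunny', 'Warm'), True), (('Rainy', 'Cold'), False)]
S, G = candidate_elimination(examples, domains)
```

After these two examples, S is the single positive example and G holds the minimal specializations that exclude the negative one.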