Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorBroersen, Jan
dc.contributor.authorAlfrink, T.A.
dc.date.accessioned2019-08-27T17:00:48Z
dc.date.available2019-08-27T17:00:48Z
dc.date.issued2019
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/33698
dc.description.abstractI define two approaches to rule-based AI Safety: the letter-based approach, which is to simply constrain an agent’s behavior to satisfy a set of static conditions, and the spirit-based approach, which is to somehow let the agent act in accordance with what those rules intended. I explore the conditions under which a letter-based approach is insufficient. Then I describe one prominent letter-based approach to AI Safety,describe how it represents rules in STIT logic, and offer a mechanism for inferring a generalization from those rules that aims to approximate their intention. For that I use a version space learning algorithm. I finish with a small experiment.
dc.description.sponsorshipUtrecht University
dc.format.extent1425814
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.titleDeriving the spirit of the law
dc.type.contentBachelor Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsai safety, ai alignment
dc.subject.courseuuKunstmatige Intelligentie


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record