Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorSiebes, Dr. A.P.J.M.
dc.contributor.authorDorrestijn, J.E.G.
dc.date.accessioned2018-08-24T17:00:39Z
dc.date.available2018-08-24T17:00:39Z
dc.date.issued2018
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/30524
dc.description.abstractMany methods exist to avoid disclosing sensitive information when releasing a database. However these methods either cannot guarantee that the information of individuals is secure or are aimed at specific use cases. In this paper we develop a method which is both provably private and retains the overall form of the original database. To achieve this we derive a privacy measure, epsilon-dependence. Intuitively, epsilon-dependence requires that the input and output databases are nearly independent. We show that epsilon-dependence can be seen as an information theoretic refinement of differential privacy. We then adapt the KRIMP algorithm to generate databases while satisfying epsilon-dependence. We show through experiments that the generated databases are comparable to the original databases when performing machine learning or itemset mining tasks. The results are especially good on larger databases.
dc.description.sponsorshipUtrecht University
dc.format.extent341512
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.titleProvable Privacy for Database Generation: an Information Theoretic Approach
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.courseuuComputing Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record