Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorQahtan, Hakim
dc.contributor.authorTzikas, Rigas
dc.date.accessioned2022-11-01T01:01:41Z
dc.date.available2022-11-01T01:01:41Z
dc.date.issued2022
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/43132
dc.description.abstractMissing values represent one of the most common challenges for data analytics tasks. For that reason, a lot of techniques have been proposed to fill the missing values through what is called ”Data Imputation”. Recent studies on generating synthetic data demonstrate that Generative Adversarial Networks (GANs) can be used to effectively solve this problem as follows: for each example in the original data generate a synthetic example that keeps the existing values. The generated example should contain values for the features with missing values. However, to confirm if GANs can provide significant improvements over traditional data imputation techniques, we need a technique to measure the quality of the generated examples. The quality of the generated example can be measured by determining how realistic the synthetic data is compared to the original examples. In this project, we develop a tool for successfully measuring the quality of the synthetic data. We compare the quality of the generated data using GANs to other synthetic data generation techniques.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectHow realistic is my synthetic data? A qualitative approach.
dc.titleHow realistic is my synthetic data? A qualitative approach.
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.courseuuApplied Data Science
dc.thesis.id11676


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record