
dc.rights.license: CC-BY-NC-ND
dc.contributor.advisor: Siebes, prof. dr. A.P.J.M.
dc.contributor.author: Singh, J.
dc.date.accessioned: 2018-08-24T17:00:39Z
dc.date.available: 2018-08-24T17:00:39Z
dc.date.issued: 2018
dc.identifier.uri: https://studenttheses.uu.nl/handle/20.500.12932/30523
dc.description.abstract: This master's thesis presents non-disjoint clustering algorithms based on the Minimum Description Length (MDL) principle. The algorithms capture the underlying distribution from different perspectives by compressing the data using a series of code tables. A cover algorithm describes how to compress the database using a code table. Every code table is grown iteratively until compression no longer improves. Experiments show that the algorithms are able to identify structure in the data, because the code tables compress the data to some extent. Clustering experiments show that the general structure is captured by all obtained code tables, and that the groups of patterns that are dissimilar to the general patterns are captured by different code tables. This confirms that the code tables view the data from different perspectives. The classification experiments show that, given the class labels, the code tables are dissimilar enough to capture the different characteristics of the classes. Without the class labels, the algorithms are able to find the differences between the classes when the support is sufficiently low. It is also possible to identify multi-valued dependencies in the data. This is the case when code tables in a single iteration are anti-chains and later end up in the same code table.
dc.description.sponsorship: Utrecht University
dc.format.extent: 1012137
dc.format.mimetype: application/pdf
dc.language.iso: en
dc.title: Ensemble of Code Tables
dc.type.content: Master Thesis
dc.rights.accessrights: Open Access
dc.subject.keywords: Minimum Description Length, MDL, Code table, krimp, slim, groei, kolmogorov
dc.subject.courseuu: Computing Science

