Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorFeelders, Ad
dc.contributor.advisorSiebes, Arno
dc.contributor.authorLentink, M.D.
dc.date.accessioned2016-09-15T17:00:51Z
dc.date.available2016-09-15T17:00:51Z
dc.date.issued2016
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/24303
dc.description.abstractIn this thesis we looked at different data originating from several Dealer Management Systems. By comparing the different data we tried to find a field set that can be used as features for our classifier models of receipts. We found that this data can be uniformed well by taking the few fields an intersection of the fields of all Dealer Management Systems yield. When we add some extra fields with slight manipulations we created a data set that has high potential for machine learning classifications. Different set ups showed F1 scores for classification well above 90% through three data sets with four learning models. Further we introduce new options in attempt to improve the classification rate further. We used our domain knowledge for the construction of smart token detectors and construct a unique compound word splitting algorithm for splitting Dutch compound words.
dc.description.sponsorshipUtrecht University
dc.format.extent920332
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.titleAutomatic classification of orders lines in joint Dealer Management Systems
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsclassification; rdc; dealer management system; data mining
dc.subject.courseuuComputing Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record