Automatic classification of orders lines in joint Dealer Management Systems
dc.rights.license | CC-BY-NC-ND | |
dc.contributor.advisor | Feelders, Ad | |
dc.contributor.advisor | Siebes, Arno | |
dc.contributor.author | Lentink, M.D. | |
dc.date.accessioned | 2016-09-15T17:00:51Z | |
dc.date.available | 2016-09-15T17:00:51Z | |
dc.date.issued | 2016 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/24303 | |
dc.description.abstract | In this thesis we looked at different data originating from several Dealer Management Systems. By comparing the different data we tried to find a field set that can be used as features for our classifier models of receipts. We found that this data can be uniformed well by taking the few fields an intersection of the fields of all Dealer Management Systems yield. When we add some extra fields with slight manipulations we created a data set that has high potential for machine learning classifications. Different set ups showed F1 scores for classification well above 90% through three data sets with four learning models. Further we introduce new options in attempt to improve the classification rate further. We used our domain knowledge for the construction of smart token detectors and construct a unique compound word splitting algorithm for splitting Dutch compound words. | |
dc.description.sponsorship | Utrecht University | |
dc.format.extent | 920332 | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | |
dc.title | Automatic classification of orders lines in joint Dealer Management Systems | |
dc.type.content | Master Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | classification; rdc; dealer management system; data mining | |
dc.subject.courseuu | Computing Science |