View Item 
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        JavaScript is disabled for your browser. Some features of this site may not work without it.

        Browse

        All of UU Student Theses RepositoryBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

        Automatic classification of orders lines in joint Dealer Management Systems

        Thumbnail
        View/Open
        Thesis - Final.pdf (898.7Kb)
        Publication date
        2016
        Author
        Lentink, M.D.
        Metadata
        Show full item record
        Summary
        In this thesis we looked at different data originating from several Dealer Management Systems. By comparing the different data we tried to find a field set that can be used as features for our classifier models of receipts. We found that this data can be uniformed well by taking the few fields an intersection of the fields of all Dealer Management Systems yield. When we add some extra fields with slight manipulations we created a data set that has high potential for machine learning classifications. Different set ups showed F1 scores for classification well above 90% through three data sets with four learning models. Further we introduce new options in attempt to improve the classification rate further. We used our domain knowledge for the construction of smart token detectors and construct a unique compound word splitting algorithm for splitting Dutch compound words.
        URI
        https://studenttheses.uu.nl/handle/20.500.12932/24303
        Collections
        • Theses
        Utrecht university logo