dc.rights.license | CC-BY-NC-ND | |
dc.contributor.advisor | Odijk, Jan | |
dc.contributor.author | Graaf, E.W. de | |
dc.date.accessioned | 2013-09-05T17:02:13Z | |
dc.date.available | 2013-09-05 | |
dc.date.available | 2013-09-05T17:02:13Z | |
dc.date.issued | 2013 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/14554 | |
dc.description.abstract | In this bachelor's thesis I research the possibility of finding multi-word expressions (MWEs) in a large Dutch corpus called LASSY using the statistical distance methods called Point Mutual Information and Salience. I compare the achieved results with results from an already existing toolkit called the MWEtoolkit. The conclusion is that MWEs can not be detected with the chosen methods in the chosen way, but that the method might be useful when combined with other detection methods. | |
dc.description.sponsorship | Utrecht University | |
dc.format.extent | 159010 bytes | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | |
dc.title | Automatic Recognition of Multi-Word Expressions in Dutch | |
dc.type.content | Bachelor Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | multi-word expression, pointwise mutual information, salience, lassy | |
dc.subject.courseuu | Kunstmatige Intelligentie | |