dc.rights.license | CC-BY-NC-ND | |
dc.contributor.advisor | Nguyen, Dennis | |
dc.contributor.author | Beek, Rens van der | |
dc.date.accessioned | 2024-08-29T00:02:14Z | |
dc.date.available | 2024-08-29T00:02:14Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/47452 | |
dc.description.abstract | Methodologies and data are not currently instantly extractable from digital humanities papers. Biology and medicine have a tradition of using named entity recognition to tackle this lacuna. In this thesis, a named entity recognition model, trained on a sample of 197 relevant annotated five-sentence windows extracted from a broad corpus of 692 English-language Digital Humanities Quarterly articles to label entities as either ‘methodology’, ‘method’, ‘tool’, ‘task’ or ‘data’, is employed to multifocally read digital humanities methodologies. Multifocal reading practices are found to be promising, but performance and external validity of the named entity recognition model are lacking. | |
dc.description.sponsorship | Utrecht University | |
dc.language.iso | EN | |
dc.subject | In this thesis, named entity recognition is used to map methods, methodologies, tools tasks and data within digital humanities papers. An NER model is trained on a corpus of Digital Humanities Quarterly papers. Multiple visualization techniques are devised to multifocally read papers. | |
dc.title | Using Named Entity Recognition to Map Methods, Methodologies, Tools, Tasks and Data in the Digital Humanities | |
dc.type.content | Master Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | annotation; keyword matching; named entity recognition; multifocal reading; digital humanities; methodology; method; tool; task; data | |
dc.subject.courseuu | Applied Data Science | |
dc.thesis.id | 38101 | |