Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorNguyen, Dennis
dc.contributor.authorBeek, Rens van der
dc.date.accessioned2024-08-29T00:02:14Z
dc.date.available2024-08-29T00:02:14Z
dc.date.issued2024
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/47452
dc.description.abstractMethodologies and data are not currently instantly extractable from digital humanities papers. Biology and medicine have a tradition of using named entity recognition to tackle this lacuna. In this thesis, a named entity recognition model, trained on a sample of 197 relevant annotated five-sentence windows extracted from a broad corpus of 692 English-language Digital Humanities Quarterly articles to label entities as either ‘methodology’, ‘method’, ‘tool’, ‘task’ or ‘data’, is employed to multifocally read digital humanities methodologies. Multifocal reading practices are found to be promising, but performance and external validity of the named entity recognition model are lacking.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectIn this thesis, named entity recognition is used to map methods, methodologies, tools tasks and data within digital humanities papers. An NER model is trained on a corpus of Digital Humanities Quarterly papers. Multiple visualization techniques are devised to multifocally read papers.
dc.titleUsing Named Entity Recognition to Map Methods, Methodologies, Tools, Tasks and Data in the Digital Humanities
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsannotation; keyword matching; named entity recognition; multifocal reading; digital humanities; methodology; method; tool; task; data
dc.subject.courseuuApplied Data Science
dc.thesis.id38101


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record