dc.rights.license | CC-BY-NC-ND | |
dc.contributor.advisor | Schoot, Rens van de | |
dc.contributor.author | Migliore, Giulia | |
dc.date.accessioned | 2024-07-24T23:04:50Z | |
dc.date.available | 2024-07-24T23:04:50Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/46882 | |
dc.description.abstract | Screening papers for systematic reviews is a resource-intensive and time-consuming process. This study aims to reduce the necessary resources by automating the title screening phase using Large Language Models (LLMs). Initially, prompt engineering is employed to identify the optimal prompt for the LLM. Subsequently, the performance of the LLM is evaluated against simpler machine learning models to determine its effectiveness in excluding irrelevant papers without false exclusions. The findings indicate that the Large Language Model outperforms simpler machine learning models in the title screening phase, accurately excluding 60% of the papers with only one false exclusion. These promising results suggest that LLMs can assist researchers in the title screening phase, significantly reducing time and costs while maintaining high screening quality. | |
dc.description.sponsorship | Utrecht University | |
dc.language.iso | EN | |
dc.subject | This study aims to reduce the necessary resources by automating the title screening phase using Large Language Models (LLMs). Initially, prompt engineering is employed to identify the optimal prompt for the LLM. Subsequently, the performance of the LLM is evaluated against simpler machine learning models to determine its effectiveness in excluding irrelevant papers without false exclusions. This way, researchers can significantly reduce time and costs while maintaining high screening quality. | |
dc.title | The application of LLM prompt engineering to optimize title screening | |
dc.type.content | Master Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | systematic reviews; title screening; LLMs; prompt engineering | |
dc.subject.courseuu | Applied Data Science | |
dc.thesis.id | 34903 | |