Agent-Based Pyramid: A Model for Augmenting ASReview's Capability in Outlier Detection

Ding, Zeyu

dc.rights.license	CC-BY-NC-ND
dc.contributor	Marco van Angeren
dc.contributor.advisor	Schoot, Rens van de
dc.contributor.author	Ding, Zeyu
dc.date.accessioned	2025-08-21T00:01:41Z
dc.date.available	2025-08-21T00:01:41Z
dc.date.issued	2025
dc.identifier.uri	https://studenttheses.uu.nl/handle/20.500.12932/49818
dc.description.abstract	The exponential growth of academic literature presents significant challenges for researchers conducting systematic reviews, particularly in fields requiring exhaustive coverage such as medicine. This paper presents an improved algorithm based on ASReview for identifying outlier papers that might be overlooked by traditional screening methods. While machine learning-based active learning systems excel at finding similar documents, they often miss relevant papers with diverse characteristics in sparse feature regions. We propose an Agent-Based Pyramid (ABP) structure that dynamically complements ASReview with systematic exploration capabilities. ABP operates in two modes after the stopping rule for the active learning has been met: (1) a manual mode where researchers input keywords for targeted exploration based on domain expertise, and (2) an automated mode that extracts predicted words of importance from ASReview's Random Forest classifier. The system monitors performance and automatically switches to agent-based exploration when detecting potential blind spots using these predicted words of importance to separate them from the manual keywords and to avoid any confusion with the RF model also being used for the prediction of relevance of records. Documents are organized hierarchically based on outlier scores calculated from either user keywords or machine-learned features. Main innovations include flexible dual-mode operation (manual/automated), adaptive ML-agent switching, and hierarchical document organization with election-based prioritization. Experiments on SYNERGY datasets show superior performance compared to pure ASReview, with improved recall curves for finding 100% relevant papers in simulations across most datasets. The manual mode provides essential control for domain-specific reviews, while the automated mode demonstrates the framework's effectiveness without human intervention. This versatility makes ABP a valuable tool for comprehensive systematic reviews in high-stakes domains where missing critical studies has serious consequences.
dc.description.sponsorship	Utrecht University
dc.language.iso	EN
dc.subject	This study introduces Agent-Based Pyramid (ABP), an improved algorithm that enhances ASReview to identify outlier papers often missed by traditional screening methods. While ML-based active learning excels at finding similar documents, it struggles with relevant papers in sparse feature regions.
dc.title	Agent-Based Pyramid: A Model for Augmenting ASReview's Capability in Outlier Detection
dc.type.content	Master Thesis
dc.rights.accessrights	Open Access
dc.subject.keywords	active learning, ASReview, Outlier, systematic review, Agent-Based Pyramid (ABP), machine learning
dc.subject.courseuu	Applied Data Science
dc.thesis.id	52050

Files in this item

Name:: Agent-Based Pyramid- A Model ...
Size:: 1.955Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Theses

Show simple item record