Can non-retracted published research articles be differentiated from research articles that are retracted due to error and misconduct?

Lindenmeyer, Arleen

View/Open

Thesis_UU_A_J_Lindenmeyer_0689009.pdf (1.321Mb)

Publication date

2022

Author

Lindenmeyer, Arleen

Metadata

Show full item record

Summary

To retain and raise trust in science, it is essential to correct misinformation promptly, and even better to prevent the publication of incorrect information, to begin with. Taking a technical approach, this study attempts to address this critical issue of misinformation and trust in science by building models with the ability to classify retracted and non-retracted published scientific articles. These classifiers could be used by institutions to detect papers containing misinformation before they are published. Further, this study highlights the advantage of differentiating between scientific articles that have been retracted due to error and scientific articles that have been retracted due to misconduct. With this distinction, a Logistic Regression classifier was able to achieve an F1 weighted test score of 0.75 and an external validation score of 0.67.

URI

https://studenttheses.uu.nl/handle/20.500.12932/42424

Collections

Theses