Extended Video2Report Database: Object Detection & Medical Action Recognition for Medical Consultations.

Kuijpers, Bas

dc.rights.license	CC-BY-NC-ND
dc.contributor	Prof. Dr. Albert A. Salah; Prof. Dr. Sjaak Brinkkemper
dc.contributor.advisor	Salah, Albert
dc.contributor.author	Kuijpers, Bas
dc.date.accessioned	2022-02-10T00:00:41Z
dc.date.available	2022-02-10T00:00:41Z
dc.date.issued	2022
dc.identifier.uri	https://studenttheses.uu.nl/handle/20.500.12932/481
dc.description.abstract	Many healthcare professionals are burdened with a large administrative load, which raises the question of whether the current approach is sustainable, despite consensus that reporting improves the quality of healthcare. A portion of the administration can be automated by analyzing and documenting the events that take place during medical appointments. Computer vision is used to detect medical actions by analyzing the poses of both patient and care provider and by detecting medical objects. OpenPose (pose estimation) and Faster R-CNN ResNet-101 (object detection) are used, and the outputs of both models are processed and analyzed with two machine learning models, Random Forest and Long Short-Term Memory (LSTM). The Video2Report dataset containing videos of medical actions has been extended with various new actions, and existing action classes have been supplemented with additional recordings. An image dataset containing medical objects was collected (2117 images) and annotated (2956 annotations). Our experiments with object detection models did not result in improvements, possibly because of a scarcity of images resembling the actual usage scenario. The best performing model proved to be Random Forest, with a cross-validated test score of 75.43%. LSTM models reached an accuracy of 63.08%.
dc.description.sponsorship	Utrecht University
dc.language.iso	EN
dc.subject	This research aims to use action recognition to decrease the administrative burden within the healthcare sector. The Video2Report medical action dataset is extended with new recordings and additional classes. Pose estimation models (OpenPose) and object detection models (Faster R-CNN ResNet-101) are used to extract features for classification models (Random Forest and Long Short-Term Memory).
dc.title	Extended Video2Report Database: Object Detection & Medical Action Recognition for Medical Consultations.
dc.type.content	Master Thesis
dc.rights.accessrights	Open Access
dc.subject.keywords	Action Recognition; Computer Vision; Object Detection; Pose Estimation; Healthcare
dc.subject.courseuu	Business Informatics
dc.thesis.id	2182

Files in this item

Name: Thesis Bas Kuijpers - Video2Re ...
Size: 64.95Mb
Format: PDF

This item appears in the following Collection(s)

Theses