Towards Increasing Robustness Against Occlusions for Preterm Infant Pose Estimation in Videos

Navarro San Martin, Roberto

dc.rights.license	CC-BY-NC-ND
dc.contributor.advisor	Poppe, Ronald
dc.contributor.author	Navarro San Martin, Roberto
dc.date.accessioned	2022-09-09T02:03:29Z
dc.date.available	2022-09-09T02:03:29Z
dc.date.issued	2022
dc.identifier.uri	https://studenttheses.uu.nl/handle/20.500.12932/42572
dc.description.abstract	Preterm infant birth rates are rising globally; the causes and implications are not yet fully understood. However, it is clear that preterm infants are more likely than full-term infants to develop a range of developmental disorders. Because these infants are so vulnerable, they require extensive monitoring and supervision, which is often performed in Neonatal Intensive Care Units (NICUs). Current techniques for monitoring infant activity are obtrusive: they require needles and electrodes, which can be painful or uncomfortable for preterm infants. Recently, there has been a surge in unobtrusive monitoring techniques, in particular video-based approaches. These approaches rely on behavioral signals present in video, which can be captured by estimating the pose and motion of the infants. Current state-of-the-art (SOTA) systems rely on models trained predominantly on adults to estimate the pose and motion of infants, which leads to significantly worse performance in downstream tasks. Additionally, infant data is often extremely hard to collect and of low quality, with poor lighting conditions, severe perspective distortions and occlusions; this lack of data makes it hard to train deep-learning models, which are known to be data-reliant. Because so little data is available, this research created the Synthetic (and real) Preterm Infant Sequences (SPIS) dataset by leveraging SMIL, a vertex-based statistical volumetric model, and SMPLify, a 2D-to-3D lifting approach, to create augmented sequences of synthetic infants. This dataset was then used to train the Preterm Infant Pose Estimator (PIPE) model for infant motion modeling. Additionally, to tackle the challenges of occlusions in the preterm infant domain, an occlusion augmentation module for Temporal Convolutional Networks was developed. An ablation study was performed to validate the performance of PIPE for medical applications.
This work found that the occlusion augmentation technique was not sufficient for the preterm infant domain. Additionally, the results indicated that fine-tuning the pose estimation and temporal convolutional networks to preterm infant motion improved the performance of the PIPE architecture significantly, indicating that further work on pose estimation in the preterm infant domain is required. Overall, the PIPE architecture did not achieve results sufficient for use in subsequent tasks in the preterm infant domain.
dc.description.sponsorship	Utrecht University
dc.language.iso	EN
dc.subject	The aim of the research was to increase the robustness of deep learning models (in particular Temporal Convolutional Networks) for preterm infant pose estimation in videos. The goal of the thesis was to develop a preterm infant pose estimator for downstream tasks in the medical domain.
dc.title	Towards Increasing Robustness Against Occlusions for Preterm Infant Pose Estimation in Videos
dc.type.content	Master Thesis
dc.rights.accessrights	Open Access
dc.subject.keywords	Computer Vision, Infants, NICU, TCN, CNN, Pose Estimation, Deep Learning, AI
dc.subject.courseuu	Artificial Intelligence
dc.thesis.id	9915

Files in this item

Name:: Thesis_Final_AI_Roberto_Navarr ...
Size:: 10.42Mb
Format:: PDF
