Pose Estimation in Video

Ursu, E.A.

dc.rights.license	CC-BY-NC-ND
dc.contributor.advisor	Tan, R.T.
dc.contributor.advisor	van der Aa, N.
dc.contributor.author	Ursu, E.A.
dc.date.accessioned	2013-09-19T17:02:00Z
dc.date.available	2013-09-19
dc.date.available	2013-09-19T17:02:00Z
dc.date.issued	2013
dc.identifier.uri	https://studenttheses.uu.nl/handle/20.500.12932/14906
dc.description.abstract	Human pose estimation in video has numerous applications, such as human activity analysis, automatic surveillance, human-computer interaction and markerless motion capture. It is challenging because of the kinematic structure of the human body and the variety of possible human poses, the endless appearance options caused by clothing and, finally, due to background clutter that can look like parts in the human body and confuse the system. Current methods in human pose estimation either focus on specific situations, such as pedestrians or laboratory controlled motions, or sacrifice accuracy in favour of coping with videos containing any type of human activity. What we will show in this thesis is an improved system built upon the method of [Ramanan et al., 2007], which models a person's body configuration as a puppet of rectangles. The system first analyses all the frames from a video to find a specific pose from which it learns the appearance of the person to be tracked. Then it processes the video to detect the person in any possible pose. We analysed the robustness of the original method by comparing pose estimations with labelled ground truth. We challenged the authors' claim that one set of parameters can fit multiple videos, which remains an open issue. Then, we extended the original method by including temporal information using two different types of motion models, which improved the tracking results. According to our qualitative evaluation of side-by-side tracking sequences, the new extensions resulted in more stable and accurate detections throughout time and are able to solve some challenging situations which arise when the motion is fast or body parts resemble each other. We found that the system performs poorly when detecting arms, due to their size, which remains the main problem to be solved in future work.
dc.description.sponsorship	Utrecht University
dc.format.extent	11428529 bytes
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.title	Pose Estimation in Video
dc.type.content	Master Thesis
dc.rights.accessrights	Open Access
dc.subject.keywords	pose estimation, video, pictorial structures, motion model
dc.subject.courseuu	Game and Media Technology

Files in this item

Name:: Elena_Ursu_Thesis.pdf
Size:: 10.89Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Theses

Show simple item record