Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorGemeren, C.J. van
dc.contributor.advisorPoppe, R.W.
dc.contributor.advisorVeltkamp, R.C.
dc.contributor.authorWind, J.
dc.date.accessioned2017-04-26T17:02:46Z
dc.date.available2017-04-26T17:02:46Z
dc.date.issued2017
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/25779
dc.description.abstractHuman action recognition has received much attention during the last decade. Inspired by the work by Patron-Perez et al., we construct a pipeline that recognizes 7 person-to-person interactions from video. We focus on urban settings and use videos that have been recorded using consumer hand-held digital cameras. Full bodies are visible, but occlusions do occur over time. We use the part based models by Felzenszwalb et al. to detect each person in every frame of the video. Human detections are tracked over time using the human tracking framework by Choi et al. Similar to Wang et al., we compute dense trajectories for each track. The orientation of each person is estimated and pairs of people are formed that are simultaneously on screen. We classify the interaction of each pair using a multiclass SVM. We look into the performance of each part of the pipeline in order to understand its relative strengths and weaknesses. The human tracking approach leads to good results on the ETH data set, but the results on the Collective Activity data set leave room for improvement. We obtain an accuracy of 34.97% while we try to classify 8 classes of orientations of the Collective Activity data set. Many wrong classifications lie in closely related classes. For the final interaction recognition we obtain an overall accuracy of 53.57%. Orientation estimation has an important influence on the overall performance, but the influence of the dense trajectories and the distance between the targets is limited.
dc.description.sponsorshipUtrecht University
dc.format.extent18766031
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.titleHuman Interaction Recognition from Video
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordshuman interaction recognition, interaction recognition, action recognition, computer vision, machine learning
dc.subject.courseuuGame and Media Technology


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record