PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

2D Human Pose Estimation in TV Shows
Vittorio Ferrari, Manuel Marin-Jimenez and Andrew Zisserman
In: Statistical and Geometrical Approaches to Visual Motion Analysis LNCS , 5604 (5604). (2009) Springer , Germany , pp. 128-147.


The goal of this work is fully automatic 2D human pose estimation in unconstrained TV shows and feature films. Direct pose estimation on this uncontrolled material is often too difficult, especially when knowing nothing about the location, scale, pose, and appearance of the person, or even whether there is a person in the frame or not. We propose an approach that progressively reduces the search space for body parts, to greatly facilitate the task for the pose estimator. Moreover, when video is available, we propose methods for exploiting the temporal continuity of both appearance and pose for improving the estimation based on individual frames. The method is fully automatic and self-initializing, and explains the spatio-temporal volume covered by a person moving in a shot by soft-labeling every pixel as belonging to a particular body part or to the background. We demonstrate upper-body pose estimation by running our system on four episodes of the TV series {\em Buffy the vampire slayer} (i.e. three hours of video). Our approach is evaluated quantitatively on several hundred video frames, based on ground-truth annotation of 2D poses. Finally, we present an application to full-body action recognition on the Weizmann dataset.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Book Section
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Machine Vision
ID Code:5463
Deposited By:Vittorio Ferrari
Deposited On:24 September 2009