Max-Planck-Institut für Informatik
max planck institut
mpii logo Minerva of the Max Planck Society

MPI-INF or MPI-SWS or Local Campus Event Calendar

<< Previous Entry Next Entry >> New Event Entry Edit this Entry Login to DB (to update, delete)
What and Who
Title:Articulated People Detection and Pose Estimation in Challenging Real World Environments
Speaker:Leonid Pishchulin
coming from:Max-Planck-Institut für Informatik - D2
Speakers Bio:
Event Type:Promotionskolloquium
Visibility:D1, D2, D3, D4, D5, RG1, SWS, MMCI
We use this to send out email in the morning.
Level:Public Audience
Date, Time and Location
Date:Tuesday, 31 May 2016
Duration:60 Minutes
Building:E1 4
In this thesis we are interested in the problem of articulated people detection and pose

estimation being key ingredients towards understanding visual scenes containing
people. Although extensive efforts are being made to address these problems, we
identify three promising directions that, we believe, didn’t get sufficient attention
First, we investigate how statistical 3D human shape models from computer
graphics can be leveraged to ease training data generation. We propose a range
of automatic data generation techniques that allow to directly represent relevant
variations in the training data. Sampling from both the underlying human shape
distribution and a large dataset of human poses allows to generate novel samples
with controllable shape and pose variations that are relevant for the task at hand.
Furthermore, we improve the state-of-the-art 3D human shape model itself by rebuilding
it from a large commercially available dataset of 3D bodies.
Second, we develop expressive spatial and strong appearance models for 2D
single- and multi-person pose estimation. We propose an expressive single person
model that incorporates higher order part dependencies while remaining efficient.
We augment this model with various types of strong appearance representations
aiming to substantially improve the body part hypotheses. Finally, we propose
an expressive model for joint pose estimation of multiple people. To that end, we
develop strong deep learning based body part detectors and an expressive fully
connected spatial model. The proposed approach treats multi-person pose estimation
as a joint partitioning and labeling problem of a set of body part hypotheses: it infers
the number of persons in a scene, identifies occluded body parts and disambiguates
body parts between people in close proximity of each other.
Third, we perform thorough evaluation and performance analysis of leading
human pose estimation and activity recognition methods. To that end we introduce a
novel benchmark that makes a significant advance in terms of diversity and difficulty,
compared to the previous datasets, and includes over 40, 000 annotated body poses
and over 1.5M frames. Furthermore, we provide a rich set of labels which are used to
perform a detailed analysis of competing approaches gaining insights into successes
and failures of these methods.
In summary, this thesis presents a novel approach to articulated people detection
and pose estimation. Thorough experimental evaluation on standard benchmarks
demonstrates significant improvements due to the proposed data augmentation techniques
and novel body models, while detailed performance analysis of competing
approaches on our newly introduced large-scale benchmark allows to identify the
most promising directions of improvement.

Name(s):Connie Balzert
Phone:0681 9325-2000
Video Broadcast
Video Broadcast:NoTo Location:
Tags, Category, Keywords and additional notes
Attachments, File(s):

Created by:Connie Balzert/MPI-INF, 04/11/2016 12:29 PMLast modified by:Uwe Brahm/MPII/DE, 11/24/2016 04:13 PM
  • Connie Balzert, 04/11/2016 12:29 PM -- Created document.