Describing People: A Poselet-Based Approach to Attribute Classification Lubomir Bourdev 1,2...

Describing People: A Poselet-Based Approach to

Attribute Classification

Lubomir Bourdev1,2 Subhransu Maji1

Jitendra Malik1

1EECS U.C. Berkeley 2Adobe Systems Inc.

Goal: Extract attributes from images of people

Who has long hair?

Who has short pants?

Male or female?

Prior work on poselets and on

attributes

Prior work on Poselets• Introduced by [Bourdev and Malik, ICCV09]• Detection with poselets [Bourdev et al, ECCV10]• Applications

• Segmentation [Brox et al, ECCV10] [Maire et al, ICCV 11]• Actions [Yang et al, CVPR10] [Maji et al, CVPR11] [Yao et

al, ICCV11]• Human parsing [Wang et al, CVPR11]• Semantic contours [Hariharan et al, ICCV11]• Subordinate level categorization [Farrell et al, ICCV11]

Prior work on AttributesAttributes as intermediate

Discovering attributes from text

Discovering attributes from images

Attributes from motion capture

Joint learning of classes &

attributes

Image retrieval with attributes

Attributes and actions

Active learning with attributes

Attributes of people

Gender attribute[Cottrell and Medcalfe, NIPS90] [Golomb et al, NIPS90] [Moghaddam & Yang, PAMI02] [Ferrari & Zisserman, NIPS07] [Kumar et al, ECCV08] [Gallagher and Chen, CVPR08] [Cao et al, ACM08] [Lampert et al, CVPR09] [Farhadi et al, CVPR 09] [Wang et al, BMVC09] [Wang and Forsyth, ICCV09] [Kumar et al, ICCV09] [Farhadi et al, CVPR10] [Berg et al, ECCV10] [Wang and Mori, ECCV10] [Sigal et al, ECCV10] [Branson el al, ECCV10] [Hwang et al, CVPR11] [Parikh and Grauman, CVPR11] [Douze et al, CVPR11] [Kovashka et al, ICCV11] [Liu et al, CVPR11] [Qiu et al, ICCV11] [Yao et al, ICCV11] [Dhar et al, CVPR11] [Parikh and Grauman, ICCV11] [Siddiquie et al, CVPR11]

attributes

Poselets for Attribute Classification

Male or female?

Gender recognition is easier if we factor out the

Poselets

[Bourdev & Malik ICCV09]

Poselets

Examples may differ visually but have common semantics

How do we train a poselet?

Finding correspondences at training time

Given part of a human pose

How do we find a similar pose configuration in the training set?

We use keypoints to annotate the joints, eyes, nose, etc. of people

Left Hip

Left Shoulder

Residual Error

Training poselet classifiers

Residual Error:

1. Given a seed patch2. Find the closest patch for every

other person3. Sort them by residual error4. Threshold them

Training poselet classifiers

1. Given a seed patch2. Find the closest patch for every

other person3. Sort them by residual error4. Threshold them5. Use them as positive training

examples to train a linear SVM with HOG features

Attribute Classification Algorithm at Test Time

Goal: Extract attributes of this person

Target person boundsBounds of other nearby people

Input:

Step 1: Detect poselet activations

[Bourdev et al, ECCV10]

Step 2: Cluster the activations

Step 3: Predict person bounds

Step 4: Identify the correct cluster

Max-flow in bipartite graph

PoseletActivations

Start with its poselet activations

Features

FeaturesFeatures

PoseletActivations

• Pyramid HOG• LAB histogram• Skin features

• Hands-skin• Legs-skin

Poseletpatch

B .* CSkinmask

Armsmask

PoseletActivations

Features

Poselet-levelAttributeClassifiers

Attribute Classification Overview

PoseletActivations

Features

Person-levelAttributeClassifiers

PoseletActivations

Features

Person-levelAttributeClassifiers

Context-levelAttributeClassifiers

Results

Our dataset• Source: VOC 2010 trainval for Person + H3D

• ~8000 annotations (4000 train + 4000 test)

• 9 binary attributes specified by 5 independent annotators via AMT

• Ground truth label: If 4 of the 5 agree

• Dataset will be made publicly available

Visual search on our test set

“Female”

“Wears hat”

“Has long hair”

“Wears glasses”

“Wears shorts”

“Has long sleeves”

“Doesn’t have long sleeves”

Our baseline• Canny-modulated HOG with SPM kernel [Lazebnik et al

CVPR06]

• To help the baseline trained separate SPM for four viewpoints:

• For each attribute we pick the best SPM as our baseline

Full view Head zoom Upper body Legs

Precision/recall on our test setLabel

frequency- - - -

___SPM

___No context

___FullModel

State-of-the-art Gender Recognition

• We outperform Cognitec (top-notch face recognizer)

• We outperform any gender recognizer based on frontal faces (are there others?)• 61% of our test have frontal faces.• Even with perfect classification of frontal faces,

max AP=80.5% vs. our AP of 82.4%

Men most confused as women

Confusions

Women most confused as menbaseball hat

long hair

hair hidden

Short pants most confused to be long pants

Non-T-shirt most confused to be T-shirt

annotationerrors

Are these pants short?

wrong person

occlusion

Best poselets per attribute

Gender:

Long Hair:

Wears glasses:

“A woman with long hair, glasses and long pants”(??)

We can describe a picture of a person

Conclusion

How poselets help in high-level vision

The image is a complex function of the viewpoint, pose,

appearance, etc.

Poselets decouple pose and camera view

from appearance

Google “poselets” to get:

• The set of published poselet papers• H3D data set + Matlab tools• Java3D annotation tool + video tutorial• Matlab code to detect people using poselets• Our latest trained poselets

“A man with short hair, glasses, short sleeves and shorts”

“A man with short hair and long sleeves”“A person with short hair,

no hat and long sleeves”

“A woman with long hair, glasses, short sleeves andlong pants”

“A person with long pants”

“A computer vision professor who likes machine learning”

Failure modePoselets website

http://eecs.berkeley.edu/~lbourdev/poselets

• The set of published poselet papers• H3D data set + Matlab tools• Java3D annotation tool + video tutorial• Matlab code to detect people using poselets• Our latest trained poselets

Describing People: A Poselet-Based Approach to Attribute Classification Lubomir Bourdev 1,2...

Documents

Transcript of Describing People: A Poselet-Based Approach to Attribute Classification Lubomir Bourdev 1,2...

Efficient Classification for Additive Kernel SVMsacberg.com/papers/mbm2012pami.pdf1 Efficient Classification for Additive Kernel SVMs Subhransu Maji 1, Alexander C. Berg2, and

Subhransu Maji - University of Massachusetts Amherstsmaji/cmpsci670/slides/lec22... · 2016. 12. 1. · CMPSCI 670 Subhransu Maji (UMASS) Take the outputs of various layers conv5,

Poselets Body Part Detectors Trained Using 3D Human Pose Annotations - Bourdev, Malik - Proceedings of International Conference on Computer Vision - 2009

Describing Textures in the Wild...Describing Textures in the Wild Mircea Cimpoi University of Oxford mircea@robots.ox.ac.uk Subhransu Maji Toyota Technological Institute smaji@ttic.edu

Subhransu Maji - UMass Amherstsmaji/promotion/research.pdfSubhransu Maji Research Statement 1Overview I am a computer scientist who conducts research in the areas of computer vision

lec04 hh kinect - University of Massachusetts Amherst · lec04_hh_kinect.pptx Author: Subhransu Maji Created Date: 4/27/2016 2:59:17 PM ...

Pedestrian Detection based on Clustered Poselet

Describing People: A Poselet-Based Approach to Attribute … · 2015. 1. 1. · Describing People: A Poselet-Based Approach to Attribute Classiﬁcation ∗ Lubomir Bourdev1,2, Subhransu

Poselet Key-framing: A Model for Human Activity Recognition€¦ · of keyframes. Second, it is tolerant to variations in ac-tion duration, as our model only assumes partial ordering

Real-Time Adaptive Image Compression - arXiv · Real-Time Adaptive Image Compression Oren Rippel * 1Lubomir Bourdev Abstract We present a machine learning-based approach to lossy

Predictive Neural Network Models For Autonomous Vehicle ... · Subhransu Mishra 1, Gowtham Garimella , Christoffer Heckman2, and Marin Kobilarov Abstract—This work considers the

availablpublicly dataset The e. is · 2015. 5. 24. · [1]Lubomir Bourdev and Jitendra Malik. Poselets: Body part detectors trained using 3D human pose annotations. In ICCV, 2009.

SIP Report Subhransu

Detecting People Using Mutually Consistent Poselet Activations · Detecting People Using Mutually Consistent Poselet Activations? Lubomir Bourdev 1;2, Subhransu Maji , Thomas Brox

Http://mscoco.org Ross Girshick Microsoft Research Tsung-Yi Lin Cornell Tech Michael Maire TTI Chicago Serge Belongie Cornell Tech Lubomir Bourdev Facebook.

JAFARNAMA BY Dr. Subhransu Roy

Efficient Run-Time Dispatching in Generic Programming with Minimal Code Bloat Lubomir Bourdev Advanced Technology Labs Adobe Systems Jaakko Järvi Computer.

Multiresolution Tree Networks for Point Cloud Procesing · 2018. 8. 28. · Matheus Gadelha, Rui Wang and Subhransu Maji College of Information and Computer Sciences, University of

Bottom-up Segmentation for Top-down Detectionttic.uchicago.edu/~rurtasun/publications/fidler_segdpm...for people by aligning the masks obtained for each Poselet [4], [23] integrates

Poselets: Body Part Detectors trained Using 3D Human Pose Annotations Lubomir Bourdev & Jitendra Malik ICCV 2009.