O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

31
OBJECT DETECTION WITH DISCRIMINATIVELY TRAINED PART BASED MODELS PRESENTED BY Xiaolong Wang

Transcript of O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

Page 1: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

OBJECT DETECTION WITH DISCRIMINATIVELY TRAINED PART BASED

MODELS

PRESENTED BY

Xiaolong Wang

Page 2: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

DETECTION

Page 3: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

CHALLENGE

• Deformation

Part of the Slides From Ross Girshick

Page 4: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

CHALLENGE

• Viewpoint

Page 5: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

CHALLENGE

• Variable structure

Page 6: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

CHALLENGE

Images from Chaitanya Desai

Page 7: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

• 2-layer Model

• Deformable

DEFORMABLE PART MODELS

Leo Zhu, CVPR 2010

Page 8: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

HOG PYRAMID

Root Filter

Part Filters

Page 9: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

FORMULATIONOne root (i=0) + n parts.

Model Parameters for HOG

HOG Features Model Parameters for Deformation

Page 10: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

INFERENCE

Page 11: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

MULTI-VIEWS

Page 12: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

LATENT ORIENTATION

• No orientation in PAMI paper (DPM v3)

• Use latent orientation (DPM v4) Guess what is it?

right-facing horse

Page 13: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

UNSUPERVISED ORIENTATION CLUSTERING

Page 14: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

LATENT ORIENTATION

• Inference: Choose the best view and best orientation.

• Learning: Train the parameters for 3 views, and flip the weights to get 3*2 views.

Page 15: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

HOW IMPORTANT IT IS

One view:42.1% 3-view: 47.3% 3*2-view: 56.8%

• For horse:

Page 16: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

HOW IMPORTANT IT IS

• For all classes (DPM v4):

Page 17: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

LEARNING

• Linear Formulation Putting all features in one vector Latent variable z represents part locations (and

component index for multi-views)

Page 18: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

LATENT SVM

• Iterative Algorithm with 2 steps: Calculate the latent variables (fixed ) Optimize the model parameters (fixed z).

Page 19: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

LATENT SVM

• Detection on Positive Samples Sliding window Overlap with root-node window > 0.7

Page 20: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

LATENT SVM

• Hard Negative Mining

Carl Vondrick HOGgles, ICCV 2013

Page 21: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

LATENT SVM

• Hard Negative Mining Small or no overlap High detection score

• Maintaining Sample Cache Select no more than 500 negative samples per image; Cache size = 20000

Page 22: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

LATENT SVM

• Dual Method Not scalable.

• Stochastic gradient descent(DPM v4) Important: Shuffle everytime!

• LBFGS(DPM v5) Second-order Newton Method Faster & better performance

Page 23: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

3-STEP INITIALIZATION

• Step-1: Only Train Root Filter positive data (highest overlap) No hard negative mining

Car

Page 24: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

3-STEP INITIALIZATION

• Step-2: Merg Components Setting root selection as latent variable

Page 25: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

3-STEP INITIALIZATION

• Step-3: Initialize Part Filters Fix part number as 8 (DPM v4/5) Sliding window, calculate L1/L2 norm of the positive

weights.

Page 26: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

POST PROCESSING

• Bounding Box Regression Linear regression for (x1,y1,x2,y2)

• Non-Maximum Suppression Pick up high score boxes

• Context

Page 27: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

CONTEXT

Marr Prize 2009

Context SVM,CVPR2010

segDPM,CVPR2013

Page 28: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

NUMBERS

VOC 2010: 29.6 and 32.2

VOC 2007: 33.7 and 35.4

VOC 2010: segDPM(with tons of things) 40.4

Page 29: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

LARGE-SCALE DATASET

• ImageNet 2013

DPM v4 in cpp

Page 30: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

SUMMARY

• Although DPMs is loosing to CNNs, the techniques and small tricks we learned from DPMs help solving many other vision problems.

Page 31: O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

QUESTIONS