A Discriminatively Trained, Multiscale, Deformable Part Model

CS381V Visual Recognition - Paper Presentation

by Pedro Felzenszwalb, David McAllester, and Deva Ramanan

Slide credit: Duan Tran

Deformable Part Model

Slide credit: Pedro F. Felzenszwalb

Deformable Part Model

Root Filter

Deformation Model

Part Filters

Felzenszwalb, Pedro, David McAllester, and Deva Ramanan. "A discriminatively trained, multiscale, deformable part model." Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, 2008.

Step 1: HOG Pyramid

4 x 9 descriptor:

Step 1: HOG Pyramid

● Normalize w.r.t. the sum of histogram values in each 2 x 2 block containing the cell.
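The normalization above can be sketched as follows. This is a minimal illustration, assuming 9-bin cell histograms stored in a grid and using the plain sum of block values as the normalizer, as the slide words it; the function name `normalize_cell` and the `eps` guard are hypothetical, not taken from the paper.

```python
def normalize_cell(hist, y, x, eps=1e-6):
    """Return the 4 x 9 descriptor for the cell at row y, column x.

    hist is a grid (list of rows) of 9-bin orientation histograms.
    The cell must be interior so that all four 2 x 2 blocks exist.
    """
    descriptor = []
    # Each (dy, dx) is the offset of the top-left corner of one of the
    # four 2 x 2 blocks that contain the cell (y, x).
    for dy in (-1, 0):
        for dx in (-1, 0):
            block_sum = sum(
                sum(hist[y + dy + j][x + dx + i])
                for j in (0, 1)
                for i in (0, 1)
            )
            # One normalized copy of the cell histogram per block.
            descriptor.append([v / (block_sum + eps) for v in hist[y][x]])
    return descriptor
```

Each cell therefore contributes four differently normalized copies of its 9-bin histogram, which is where the 4 x 9 shape comes from.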

Step 1: HOG Pyramid

A w ⨉ h filter F is a vector of w ⨉ h ⨉ 4 ⨉ 9 weights; H denotes the HOG pyramid it is applied to.
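A filter is scored at a position by taking the dot product of its weights with the features in the w ⨉ h subwindow at that position. A minimal sketch, assuming each cell's 4 ⨉ 9 descriptor is flattened to 36 numbers; `filter_score` and the data layout are illustrative assumptions.

```python
def filter_score(features, filt, x0, y0):
    """Dot product of a w x h filter with the feature subwindow at (x0, y0).

    features[y][x] and filt[j][i] are flat lists of 36 (= 4 * 9) numbers.
    (x0, y0) is the top-left cell of the subwindow in one pyramid level.
    """
    score = 0.0
    for j, filt_row in enumerate(filt):
        for i, weights in enumerate(filt_row):
            cell = features[y0 + j][x0 + i]
            score += sum(w * f for w, f in zip(weights, cell))
    return score
```

Evaluating this at every position of every pyramid level gives the response map used for detection.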

Pascal 2007 Dataset

Step 2: Initialize Root Filter

Scale to Filter Size

Set Filter Size

Train Root Filter

Step 2: Train Root Filter

Unscaled Image

Find best placement in HOG pyramid.

Train Filter

At least 50% overlap w/ ground truth.

Step 2 Summary

1. Set filter size based on statistics in the data.

2. Train on unoccluded examples with SVM.
○ Scale each example to match the filter size.
○ Random subwindows of negative images give negative examples.

3. Find the best filter placement in the HOG pyramid for each training image.
○ Un-scaled training images.
○ At least 50% overlap.

4. Re-train using best placements.
○ Same negatives as before.
○ Iterate twice.
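Step 3 of the summary (best placement with at least 50% overlap) can be sketched as below. The slides do not say how overlap is measured; this sketch assumes intersection-over-union of axis-aligned boxes, and the names `overlap` and `best_placement` are hypothetical.

```python
def overlap(box_a, box_b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    ih = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    inter = iw * ih
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def best_placement(candidates, ground_truth, min_overlap=0.5):
    """Pick the highest-scoring (score, box) candidate that overlaps enough.

    candidates: list of (score, box) pairs from scanning the HOG pyramid.
    Returns None when no candidate reaches min_overlap.
    """
    valid = [c for c in candidates if overlap(c[1], ground_truth) >= min_overlap]
    return max(valid, key=lambda c: c[0]) if valid else None
```

A high-scoring window far from the annotated box is rejected, so re-training only sees placements that still cover the object.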

Step 3: Initialize Part Filters

● Train latent SVM on the full model:
○ β = (F0, F1, …, F6, a1, b1, …, a6, b6) are model parameters.
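To make the role of each parameter in β concrete, here is a sketch of how one placement could be scored: the root and part filter responses are summed, and each part adds a deformation term weighted by a_i and b_i. It assumes the deformation term is linear in (dx, dy) and (dx², dy²) and is added to the filter responses; the function name and argument layout are illustrative.

```python
def placement_score(root_response, part_responses, displacements, a, b):
    """Score of one placement z of the root and its parts.

    root_response: response of F0 at the root location.
    part_responses[i]: response of part filter i at its placed location.
    displacements[i]: (dx, dy) of part i relative to its anchor position.
    a[i], b[i]: 2-vectors weighting (dx, dy) and (dx**2, dy**2).
    """
    score = root_response
    for resp, (dx, dy), a_i, b_i in zip(part_responses, displacements, a, b):
        score += resp
        score += a_i[0] * dx + a_i[1] * dy
        score += b_i[0] * dx * dx + b_i[1] * dy * dy
    return score
```

With negative entries in b_i, a part pays a quadratic penalty for moving away from its anchor.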

Trained Root Filter

Step 3: Train Object Model

Train Model

SVM Objective:

Labeled training data (xi, yi).

Score of placement z.
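The objective itself did not survive in this transcript. As a hedged reconstruction, following the latent SVM formulation in the cited CVPR 2008 paper (the exact form shown on the slide is an assumption), with λ a regularization constant, Z(x) the set of possible placements in example x, and Φ(x, z) the feature vector of placement z:

```latex
\beta^{*} = \operatorname*{argmin}_{\beta} \; \lambda \lVert \beta \rVert^{2}
          + \sum_{i=1}^{n} \max\bigl(0,\; 1 - y_i f_{\beta}(x_i)\bigr),
\qquad
f_{\beta}(x) = \max_{z \in Z(x)} \beta \cdot \Phi(x, z)
```

Here β · Φ(x, z) is the score of placement z, and the maximization over z is what makes the placement a latent variable.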

Step 3 Summary

1. Initialize 6 parts.
○ Position in areas of highest energy of root filter.

2. Train latent SVM on the full model:
○ β = (F0, F1, …, F6, a1, b1, …, a6, b6) are model parameters.
○ For each positive example, find best overall placement z.
○ Use high-scoring regions in negative images as hard negatives.
○ Iterate 10 times. Each time cache as many hard negatives as can fit into memory.
■ Remove negatives that are no longer hard.
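The caching of hard negatives in each iteration can be sketched as follows. This is an illustration under stated assumptions: a negative counts as "hard" when the current model scores it above −1 (inside the SVM margin), and `capacity` stands in for the memory limit; `mine_hard_negatives` and its arguments are hypothetical names.

```python
def mine_hard_negatives(cache, candidates, score, capacity, margin=-1.0):
    """One round of hard-negative caching.

    cache, candidates: lists of feature vectors taken from negative images.
    score: function mapping a feature vector to the current model score.
    capacity: maximum number of cached negatives (stand-in for memory).
    """
    # Drop cached negatives that the current model already handles.
    kept = [x for x in cache if score(x) > margin]
    # Fill the remaining space with newly found hard negatives.
    for x in candidates:
        if len(kept) >= capacity:
            break
        if score(x) > margin:
            kept.append(x)
    return kept
```

Calling this once per training iteration keeps the cache focused on the examples the current model still gets wrong or nearly wrong.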

SVM Objective:

Labeled training data (xi, yi).

Score of placement z.

Results

● Decent performance:
○ PASCAL 2007 challenge.
○ First place in 10/20 classes.
○ Second place in 6/20.

● Fast:
○ 3-4 hours training.
○ ~2s evaluation.

Felzenszwalb, Pedro, David McAllester, and Deva Ramanan. "A discriminatively trained, multiscale, deformable part model." Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, 2008.

Felzenszwalb, Pedro F., et al. "Object detection with discriminatively trained part-based models." Pattern Analysis and Machine Intelligence, IEEE Transactions on 32.9 (2010): 1627-1645.

Car Model

Person

Bottle

Felzenszwalb, Pedro F., et al. "Object detection with discriminatively trained part-based models." Pattern Analysis and Machine Intelligence, IEEE Transactions on 32.9 (2010): 1627-1645.

Best overall results with all three components.

Conclusion

● HOG pyramid representation.
● Root filter + part filters + latent placement variables.
○ Train with latent SVM.
● Hard negative mining.
● Possible extensions?
○ Deeper part hierarchies (parts of parts).
○ Multiple viewpoint models (front, side, back, etc.).
○ 3D pose estimation.
○ Visual words for parts: multi-class detection.

Object Hypothesis Computation

Felzenszwalb, Pedro F., et al. "Object detection with discriminatively trained part-based models." Pattern Analysis and Machine Intelligence, IEEE Transactions on 32.9 (2010): 1627-1645.

[Figure: object hypothesis computation. Panels: HOG Features, 2x Resolution HOG, Object Model, Root Response, Part Responses, + Deformation, Combined Score.]
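How the root response, part responses, and deformation cost combine into one score can be sketched with a brute-force search over part displacements. This is a simplification under stated assumptions: only the quadratic deformation weights are used, the search is limited to a small `radius`, and an exhaustive loop replaces any faster method the authors may use; all names are illustrative.

```python
def combined_score(root_resp, part_resps, anchors, b, radius=2):
    """Best overall score for one root location.

    root_resp: root filter response at that location.
    part_resps[i][y][x]: response map of part i.
    anchors[i]: (ax, ay) ideal cell for part i in its response map.
    b[i]: (bx, by) quadratic deformation weights (negative = penalty).
    """
    total = root_resp
    for resp, (ax, ay), (bx, by) in zip(part_resps, anchors, b):
        best = float("-inf")
        # Try every displacement near the anchor and keep the best trade-off
        # between part response and deformation cost.
        for dy in range(-radius, radius + 1):
            for dx in range(-radius, radius + 1):
                y, x = ay + dy, ax + dx
                if 0 <= y < len(resp) and 0 <= x < len(resp[0]):
                    best = max(best, resp[y][x] + bx * dx * dx + by * dy * dy)
        total += best
    return total
```

A part may therefore shift away from its anchor when the gain in filter response outweighs the deformation penalty.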

</presentation>
