Latent SVMs for Human Detection with a Locally Affine Deformation Field

Ľubor Ladický1 Phil Torr2 Andrew Zisserman1

1 University of Oxford 2 Oxford Brookes University

Object Detection

• Find all objects of interest

• Enclose them tightly in a bounding box

HOG Detector

Dalal & Triggs CVPR05

• Sliding window using learnt HOG template

• Post-processing using non-maxima suppression

HOG Detector

Classifier response :

HOG Detector

The weights w* and the bias b* learnt using the Linear SVM as:

Does not fit well !• Sliding window using learnt HOG template

HOG Detector

Deformable Part-based Model

Felzenszwalb et al. CVPR08

• Allows parts to move relative to the centre

• Effectively allows the template to deform

• Multiple models based on an aspect ratio

Felzenszwalb et al. CVPR08

Deformable Part-based Model

• Allows parts to move relative to the centre

• Effectively allows the template to deform

• Multiple models based on an aspect ratio

Cells c displaced by the deformation field d:

Our approach

Regularisation takes the form of a pairwise MRF:

Our approach

The weights w* and the bias b* learnt using the Latent Linear SVM as:

Comparison with our approach

HOG template

(no deformation)

Part-based model

(rigid movable parts)

Our model

(deformation field)

Our approach

Why hasn’t anyone tried it before?

Our approach

• Latent models with many latent variables tend to overfit

• Inference not feasible for a sliding window

Our approach

• Latent models with many latent variables tend to overfit

• Inference not feasible for a sliding window

• Deformation field used before only for

• Classification task (Duchenne et al ICCV11)

• Rescoring of detections (Ladický, PhD thesis)

Our approach

We restrict the deformation field to be locally affine ( ):

Our approach

Optimisation

Weights / bias (w*, b*) and the deformation fields dk estimated iteratively

Optimisation

Given the deformation fields the problem is a standard linear SVM:

Optimisation

Given (w*, b*) the problem is a constrained MRF optimisation:

The last can be decomposed as :

By defining the optimisation becomes:

Optimisation

• The location of the cells in the first row and in the first column fully

determine the location of each cell

• Any locally affine deformation field can be reached by two moves :

• each column i moves by (Δcdix ,Δcdi

• each row j moves by (Δrdjx ,Δrdj

Optimisation

• Such moves do not alter the local affinity

Optimisation

• Such moves do not alter the local affinity

• Both moves can be solved quickly using dynamic programming

Learning multiple poses / viewpoints

• We define a similarity measure between two training samples as :

Learning multiple poses / viewpoints

• We define a similarity measure between two training samples as :

• K-medoid clustering of S matrix clusters the data into multi model

Experiments

• Buffy dataset (typically used for pose estimation)

• Contains large variety of poses, viewpoints and aspect ratios

• Consists of 748 images

• Episode s5e3 used for training

• Episode s5e4 used for validation

• Episodes s5e2, s5e5 and s5e6 used for testing

Ferrari et al. CVPR08

Clustering of training samples

Each row corresponds to one model (out of 10 models)

Qualitative results

Quantitative results

Conclusion

• We propose

• Novel inference for locally affine deformation field (LADF)

• Object detector using LADF

• Clustering using LADF

Questions?

Thank you

Latent SVMs for Human Detection with a Locally Affine Deformation Field

Documents

Transcript of Latent SVMs for Human Detection with a Locally Affine Deformation Field

3D affine coordinate transformations - AOLSlearning.aols.org/aols/3D_Affine_Coordinate_Transformations.pdf · 3D affine coordinate transformations ... general 3D affine transformation

Introduction to SVMs. SVMs Geometric –Maximizing Margin Kernel Methods –Making nonlinear decision boundaries linear –Efficiently! Capacity –Structural.

Learning Structural SVMs with Latent VariablesLearning Structural SVMs with Latent Variables Chun-Nam Yu Dept. of Computer Science, Cornell University October 8-9, IBM SMiLe Workshop

affine cipher

Invariants to affine transform What is affine transform ?

Affine Transformation

Large Scale Transductive SVMs - bottou.org

Learning Structural SVMs with Latent Variables Xionghao Liu.

Affine Transformation. Affine Transformations In this lecture, we will continue with the discussion of the remaining affine transformations and composite.

311 Affine Scheme

OEMs SVMs (LDV/MDV/HDV)

Learning Ranking Functions with SVMs

Learning Structural SVMs with Latent Variablesdanroth/Teaching/CIS-700-006/slides/lat… · Learning Structural SVMs with Latent Variables ICML, 2009 Tsochantaridis et al. Support

Learning Structural SVMs with Latent Variables

Minimal Loss Hashing for Compact Binary Codesnorouzi/research/papers/min_loss...ing binary hash functions, based on structural SVMs with latent variables (Yu & Joachims,2009) and an

Affine Transforms

Non-Linear SVMs

SVMs in a Nutshell

Cat SVMS 72

Affine Parameterisation 1