Matching by Mapping

1

Matching by MappingYacov Hel-OrI.D.C.

Visiting Scholar – Google

joint work withHagit Hel-Or and Eyal

DavidU. of Haifa, Israel

• A given pattern p is sought in an image. • The pattern may appear at any location in the image.• The pattern may be subject to some deformations T(p).

2

Dense Pattern Matching

pattern p

image similarity map

3

Possible Deformations• Geometric deformations:– Different point of views– Different articulated poses

• Photometric deformations:– Different camera’s photometric parameters (exposure,

white balancing, sensor’s sensitivity, tone correction, etc.)

– Different illuminant colors– Different lighting geometry

• Serves as a building block in many applications.

• Applications: “patch based” methods– Image summarization– Image retargeting– Super resolution– Image denoising– Tracking, Recognition, many more …

Pattern Matching

Invariance – Find a signature that will be invariant to the deformation.

– Lose information. Weaken discriminative power.

Canonization– Transform into canonical position.

– Slow.

Brute force search– Search the entire deformation space.

– Slow

Dealing with Deformations

• In this work we deal only with tone mapping deformation.• Commonly can be locally represented as a functional

relationship between the sought pattern p and a candidate window w:

6

Pattern matching under Tone Mapping

w=M(p)or

p=M(w)Vp

Vw

Vp

Joint histograms of two images taken under different illumination conditions and different camera photometric parameters.

From Kagarlitsky, Moses, and Hel-Or, ICCV 2010.

• Given a pattern p and a candidate window w a distance metric must be defined, according to which matchings are determined:

• Desired properties of D(p,w) :– Discriminative– Robust to Noise– Invariant to some deformations: tone mapping– Fast to execute

Distance Metric

D(p,w)

Possible Tone Mappings

identity mapping affine mapping

monotonic mapping non-monotonic mapping

• Sum of Squared Difference (SSD):

– By far the most common solution.– Assumes the identity tone mapping.– Fast implementation (~1 convolution).

Common Distance Metrics

2

, wpwpDE

• Normalized Cross Correlation (NCC):

– Compensates for affine mappings (canonization).– Fast implementation (~ 1 convolution) .

)var()var(

),cov(1varvar

,2

wpwp

www

pppEwpDNCC

• Local Binary Pattern (LBP): Ojala et al. 96

– Each pixel is assigned a value representing its surrounding structural content.

– Compensates for monotonic mappings.– Fast implementation.– Sensitive to noise.

nP

nnc ggscLBP 2)(

1

0

0001

)(xifxif

xs

• Mutual Information (MI):

– Matching is sought by maximizing the MI.– Compensates for non-linear mappings.

H(w)

H(p)I(w,p)

I(p,w) = H(w)-H(w | p) = H(p)+H(w)-H(p,w)

• A functional dependency between two variables, p and w, can be detected in their joint histogram P(p,w)

The Joint Histogram

50 100 150 200 250

50

100

150

200

250

50

100

150

200

250

50 100 150 200 250

p and w are independent p and w are strongly dependant

Joint Histograms and MI wpHwHpHwpI ,),(

p

w

H(p)

H(w)

Joint Histograms and MI

p

w

p

w wpHwHpHwpI ,),(

p

w

p

w

p

w matching

non matching

Properties:• Measures the entropy loss in w given p.• High MI values indicate good match between w and p.• Compensates for non-monotonic mappings • Discriminative.• Sensitive to bin-size / kernel-variance. • Sensitive to small samples.• Very slow to apply.

MI as a Distance Measure:

H(w)H(p)

Properties:• Highly discriminative.• Tone mapping invariant.• Robust to noise and bin-size.• As fast as NCC (~1 convolution).• A natural generalization of the NCC for non-linear

mappings.

Proposed Approach: Matching by Tone Mapping (MTM)

• Proposed distance measure:

• Note: the division by var(w) avoids the trivial mapping.

Matching by Tone Mapping (MTM)

wnwpMwpD

M varmin,

2

pnpwMpwD

M varmin,

2

Basic Ideas:• Approximate M(p) by a piece-wise constant mapping.• Represent M(p) in a linear form:

• Solve for the parameter vector (closed form).

How can MTM be calculated efficiently?

M(p) Sp

Slice matrix Parameter vector

• Assume the pattern/window values are restricted to the half open interval [a,b).

• We divide [a,b) into k bins =[1,2,...,k+1]

• A value z is naturally associated with a single bin:

B(z)=j if z[j,j+1)

MTM as a linear form:

z

1 2 k+1j j+1

Pattern Slices

• We define a pattern slice

Pattern Slices

2nd slice p2 1st slice p1

0 1

• We define a pattern slice

The Slice Matrix

• Raster scanning the slice windows and stacking into a matrix constructs a slice matrix Sp =[p1 p2 … pk].

= Sp

*

• The matrix Sp is orthogonal: pipj = |pi| ij

• Its columns span the space of piecewise constant tone mappings of p:

Sp p

S p p

M(p)

*

Changing the values to a different vector, , applies piece-wise tone mapping:

p Sp

S p M(p)

• Representing tone mapping in a linear form, the MTM distance D(p,w) is defined as:

• Since Sp is orthogonal ( STS(i,j)=ij|pj| ), the above expression can be minimized in a closed form solution:

Back to Pattern Matching

wnwS

wpD p

varmin,

2

j

jj

wpp

wwn

wpD22 1

var1),(

j

jj

wpp

w22 1

D( ) =( , )

( )--( )

2

2

p w ( )-2

*

Loop j

:|||| 2w

2

p:

MTM for running windows:

j

jj

wpp

wwn

wpD22 1

var1),(

*

Loop j

:|||| 2p

2

p:

MTM for running windows:

j

jj

pww

ppn

pwD22 1

var1),(

• Convolutions can be applied efficiently since pj

is sparse.

• Convolving with pj requires |pj| operations.

• Since pipj= run time for all k sparse convolutions sum up to a single convolution!

Complexity

• Since we can rewrite: 1j

jp

j

jj

j

j

j

jj

wpp

wpwpp

w2222 11

jj

j

j

jj

pwp

pwpp

22

E2(w|pj)E(w2|pj)var(w|pj)n E (var(w|p))

MTM: Statistical Properties

E(w |p=pj)

var(w |p=pj)

pjp

w

Tone Mapping

E(var(w |p))

E(w |p=pj)

pj p

w

Tone Mappingvar(E(w |p))

• The correlation ratio 1-D(w,p) measures the relative reduction in variance of w given p.

• Restricting M to be a linear tone mapping: M(z)=az+b, the measure 1-D(w,p) boils down to the Normalized Cross Correlation:

Observations:

w

pwEwwpDvar

|var)var(,1

),(,1 2 wpNCCwpD

• MTM and MI are similar in spirit.

• While MI maximizes the entropy reduction in w given p, MTM maximizes the variance reduction in w given p.

MTM and MI

MI MTM

Maximizes Entropy reduction

Variance reduction

Speed slow fast

Bin size sensitive insensitive

Small samples sensitive insensitive

Results

• Minimum distance measure for each image column:

40

41

Non Monotonic mappings: Detection rates (over 2000 image pattern pairs) v.s extremity of the applied tone mapping.

42

Monotonic mappings: Detection rates (over 2000 image pattern pairs) v.s extremity of the applied tone mapping.

43

Performance of MI and MTM for various pattern sizes and over various bin-sizes

44

Run time for various pattern size (in 512x512 image)

Visual SAR

Example Application: Multi-Modal Registration

• How can we distinguish between target and shadow?

Example Application: Shadow Detection

Background model Video frame

Background Subtraction

Background model Video frame

• Assumption: Shadow areas are functionally dependent on the background model.

MTM distance

• A new distance measure that accounts for non-linear tone mappings.

• An efficient scheme for applying over the entire image (~1 convolution).

• Statistically motivated.

• A natural generalization of NCC.• Extension: Piecewise-linear tone mapping

– Enables fewer bins– Robust– Solving using TDMA– Requires ~2 convolutions

53

Conclusions:

54

THANK YOU

Matching by Mapping

Documents

Transcript of Matching by Mapping