FFTs in Graphics and Vision - Department of …misha/Fall08/07.pdfBut this is just a rotation in the...

FFTs in Graphics and Vision

Alignment, Invariance

and Pattern Matching

Outline

Alignment

Shape Matching

Invariance

Pattern Matching

Shape Representation

For 2D shape matching/analysis, it is common to

represent the geometry of a shape by a circular

array of real values.

Shape Representation

Example:

The circular extent function represents the extent

of the shape about the center of mass:• The value at an angle is the distance to the last

point of intersection of the ray from the origin, with

angle , with the shape

Alignment

Since the shape of an object doesn’t change

when we rotate it, we would like to know if the two

arrays are equivalent, up to rotation.

Alignment

Is there a rotation that will rotate the first array

into the second?

Alignment

into the second?

Given the n-dimensional arrays f[ ] and g[ ], is

there an index such that:

]) [(] [ fg

nkkfkg 0 ][][

Semantics

In a continuous setting, asking the binary

question “are the arrays equal” is not very

meaningful, sinceSampling

can result in two “equal” arrays having different

values.

Semantics

into the second?

For every rotation, how close is the rotation of the

first array to the second array?

Semantics

into the second?

For every rotation, how close is the rotation of the

first array to the second array?

For every rotation , what is the value of:

] []), [( gfD10

Alignment

Since the space of functions on a circle is an

inner product space, there is an implicit metric

defined by:

] [] [], [] [] [] [] [], [22 gfgfgfgfD

Alignment

Since the space of functions on a circle is an

inner product space, there is an implicit metric

defined by:

So in our situation, we would like to evaluate:

at every .

] [] [], [] [] [] [] [], [22 gfgfgfgfD

] []) [(], []) [(] []), [(2 gfgfgfD

Alignment

Re-writing this equation gives:

] []) [(], []) [(] []), [(2 gfgfgfD

] []), [(] []), [(

] [], []) [(]), [(] []), [(2

ggffgfD

Alignment

Since the Hermitian dot-product of two real-

valued arrays is also a real value:

] []), [(2] []) [(] []), [(222 gfgfgfD

] []), [(] []), [(

] [], []) [(]), [(] []), [(2

ggffgfD

Alignment

Since is a unitary transformation:

] []), [(2] [] [] []), [(222 gfgfgfD

] []), [(2] []) [(] []), [(222 gfgfgfD

Alignment

So, to compute the distance between a rotation of

the circular array f[ ] by , and the circular array

g[ ], we need to know:The magnitude of f[ ]: ,

The magnitude of g[ ]: ,

The value of the cross-correlation:

] []), [(2] [] [] []), [(222 gfgfgfD

] []), [( gf

Alignment

• The size of f[ ]:This is a constant value independent of and can be

computed in O(n) time.

• The size of g[ ]:This is a constant value independent of and can be

computed in O(n) time.

• The value of the cross-correlation:This can be computed using the FFT in O(n log n)

] []), [(2] [] [] []), [(222 gfgfgfD

] []), [( gf

Alignment

] []), [(2] [] [] []), [(222 gfgfgfD

f[ ] g[ ]] []), [( gfD

Alignment

Instead of looking for the minimum of the moving

L2-difference, why not just look for the maximum

of the moving dot-product?

] []), [(2] [] [] []), [(222 gfgfgfD

f[ ] g[ ]

]])[ [] [( gf *

Outline

Alignment

Shape Matching

Invariance

Pattern Matching

Shape Matching

In shape matching applications, we would like to

find the shapes in a database that are most

similar to a given query.

Shape Matching

General approach:

Define a function that takes in two models and

returns a measure of their proximity.

D , D ,M1 M1 M3M2

M1 is closer to M2 than it is to M322

Database Retrieval

• Compute the distance from the query to each

database model

3D Query

Database Models

D(Q,Mi)

Database Retrieval

• Sort the database models by proximity

3D Query

Database Models Sorted Models

D(Q,Mi)

MQDMQD ji )~

Database Retrieval

• Return the closest matches

Best Match(es)

3D Query

Database Models Sorted Models

D(Q,Mi)

MQDMQD ji )~

Shape Matching

In shape matching applications, we would like to

find the shapes in a database that are most

similar to a given query.

To do this efficiently, models are often

represented by Shape Descriptors: Arrays of values encapsulating information about

the shape of the model, such that

The distance between the arrays gives a measure

of proximity of the underlying shapes.

Shape Matching

Challenge:

Since the shape of the model doesn’t change if

we rotate it, we would like to match models

across rotational poses.

Shape Matching

Challenge:

Solution 1:

One way to resolve is this is to define the

measure of similarity by computing the cross-

correlation, to find the distance between two

models at the best possible alignment.

Shape Matching

Challenge:

Solution 1:

This can be too slow for interactive

applications that need to return the best

match from very large databases.

Shape Matching

Challenge:

Solution 1:

This can be too slow for interactive

applications that need to return the best

match from very large databases.

Not quite true for 1D arrays, but becomes

more true as the dimension increases.

Shape Matching

Challenge:

Solution 2:

Alternatively, we can try to design a shape

descriptor that is rotation invariant: Instances of the same shape in the same pose will

give the same shape descriptor.

Invariance

Given an array f[ ], we would like to define a

(possibly non-linear) mapping taking f[ ] to

some array g[ ], such that for all :

f[ ] g[ ]

]) [(]) [( gf

]) [(]) [( ff

Invariance

Given an array f[ ], expressed in terms of its

Fourier decomposition:

What happens to the Fourier coefficients of f[ ] as

we rotate f[ ]?

] [][ˆ] [n

Invariance

Given an array f[ ], expressed in terms of its

Fourier decomposition:

What happens to the Fourier coefficients of f[ ] as

we rotate f[ ]?

] [][ˆ] [n

]) [(][ˆ]) [(n

Invariance

Since the vk[ ]are a basis for the one-dimensional

irreducible representations of the rotation group,

we know that:

where k( ) is some complex number.

]) [(][ˆ]) [(n

] [)(]) [( kkk vv

Invariance

Since the vk[ ]are a basis for the one-dimensional

irreducible representations of the rotation group,

we know that:

where k( ) is some complex number.

Since the representation is unitary, we know that

k( ) must have unit complex norm.

] [)(]) [( kkk vv

]) [(][ˆ]) [(n

Invariance

Thus, if:

Are the Fourier coefficients of f[ ], then the Fourier

coefficients of (f[ ]) will be:

]1[ˆ)(,],1[ˆ)(],0[ˆ)( 110 nfff n

]1[ˆ,],1[ˆ],0[ˆ nfff

]) [(][ˆ]) [(n

Invariance

Thus, if:

Are the Fourier coefficients of f[ ], then the Fourier

coefficients of (f[ ]) will be:

and we have:

for all k.

]1[ˆ)(,],1[ˆ)(],0[ˆ)( 110 nfff n

]1[ˆ,],1[ˆ],0[ˆ nfff

][ˆ][ˆ)( kfkfk

]) [(][ˆ]) [(n

Invariance

So, we can get a rotation invariant representation

of f[ ] by storing only the magnitudes of the

Fourier coefficients (i.e. discarding phase):

]1[ˆ,,]1[ˆ,]0[ˆ]) [( nffff

][ˆ][ˆ)( kfkfk

Invariance

What kind of information do we get when we

compare just the amplitudes of the Fourier

coefficients?

Invariance

Suppose that we are given two arrays f[ ] and g[ ]

with only one non-zero Fourier coefficient:

what is the measure of similarity

at the optimal alignment?

] [][ˆ] [

][ˆ kg

][ˆ kf

Invariance

If we rotate f[ ] by , this

amounts to multiplying the

k-th Fourier coefficient by e-ik .

] [][ˆ] [

][ˆ kg

][ˆ kf

Invariance

If we rotate f[ ] by , this

amounts to multiplying the

k-th Fourier coefficient by e-ik .

But this is just a rotation in the complex plane.

] [][ˆ] [

][ˆ kg

][ˆ kf

Invariance

At the optimal rotation, the

Fourier coefficients are on

the same line and the measure

of similarity is the difference between the lengths.

] [][ˆ] [

][ˆ kg

][ˆ kfx

Invariance

Thus, storing only the amplitude of the Fourier

coefficients and discarding phase, we get a shape

representation (f[ ]) that: Is invariant to rotations

Provides a measure of similarity that corresponds

to the distance between f[ ] and g[ ] if we could

optimally align the different frequency components

independently.

Invariance

Thus, storing only the amplitude of the Fourier

coefficients and discarding phase, we get a shape

representation (f[ ]) that: Is invariant to rotations

Provides a measure of similarity that corresponds

to the distance between f[ ] and g[ ] if we could

optimally align the different frequency components

independently.

This is a lower bound for the distance between f[ ]

and g[ ] at the optimal alignment.46

Invariance

How good is the lower bound?

After discarding phase, what’s left?

Invariance

How good is the lower bound?

After discarding phase, what’s left?

Experiment:

To test this, we can consider what happens when

we take two arrays and swap the amplitudes of

the Fourier coefficients:1

] [] [ ] [] [n

k vesgverf kk

] []) [], [(ASwapn

k vergf k

Invariance

Amplitude Phase

Invariance

Does this imply that most of the information is

stored in the phase?

Invariance

Not necessarily. For human perception, dominant

information occurs at image boundaries: Positions in the image where there is an abrupt

change of value

Invariance

change of value

These discontinuities arise when the phases are

lined up so the occurrence of these events is

strongly phase dependent.

Invariance

change of value

These discontinuities arise when the phases are

lined up so the occurrence of these events is

strongly phase dependent.

If the grid encodes other type of information (non-

visual) amplitude can be more important.53

Outline

Alignment

Shape Matching

Invariance

Pattern Matching

Notation

If f[ ] and g[ ] are two n-dimensional arrays, then

we can define f[ ]·g[ ] to be the entry-wise product

of the two arrays, with:

][][][] [] [ jgjfjgf

Note 1

If f[ ] is a real-valued n-dimensional array, and g[ ]

and h[ ] are complex-valued n-dimensional

arrays, then:

] [] [], [] [], [] [ hfghgf

Note 1

arrays, then:

] [] [], [

][] [] [][

][][][

][][] [] [] [], [] [

khkfkg

khkgkf

khkgfhgf

Note 1

arrays, then:

] [] [], [

][] [] [][

][][][

][][] [] [] [], [] [

khkfkg

khkgkf

khkgfhgf

Note 1

arrays, then:

] [] [], [

][] [] [][

][][][

][][] [] [] [], [] [

khkfkg

khkgkf

khkgfhgf

Note 1

arrays, then:

] [] [], [

][] [] [][

][][][

][][] [] [] [], [] [

khkfkg

khkgkf

khkgfhgf

Note 1

arrays, then:

] [] [], [

][] [] [][

][][][

][][] [] [] [], [] [

khkfkg

khkgkf

khkgfhgf

Note 2

If is the unitary representation that shifts an

array by indices:

][]])[ [( kfkf

]) [(]) [(]) [] [( gfgf

Pattern Matching

Given an instance of a pattern, find all

occurrences of the pattern within a target image:

Pattern f[ ][ ] Target Image g[ ][ ]

Pattern Matching

Given an instance of a pattern, find all

occurrences of the pattern within a target image:

Pattern f[ ][ ] Target Image g[ ][ ]

Pattern Matching

We could compute the cross correlation of the

pattern with the image and look for local maxima:

Pattern Matching

We could compute the cross correlation of the

pattern with the image and look for local maxima:

Pattern Matching

The cross-correlation has large values because

the dot product of the image with the translated

pattern instance is large.

What causes the dot-product of f[ ] with g[ ] to be

large?

Pattern Matching

large? If the values of f[ ] are similar to the values of g[ ]

Pattern Matching

large? If the values of f[ ] are similar to the values of g[ ]

If the values of g[ ] are large

Pattern Matching

We don’t want to measure:

How correlated is the pattern instance with a

region in the image?

What we want to measure is:

How similar is the pattern instance with a

region in the image?

Pattern Matching

For every point in the image, we want to know

how similar the region about the point is to the

translated pattern.

Restricted Target

Translated Pattern

Pattern Matching

translated pattern.

Restricted Target

Translated Pattern

Pattern Matching

translated pattern.

Restricted Target

Translated Pattern

Pattern Matching

translated pattern.

Restricted Target

Translated Pattern

Pattern Matching

How do we express this formally?

Pattern Matching

If we represent the pattern by f[ ][ ], then the

translation of the pattern to the point p can be

written as:]) ][ [( fp

Pattern Matching

written as:

How do we express the restriction of g[ ][ ] to the

region about p?

]) ][ [( fp

Pattern Matching

written as:

How do we express the restriction of g[ ][ ] to the

region about p?

What we want to do is to zero out all of g[ ][ ]

except for the region about p.

]) ][ [( fp

Pattern Matching

Let [ ][ ] be the characteristic grid of the pattern:

otherwise0

pattern theinside is ),( if1]][[

f[ ][ ] [ ][ ]

Pattern Matching

The restriction of g[ ][ ] to the region about p can

be expressed as:

p(f[ ][ ]) shifts the characteristic grid to be centered

about p.

Multiplying by p(f[ ][ ]) zeros out everything except

for the region about p.

] ][ []) ][ [( gp

Pattern Matching

be expressed as:

] ][ []) ][ [( gp

[ ][ ]

g[ ][ ]

Pattern Matching

be expressed as:

] ][ []) ][ [( gp

[ ][ ]

p( [ ][ ])

g[ ][ ]

Pattern Matching

be expressed as:

] ][ []) ][ [( gp

[ ][ ]

p( [ ][ ]) p( [ ][ ])·g[ ][ ]

g[ ][ ]

Pattern Matching

For every p, we would like to compute:

] ][ []) ][ [(]) ][ [( gf pp

p( [ ][ ])·g[ ][ ]p(f[ ][ ])84

Pattern Matching

For every p, we would like to compute:

Writing this out in terms of dot-products gives

three terms:

] ][ []) ][ [(]) ][ [( gf pp

]) ][ [(]), ][ [( ff pp

] ][ []) ][ [(]), ][ [(2 gf pp

] ][ []) ][ [(], ][ []) ][ [( gg pp

Pattern Matching (Term 1)

Using the the fact that the representation is

unitary gives:2

] ][ []) ][ [(]), ][ [( fff pp

]) ][ [(]), ][ [( ff pp

Using the the fact that [ ][ ] is real-valued, we

can move it to the other side of the dot-product:

] ][ []) ][ [(]), ][ [(2 gf pp

] ][ []), ][ [(]) ][ [(] ][ []) ][ [(]), ][ [( gfgf pppp

Since the product of the representations is the

representation of the products we get:

] ][ []) ][ [(]), ][ [(2 gf pp

] ][ []), ][ [(]) ][ [(] ][ []) ][ [(]), ][ [( gfgf pppp

] ][ []), ][ [] ][ [(] ][ []), ][ [(]) ][ [( gfgf ppp

Since the product of the representations is the

representation of the products we get:

And since [ ][ ] is equal to one whenever f[ ][ ] is

non-zero we get:

] ][ []) ][ [(]), ][ [(2 gf pp

] ][ []), ][ [(]) ][ [(] ][ []) ][ [(]), ][ [( gfgf pppp

] ][ []), ][ [] ][ [(] ][ []), ][ [(]) ][ [( gfgf ppp

] ][ []), ][ [(] ][ []), ][ [] ][ [( gfgf pp 89

Using the the fact that [ ][ ] and g[ ][ ] are both

real-valued, we can move them to the other sides

of the dot-product:

] ][ []) ][ [(], ][ []) ][ [( gg pp

] ][ [,]) ][ [(] ][ []) ][ [(], ][ []) ][ [( 22ggg ppp

Using the the fact that [ ][ ] and g[ ][ ] are both

real-valued, we can move them to the other side

of the dot-product:

Since the [ ][ ] is always equal to either 0 or 1,

we have [ ][ ]= 2[ ][ ] so that:

] ][ []) ][ [(], ][ []) ][ [( gg pp

] ][ [,]) ][ [(] ][ []) ][ [(], ][ []) ][ [( 22ggg ppp

] ][ []), ][ [(] ][ [,]) ][ [( 222gg pp

Pattern Matching

Combining all of this together, we get:

] ][ []), ][ [(2

] ][ []), ][ [(] ][ [] ][ []) ][ [(]) ][ [( 222

Pattern Matching

Or somewhat more cleanly:

] ][ []), ][ [(2

] ][ []), ][ [(] ][ [] ][ []) ][ [(]) ][ [( 222

] ][ [] ][ [2 ] ][ [] ][ [ ] ][ [ 22gfgf **

Pattern Matching

Or somewhat more cleanly:

] ][ []), ][ [(2

] ][ []), ][ [(] ][ [] ][ []) ][ [(]) ][ [( 222

] ][ [] ][ [2 ] ][ [] ][ [ ] ][ [ 22gfgf **

The windowed norm The moving dot-product

Pattern Matching

] ][ [] ][ [2] ][ [] ][ [] ][ [ 22gfgf **

f[ ][ ] g[ ][ ]

f[ ][ ]*g[ ][ ]

||f[ ][ ]||2+…

FFTs in Graphics and Vision - Department of …misha/Fall08/07.pdfBut this is just a rotation in the...

Documents

Transcript of FFTs in Graphics and Vision - Department of …misha/Fall08/07.pdfBut this is just a rotation in the...

2016. december hónjt.hu/onkorm/attachments/adl_id/1969/156847/... · m k Ö Ö Ö m k m k m k m k m k m k m k m k m k m k Ö Ö Ö Ö Ö Ö Ö m k m k m k m k m k m k m k m k m k

² ¼ k§ão Webinário 4... · Ç ² ¼ åö mö ²ö k ö J x ²!ªÇw w ²À0ª F mZ(0ª ²0ªßX( ª Jö ö m ´ ! ö ! x bö x ö x ! ö x k ¼ m J ö x L

mUM ÖF SEIMS ANS MrOSBSTÄTiC TESr REOOiREMENTS IN … · mUM ÖF SEIMS ANS MrOSBSTÄTiC TESr REOOiREMENTS IN SFfCIFIC/ITffiNS frame H. Sab.r, TTD C JLJLIXY . Utblü D April 1967

S K @* 0 X b f0 - u-gakugei.ac.jpkokoro/education/education07/data/H...Æ8 ö 5 0i K S M 6ë º v æ ö% \ K S £'z í#Õ 6ë º v Þ í æ b q ö ì #Õ t º v Ô ¹ Ý \ K Z ` p4

k å } | - KIAS · 12 Ü Ä v (13 Æ ö K K _ Q v Ö z â Þ ô } v Ì Å ³ . w Ã Æ ½ à { ú § ¥ w ò V ¥ Å À Æ ö K $ M ½ 4 x ³ .

ÖF q 3HØÃ} HÈ s ] $ Bwap.china-pharmacy.com/attached/wenjian/14/202017/20200909_15… · Ö q \ ñ / China Pharmacy Vol. No. ñ N U G Ê 2 M d :À GÖ¨ ! 5ª AÖÝ q { > ³ À}

p ÁBnB·B B^B B·BzB· K Ö³/ 26 2 ¬ 20P M© Mº Mº P (Ö ; Ð 2D & (Ö " (Ö (ò "L M© Mº Mº " ÿ " " Ú ÃBú 116 , (Ö-%¤4Ú 43.1 Bæ (Ö Í 31.9 Bæ d1b P B.BKBNBE B1B

‰Ö¶ rS`˜ c h4 ˝ˆ˝ *K .c p@ˇXd ”— $ ” ”˛ ^

á Ö ô P Z ¾ & Ê = ¬ R æ × 7media.zhengjian.org/media/2018/08/02/QL155_1.pdf · i ê W Ö c á á Ë c / Å s Ì d K Ö x * F { Ë ® d Ë C I Ë ° ö G ¶ l Ç ¤ Ì # K

( ö = 2'¨>2>1 ²'¨ >0 8o '¨>0 0Á v 0[8 - Ibaraki...- 9 - Õ M Ó ö = >1 ( 4.0 >k 6 2.7 >k è Vb 3û2 _ KM M ^ Zb + ï 0Á 8 ö ='¨>0'¨ >1 8o'¨>0 b0d _0ñ \M \ [ (@>2>k 6 2.7

CS10707 2017 UL Annual Report Final JA · ö?öVö?ö ö?öbödön · õ Ú K - % f " ö ö?öW öjödöhösögö?övöhökököhö`ölör öoö ö ö ö ö ö ö ö öKö?öbö ö

Stratocumulus – Theory and Model Martin K ö hler (room 108)

* q Ã R z Ö Ö K4® Â4¾ { G K%¤! æchemical-net.env.go.jp/pdf/20110721_China_j.pdf2011/07/21 · K4® _ K!¢!¥"Ù#)%¤! 1ñ47 Ç Ö K4® é ¿ : n ± { O _ !c º ;/ -!!¢

TO · 2011. 5. 14. · COLi IFROLLED .FKAGMEtiTATI OH PROJECT I1ES ö fch Part'al R £ p,p r t, ' • • FrMGKEUTATIOiTYtifo Öf ^'/50 ROD PROJECTILES --. •. .EX 26 MOO -I MID

Tamil ver-4 12 - edge.sin1.swiftserve.comedge.sin1.swiftserve.com/angeltv/angeltvlfd/downloads/GOKT_tamil... · Ö¼B· »aBºï^ ¼>k Ö«VëBÝ][ ·s¼ÄrD_14JAN2012_OUTSIDE °ÞÄ_

Ö ? e 7 } m v K B m K i F !d ¦ Ñ Ï Ñ Ï

« ê k ç U...ñ s Ö Î « ê k ç F Ö ¤ è ¥ Ö v Q h ¸ Ï ! ê & ê . : î ß · Ï u ´ ¨ Ó Ç £ ¼ Ê ´ t « « ¬ ª ¢ f £ « ³ : Ñ Ó v v ¸ 4 ´ ¯ t ~ ¿ « C

Z D Ö : O ï Â D ø - WHO · 2019. 6. 1. · : F ö ¬ s ¦ K Æ Z f ^ Ö M µ ` û F î ¬ Z ¼ ] F õ Z D 8 ¬ s F Ö: ` x ¬ ` x ¬ s ¦ L x ^ Ö M µ P u k 9 a ¬ Ö: ` x è

K ö ln (Cologne)

K G » y ¢ ¢ ê Ë ö Õ