Mvfl_part2 Cvpr2012 Spatio-temporal and Higher-Order Feature Learning

7/31/2019 Mvfl_part2 Cvpr2012 Spatio-temporal and Higher-Order Feature Learning

http://slidepdf.com/reader/full/mvflpart2-cvpr2012-spatio-temporal-and-higher-order-feature-learning 1/59

Outline

1 IntroductionFeature LearningCorrespondence in Computer VisionRelational feature learning

2 Learning relational featuresSparse Coding Review

Encoding relationsInferenceLearning

3 Factorization, eigen-spaces and complex cellsFactorizationEigen-spaces, energy models, complex cells

4 ApplicationsApplicationsConclusions

Roland Memisevic (Uni Frankfurt) Multiview Feature Learning Tutorial at CVPR 2012 33 / 174

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Outline







http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Modeling data with latent variables

z

yf (

z

)g(y )


http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Sparse Coding Review

× z5

× z4y

× z3

× z2

× z1

Model an image-patch as the superposition of basis functions , or“lters ”:y =

k

W ·k zk , y j =k

w jk zk


http://0.0.0.0/

http://0.0.0.0/

http://goforward/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Feature Learning Graphical Model

y j

z

zk

y

w jk

yα j =

k

w jk zαk

Synthesis model

Parameters w jk connect pixels y j with code components zk

Dimensionality of z can be smaller, larger, or same as y

When the dimensionality is the same or larger, then z must beconstrained , eg. by forcing it to be sparse .


http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




y j

z

zk

y

w jk

yα j =

k

w jk zαk

Learning

Given data-set y 1 , . . . , y N , adapt parameters W , inferringz 1 , . . . , z N along the way.Unsupervised learning.


http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




y j

z

zk

y

w jk

yα j =

k

w jk zαk

Learning

For example

minW, z 1 ,..., z N

1N

α

y α −k

zαk W .k 2 + λ

k

|zαk |

Alternating between W and all zα

.Roland Memisevic (Uni Frankfurt) Multiview Feature Learning Tutorial at CVPR 2012 37 / 174

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




y j

zzk

y

w jk

yα j =

k

w jk zαk

Inference (“Analysis”)Given new image y , compute z .This is how we do recognition.


http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Feature learning models

y j

z

zk

y

w jk

y j =k

w jk zk

Many Variants

Probabilistic vs. Non-probabilistic;Directed vs. undirected;Mixture vs.factorial vs. non-symmetric


http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




y j

w jk

zk

by j

y

z

bzk

y j =k

w jk zk + by j

In practice: add bias terms.But we drop these for now to avoid clutter.


http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




y j

z

zk

y

w jk

y j =k

w jk zk

Some sparse coding models make inference easy:


http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




y j

z

zk

y

w jk p(y j |z ) = sigmoidk

w jk zk

p(zk |y ) = sigmoid

j

w jk y j

Restricted Boltzmann machine (RBM)

p(y , z ) = 1Z exp jk w jk y j zk

Contrastive Divergence learning (Hebbian-style learning)

Inference: p(zk |y ) = sigmoid j w jk y j

Further advantage: Allows for stacking (d e ep le a rning).Roland Memisevic (Uni Frankfurt) Multiview Feature Learning Tutorial at CVPR 2012 38 / 174

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




zk

y j

wkj

a jk

y

y

y j

zk = sigmoid j

a jk y j

y j =k

w jk zk

AutoencoderAdd inference parameters A , and set z = sigmoid ( Ay )Learning: min W,A α y α − W sigmoid ( Ay α ) 2

Add a sparsity penalty for z , or corrupt inputs during training (Vincent et al., 2008)


http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




y j

z

y

zk

w jk

y j =k

w jk zk

Independent Components Analysis (ICA)Learning: Make responses sparse, while keeping lters sensible

minW

W T y 1

s.t . W T W = I


http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Feature learning summary

z (y ) = W T y

y (z ) = W z

y j

z

zk

y

w jk

Linear inference summaryA lot of methods dene inference through linear dependenciesbetween y and z .PCA, ICA, Restricted Boltzmann Machine, Autoencoder, Mixtureof Gaussians, KMeans, ...


l

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




z (y ) = W T y

y (z ) = W z

y j

z

zk

y

w jk


Almost all methods yield Gabor lters when trained on naturalimages.Almost all based on the same rationale:Tease apart the hidden causes of variability in the data.


O li

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Outline







S di f i i ?

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Sparse coding of images pairs?

?

x y

z

How to extend sparse coding to model relations?Sparse coding on the concatenation ?


S di f i i ?

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




?

x y

z



S di g f i g i ?

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




?

x y

z



Sparse coding on the concatenation ?

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




A case study: Translations of binary, one-d images.Suppose images are random and can change in one of threeways:

Example Image x : Possible Imagey :



http://0.0.0.0/

http://0.0.0.0/

http://find/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




A hidden variable that collects evidence for a shift to the right.What if the images are random or noisy?Can we pool over more than one pixel?

zk



http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/





zk



http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/





zk

?



http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/




Obviously not, because now the hidden unit would get equallyhappy if it would see the non-shift (second pixel from the left).The problem: Hidden variables act like OR-gates, that accumulateevidence.

zk

?


Cross-products

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Cross products

Intuitively, it seems, what we need instead are logical ANDs, whichcan represent coincidences (eg. Zetzsche et al., 2003, 2005).This amounts to using the outer product L := outer( x , y ):

We can unroll this matrix, and let this be the data:zk

wijk


Cross-products

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Cross products

In the shift-example, every component L ij of the outer-productmatrix will constitute evidence for exactly one type of shift.Hiddens pool over products of pixels.


Cross-products

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Cross products



Cross-products

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Cross products



A family of manifolds

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y

y

x

An different perspective:

Feature learning reveals the (local) manifold structure in data.When y is a transformed version of x , we can still think of y asbeing conned to a manifold, but it will be a conditional manifold .Idea: Learn a model for y , but let parameters be a function of x .


Three-way interactions

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y

y j

x i

x y

z

zk

If we use a linear function, we have w jk (x ) = i wi jk x i .Inference turns into:

zk = j

w jk y j = j i

wijk x i y j =ij

wijk x i y j

Hidden units are a bilinear function of the two input images.


Conditional sparse coding

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



p g

y j

x i

x y

z

zk

This is a feature learning model, whose parameters are

modulated by inputs.So this is a conditional feature learning model.(Tenenbaum, Freeman; 2000), (Grimes, Rao; 2005), (Ohlshausen;2007), (Memisevic, Hinton; 2007)


An alternative visualization

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y jx i

zk

z

x y

Each hidden variable can blend in one slice W ··k of the parametertensor.Each slice does linear regression in “pixel space”.So for binary hiddens, this is a mixture of 2K image warps .


Outline

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



1 IntroductionFeature Learning

Correspondence in Computer VisionRelational feature learning

2 Learning relational featuresSparse Coding ReviewEncoding relationsInferenceLearning




Inference

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y j

x i

x y

z

zk

Given any two sets of variables, it is easy to infer the third.As a result, inference is basically the same as in any standardsparse coding model.(Graph is tri-partite, sparse coding bi-partite.)


Inference

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y j

x i

x y

z

zk

Inferring z (given x and y )Infer the transformation z by modulating parameters linearly:

zk = j

w jk y j =k i

wijk x i y j =ij

wijk x i y j


Inference

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y j

x i

x y

z

zk

Inferring z (given x and y )The meaning of z : The transformation that takes x to y (or viceversa).


Inference

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y j

x i

x y

z

zk

Inferring y (given x and z )For y , we have

y j =k

w jk zk =k i

wijk x i zk =ik

wijk x i zk


Inference

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y j

x i

x y

z

zk

Inferring y (given x and z )The meaning of y : “x transformed according to z ”.


Inference

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y j

x i

x y

z

zk

Inference can mean various other things in addition.For example, given x and y , how likely are these to cometogether?More on this type of inference later.


Outline

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



1 IntroductionFeature Learning

Correspondence in Computer VisionRelational feature learning2 Learning relational features

Sparse Coding ReviewEncoding relationsInferenceLearning

3 Factorization, eigen-spaces and complex cellsFactorization

Eigen-spaces, energy models, complex cells4 Applications

ApplicationsConclusions


Learning

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y j

x i

x y

z

zk

Training data are now pairs (x α , y α ) – the points we want torelate.

The parameter-gating relation shows that one way to train thismodel is:

Conditional sparse codingPredict y from x , inferring z along the way as usual.



http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y j

x i

x y

z

zk

The cost that data-case (x α , y α ) contributes is:

jyα j −

ikwijk x αi zαk )2

Differentiating with respect to wijk just like before.Inference is still linear wrt. parameters.



http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Conditional sparse coding is predictive coding :We model the next time frame, given the previous one.

Inference then provides an encoding of the transformation.This is often a sensible strategy, but not always as we shall see.


Example: Gated Boltzmann machine

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y j

x i

x y

z

zk

For a restricted Boltzmann machine, this amounts to changing theenergy function into a three-way energy (Memisevic, Hinton;2007):

E (x , y , z ) =ijk

wijk x i y j zk

Then p(y , z |x ) = 1Z (x ) exp E (x , y , z ) ,

Z (x ) = y ,z exp E (x , y , z )Roland Memisevic (Uni Frankfurt) Multiview Feature Learning Tutorial at CVPR 2012 59 / 174

Example: Gated Boltzmann machine

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y j

x i

x y

z

zk

For a restricted Boltzmann machine, this amounts to changing theenergy function into a three-way energy (Memisevic, Hinton;2007):

E (x , y , z ) =ijk

wijk x i y j zk

Then p(y , z |x ) = 1Z (x ) exp E (x , y , z ) ,

Z (x ) = y ,z exp E (x , y , z )Roland Memisevic (Uni Frankfurt) Multiview Feature Learning Tutorial at CVPR 2012 59 / 174

Example: Gated auto-encoder

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



x izzk

y j y

y j

x

Similar for autoencoders.

Both, encoder and decoder weights turn into functions of x .Learning the same as in a standard auto-encoder modeling y .The model is still a DAG, so back-prop works exactly like in astandard autoencoder.


Other examples

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



(Grimes, Rao; 2005): Bi-linear sparse coding(Ohlshausen et al.; 2007): Conditional, bi-linear sparse coding(Luecke, et al.; 2007): Neurally inspired control unit networks


Toy example: Conditionally trained “Hiddenow elds”

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



ow-elds


Toy example: Conditionally trained “Hiddenow elds” inhibitory connections

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



ow-elds , inhibitory connections


Toy example: Learning optical ow

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



x testx y z y (x test , z )


“Combinatorial owelds”

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



x testx y z y (x test , z )


Joint training

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y jx i

zk

z

x y

Conditional training makes it hard to answer questions like:“How likely are the given images transforms of one another?”To answer questions like these, we require a joint image model, p(x , y |z ), given mapping units.


Joint training

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y jx i

zk

z

x y

E (x , y , z ) =ijk

wijk x i y j zk

p(x , y , z ) =1

Z exp E (x , y , z )

Z =x ,y ,z

exp E (x , y , z )

(Susskind et al., 2011): Three-way Gibbs sampling in a GatedBoltzmann Machine.Can apply this to matching tasks (more later).


Joint training

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



x izzk

y j y

y j

x

For the auto-encoder, there is a simple hack:

Add up two conditional costs: j yα

j − ik wijk x αi zα

k )2+ i x αi − jk wijk yα

j zαk )2

This forces parameters to be able to transform in both directions.


Joint training

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y jzzk

x i x

x i

y

x

For the auto-encoder, there is a simple hack:

Add up two conditional costs: j yα

j − ik wijk x αi zα

k )2+ i x αi − jk wijk yα

j zαk )2

This forces parameters to be able to transform in both directions.


Pool over products

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



Take-home messageTo gather evidence for a transformation,

let each hidden unit pool over products of input-components.


Some references

http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/



y jx i

zk

z

x y

(Hinton; 1981), (v.d. Malsburg; 1981)(Grimes, Rao; 2005): Bi-linear sparse coding.(Tenenbaum, Freeman; 2000), (Grimes, Rao; 2005), (Ohlshausen;2007), (Memisevic, Hinton; 2007), (Susskind, et al., 2011)(Zetzsche et al.; 2003, 2005)


http://0.0.0.0/

http://0.0.0.0/

http://find/

http://goback/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

http://0.0.0.0/

Mvfl_part2 Cvpr2012 Spatio-temporal and Higher-Order Feature Learning

Documents

Transcript of Mvfl_part2 Cvpr2012 Spatio-temporal and Higher-Order Feature Learning