Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel...

82
Kernel Methods for Domain Adaptation Boqing Gong University of Southern California Joint work with Kristen Grauman and Fei Sha

Transcript of Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel...

Page 1: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Kernel Methods for Domain Adaptation !!Boqing Gong University of Southern California !!

Joint work with Kristen Grauman and Fei Sha

Page 2: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Key to computer vision research

Instrumental to benchmark different methods !

But, datasets are biased

Need undo bias when developing vision systems

Vision datasets

Page 3: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Setup

Source domain (with labeled data)

!

Target domain (no labels for training)

Unsupervised domain adaptation (DA)

?

Page 4: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Setup

Source domain (with labeled data)

!

Target domain (no labels for training)

Objective

Learn classifier to work well on target

Unsupervised domain adaptation (DA)

Different distributions

?

Page 5: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Unsupervised DA is ill-posed

Image credits: https://www.indiegogo.com/projects/more-than-enough

Page 6: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Make assumptions

- Covariate shift, target shift, sample selection bias, etc. !

Need domain knowledge

- Exploring intrinsic structures in data (Subspace, cluster, manifold, landmarks, etc.)

Unsupervised DA is ill-posed

Image credits: https://www.indiegogo.com/projects/more-than-enough

Page 7: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Background - quick reviewCorrecting sampling bias

[Shimodaira, ’00]

[Huang et al., Bickel et al., ’07]

[Sugiyama et al., ’08]

[Sethy et al., ’06]

[Sethy et al., ’09]

+

----++ Re-weight source instances

Page 8: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Background - quick reviewCorrecting sampling bias

[Shimodaira, ’00]

[Huang et al., Bickel et al., ’07]

[Sugiyama et al., ’08]

[Sethy et al., ’06]

[Sethy et al., ’09]

Adjusting mismatched models

[Evgeniou and Pontil, ’05]

[Duan et al., ’09]

[Duan et al., Daumé III et al., Saenko et al., ’10]

[Kulis et al., Chen et al., ’11]

+

----++

+

----++

Page 9: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Background - quick reviewCorrecting sampling bias

[Shimodaira, ’00]

[Huang et al., Bickel et al., ’07]

[Sugiyama et al., ’08]

[Sethy et al., ’06]

[Sethy et al., ’09]

Adjusting mismatched models

[Evgeniou and Pontil, ’05]

[Duan et al., ’09]

[Duan et al., Daumé III et al., Saenko et al., ’10]

[Kulis et al., Chen et al., ’11]

+

----++

+

----++

Inferring domain-invariant features

[Pan et al., ’09]

[Blitzer et al., ’06] [Gopalan et al., ’11]

[Chen et al., ’12][Daumé III, ’07]

[Argyriou et al, ’08] [Gong et al., ’12]

[Muandet et al., ’13]

+++-

- +-+- +

x 7! z, s.t.

PS(z, y) ⇡ PT (z, y)

Page 10: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Key: to reduce source-target discrepancy

Our solution: kernel methods for !

Inferring domain-invariant features

Directly matching distributions

Geodesic flow kernel

Page 11: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Key: to reduce source-target discrepancy

Our solution: kernel methods for !

Inferring domain-invariant features

Directly matching distributions

Many factors overlap &

Landmarks Latent domains

Page 12: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

!

Inferring domain-invariant features

Geodesic flow kernel

Kernel methods for DA

Page 13: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Target

Source

Modeling data via subspaces

Assume low-dimensional structure

!

!

!

!

E.g., PCA, LDA, partial least squares

Page 14: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

!

!

!

!

Grassmann manifold - Collection of d-dim subspaces of a vector space

- Each point corresponds to a subspace

Target subspaceSource

subspace

Characterizing domains geometrically

Page 15: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

TargetSource

Geodesic flow on the manifold – starting at source & arriving at target in unit time – flow parameterized with one parameter t – closed-form, easy to compute with SVD

Modeling domain shift with geodesic flow

Page 16: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

TargetSource

Geodesic flow on the manifold – starting at source & arriving at target in unit time – flow parameterized with one parameter t – closed-form, easy to compute with SVD

Modeling domain shift with geodesic flow

Page 17: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

TargetSource

Domain-invariant features

Page 18: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

TargetSource

More similar to source.

Domain-invariant features

Page 19: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

More similar to target.

TargetSource

Domain-invariant features

Page 20: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Blend the two.

TargetSource

Domain-invariant features

Page 21: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

The kernel trick

!

Avoiding the explicit mapping

z

1 = [�(0)Tx, · · · ,�(t)Tx, · · · ,�(1)Tx]

z1 ! hz1i , z1j i

Page 22: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

!

We define the geodesic flow kernel (GFK): !

!

!

Advantages !– Analytically computable, clean formulation –Only one hyperparameter, automatically determined – Broadly applicable: can kernelize many classifiers

hz1i , z

1j i =

Z 1

0

��(t)Txi

�T ��(t)Txj

�dt = x

Ti Gxj

Domain-invariant kernel

[Gong et al., CVPR’12]

Page 23: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Four vision datasets/domains on visual object recognition

[Griffin et al. ’07, Saenko et al. 10’] !

10 common classes

10~100 images per class

Bag-of-words and SURF features

Experimental study

The Office

Page 24: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Comparison results

0

15

30

45

60

A-->C A-->D C-->A C-->W W-->A W-->C

No adaptation Gopalan et al.'11 Pan et al.'09GFK Landmark

Acc

urac

y (%

)

Office

Page 25: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Comparison results

0

15

30

45

60

A-->C A-->D C-->A C-->W W-->A W-->C

No adaptation Gopalan et al.'11 Pan et al.'09Landmark

Acc

urac

y (%

)

Office

Page 26: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Comparison results

0

15

30

45

60

A-->C A-->D C-->A C-->W W-->A W-->C

No adaptation Gopalan et al.'11 Pan et al.'09GFK Landmark

Acc

urac

y (%

)

Office

Page 27: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

!

Inferring domain-invariant features

Geodesic flow kernel

Kernel methods for DA

hz1i , z

1j i =

Z 1

0

��(t)Txi

�T ��(t)Txj

�dt = x

Ti Gxj

Page 28: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

!

Directly matching distributions

Geodesic flow kernel Landmarks

Kernel methods for DA

Many factors overlap & interact

Latent domains

Page 29: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

A case study: building vision systems for Amazon images

Image credits: Nadine DeNinno (http://www.ibtimes.com)

Page 30: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Pose: Canonical

Lighting: Uniform

background: clean

Not all source instances equally adaptable/useful

Page 31: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Landmarks are labeled source instances distributed similarly to the target domain.

Selecting most adaptable source instances

Source

Target[Gong et al., ICML’13]

Page 32: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Landmarks are labeled source instances distributed similarly to the target domain.

Source

Target[Gong et al., ICML’13]

Selecting most adaptable source instances

Page 33: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Landmarks are labeled source instances distributed similarly to the target domain.

Identifying landmarks: Source

Target[Gong et al., ICML’13]

Selecting most adaptable source instances

Page 34: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Landmarks are labeled source instances distributed similarly to the target domain.

Identifying landmarks: Source

Target[Gong et al., ICML’13]

?

Selecting most adaptable source instances

Page 35: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Kernel embedding of distributions

µ[P ] , Ex

[�(x)]

!

µ maps distributions to RKHS

RKHS associated with kernel k(,), and ϕ(x)=k(x,· )

H

Page 36: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Kernel embedding of distributions

µ[P ] , Ex

[�(x)]

H

Page 37: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Kernel embedding of distributions

!

The mapping µ is injective, if k(,) is characteristic

µ[P] preserves all statistical features of P(x)

µ[P ] , Ex

[�(x)]

H

Page 38: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Kernel embedding of distributions

µ[P ] , Ex

[�(x)]

H

Page 39: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Kernel embedding of distributions

µ[P ] , Ex

[�(x)]

H!

Empirical kernel embedding:

µ̂[P ] =1

n

nX

i=1

�(xi), xi ⇠ P

Page 40: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

A distance of distributions

!

d(,) is a metric of distributions, if k(,) is characteristic

Maximum mean discrepancy (MMD): the sup() operation

[Müller’97,Gretton et al.’07,Sriperumbudur et al.’10]

d(P1, P2)

, supkfkH1

(Ex⇠P1f(x)� E

x⇠P2f(x))

= · · ·= kµ[P1]� µ[P2]kH H

d(P1 , P

2 )

Page 41: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Integer programming

!

!

where

!

Identifying landmarks by empirical MMD

min{↵m}

�����1Pi ↵i

MX

m=1

↵m�(xm)� 1

N

NX

n=1

�(xn)

�����

2

H

[Gong et al., ICML’13]

↵m =

⇢1 if xm is a landmark wrt target0 else

m = 1, 2, · · · ,M

Page 42: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Convex relaxation

Identifying landmarks by empirical MMD

Page 43: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Convex relaxation

Identifying landmarks by empirical MMD

�m =↵mPi ↵i

Page 44: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Convex relaxation

Identifying landmarks by empirical MMD

�m =↵mPi ↵i

Page 45: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Four vision datasets/domains on visual object recognition

[Griffin et al. ’07, Saenko et al. 10’] !

10 common classes

10~100 images per class

Bag-of-words and SURF features

Experimental study (cont’d)

The Office

Page 46: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Comparison results: object recognition

0

15

30

45

60

A-->C A-->D C-->A C-->W W-->A W-->C

No adaptation Gopalan et al.'11 Pan et al.'09GFK Landmark

Acc

urac

y (%

)

Office

Page 47: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Comparison results: object recognition

0

15

30

45

60

A-->C A-->D C-->A C-->W W-->A W-->C

No adaptation Gopalan et al.'11 Pan et al.'09GFK Landmark

Acc

urac

y (%

)

Office

Page 48: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

How do landmarks look like?

Headphone Mugtarget

TargetS

ource

Page 49: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

How do landmarks look like?

Headphone Mugtarget

TargetS

ource

6σ=2

0σ=2

-3σ=2

Page 50: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

How do landmarks look like?

Headphone Mugtarget

TargetS

ource

6σ=2

0σ=2

-3σ=2

Unselected

Page 51: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Landmarks

Labeled source instances Distributed similarly to the target

!

New intrinsic structure shared between domains

Proxy of discriminative loss of target

Outperformed the state-of-the-arts

Page 52: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Geodesic flow kernel Landmarks

Many factors overlap & interact

Latent domains

Kernel methods for DA!

Directly matching distributions

Page 53: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Domains = datasets?

Most DA methods

Assume good-quality domains

Evaluated as cross-dataset adaptation

Page 54: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Domains = datasets?

Most DA methods

Assume good-quality domains

Evaluated as cross-dataset adaptation

Common mistake: equating datasets with domains

Suboptimal to use DA methods for cross-dataset generalization

Page 55: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

What constitutes a domain?

In speech and NLP:

Speakers

Languages

Article topics

…other factors

Page 56: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

What constitutes a domain?

In speech and NLP:

Speakers

Languages

Article topics

…other factors

In computer vision:

Factors?

!

? ?

??

Page 57: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

What constitutes a domain?

In speech and NLP:

Speakers

Languages

Article topics

…other factors

!

!

In computer vision:

Pose

Page 58: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

What constitutes a domain?

In speech and NLP:

Speakers

Languages

Article topics

…other factors

!

!

In computer vision:

PoseLighting

Page 59: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

What constitutes a domain?

In speech and NLP:

Speakers

Languages

Article topics

…other factors

!

!

In computer vision:

PoseLighting

Fore/Background

Page 60: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

What constitutes a domain?

In speech and NLP:

Speakers

Languages

Article topics

…other factors

!

!

In computer vision:

PoseLighting

Occlusion

Fore/Background

Page 61: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

What constitutes a domain?

In speech and NLP:

Speakers

Languages

Article topics

…other factors

!

!

In computer vision:

PoseLighting

Occlusion

Fore/Background

Many factors overlap & interact

Page 62: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

In computer vision:

Hard to manually enumerate/discretize visual factors

Our solution: working with distributions via kernels

!

What constitutes a domain?

In speech and NLP:

Speakers

Languages

Article topics

…other factors

!

!

? ?

??

Page 63: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Domains as distributions

Labeled data could be drawn from multiple domains/distributions

!

!

!

!

Reshaping data to domains before adaptation

P1 P2 PK…

Visual datasets

Page 64: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Two axiomatic properties

I. Maximum distinctiveness:

Identifying distinct domains maximally different in distribution from each other

!

II. Maximum learnability

Being able to derive strong discriminative models from the identified domains

Page 65: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Domains maximally different in distribution from each other

[Gong et al., NIPS’13]

I. Maximum distinctiveness

…P1 P2 PK

Hmax

{zmk}

X

k 6=k0

ˆd(Pk, Pk0; {zmk})

zmk =

⇢1 if xm 2 the k-th domain

0 else

m = 1, 2, · · · ,M, k = 1, 2, · · · ,K

Page 66: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Being able to learn strong classifiers from domains !

Within-domain cross-validation

!

!!

-Determining the number of domains K

II. Maximum learnability

[Gong et al., NIPS’13]

Accuracy(K) =KX

k=1

Mk

MAccuracyk

Page 67: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Four vision datasets/domains on visual object recognition

[Griffin et al. ’07, Saenko et al. 10’] !

Five views/domains on cross-view human action recognition

[Weinland et al.’07]

!

Experimental study

The Office

IXMAS

Page 68: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Comparison results

Sources/datasets A, C D, W C, D, W View 0,1 View 2,3,4

Targets D, W A, C A View 2,3,4 View 0,1

From 41.0 32.6 41.8 44.6 47.1

Fromdomains

42.6 35.5 44.6 47.3 50.3

Cross-domain adaptation fffffffsfjalsjflajfljalfjlaljfljf > cross-dataset adaptation

Page 69: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Comparison results

Sources/datasets A, C D, W C, D, W View 0,1 View 2,3,4

Targets D, W A, C A View 2,3,4 View 0,1

From 41.0 32.6 41.8 44.6 47.1

Fromdomains

42.6 35.5 44.6 47.3 50.3

Page 70: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Comparison results

Sources/datasets A, C D, W C, D, W View 0,1 View 2,3,4

Targets D, W A, C A View 2,3,4 View 0,1

From 41.0 32.6 41.8 44.6 47.1

Fromdomains

42.6 35.5 44.6 47.3 50.3

Cross-domain adaptation fffffffsfjalsjflajfljalfjlaljfljf > cross-dataset adaptation

Page 71: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Assigning test data to discovered domains

Reshaping test set

tnk = I(xn is assigned to the k-th domain)

min{tnk}

X

k

d̂ (domaink,target; {tnk})

[Gong et al., NIPS’13]

Page 72: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Latent domains

Dataset ≠ domain

Suboptimal to adapt across datasets using DA methods

Cross-domain adaptation > cross-dataset adaptation !

Identifying latent domains

Maximum distinctiveness & maximum learnability

A non-parametric kernel method

P1 P2 PK…

Visual datasets

Page 73: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Summary (1/2)

(unsupervised) Domain adaptation is

Potentially successful solutions come with

Appropriate assumptions

Well-modeled domain knowledge !

Dataset≠domain, cross-dataset generalization by

Identifying landmarks from dataset

Reshaping data to obtain good-quality domains

ill-posed

Page 74: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Summary (2/2)

Kernel methods for domain adaptation: !

Inferring domain invariant features

Geodesic flow kernel (GFK)

Kernel trick

Directly matching distributions

Landmarks and Latent domains

Kernel embedding of distributions

Page 75: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Many factors overlap & interact

Geodesic flow kernel Landmarks Latent domainshz1i , z

1j i

=

Z 1

0

��(t)Txi

�T ��(t)Txi

�dt

=x

Ti Gxj

P1 P2 PK…

Visual datasets

Code available: http://www-scf.usc.edu/~boqinggo

Page 76: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Thanks!

Many factors overlap & interact

Geodesic flow kernel Landmarks Latent domainshz1i , z

1j i

=

Z 1

0

��(t)Txi

�T ��(t)Txi

�dt

=x

Ti Gxj

P1 P2 PK…

Visual datasets

Code available: http://www-scf.usc.edu/~boqinggo

Page 77: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!
Page 78: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Latent domains

Page 79: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Domain adaptation from discovered domains

vs. from original source domains/datasets

to all possible target domains !

Evaluation metric: expected/averaged accuracy

Evaluation strategy

ET [domains|datasets] ! T

Page 80: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Amazon images from [Saenko et al.’10].

Hard to manually define discrete domains

Page 81: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Domain I Domain II

Our reshaped domains

Page 82: Kernel Methods for Domain Adaptationadas.cvc.uab.es/task-cv2014/wp-content/uploads/2014/10/...Kernel Methods for Domain Adaptation!! Boqing Gong University of Southern California!!

Results: reshaping test data

S S Best domain Reshaping

View 012 37.3 37.7 38.5

View 123 39.9 40.4 41.1

View 234 47.8 46.5 49.2

View 340 52.3 50.7 54.9

View 401 43.3 43.9 44.8

Reshaping both training and test data gives rise aato the best performance.