Joint Recovery of Dense Correspondence and Cosegmentation ... · Ours Single layer SIFT Flow DSP...

Post on 06-Oct-2020

0 views 0 download

Transcript of Joint Recovery of Dense Correspondence and Cosegmentation ... · Ours Single layer SIFT Flow DSP...

Experiments

Contributions

Introduction Hierarchical (layered graph) model

Joint Recovery of Dense Correspondence and Cosegmentation in Two ImagesTatsunori Taniai (The University of Tokyo) Sudipta N. Sinha (Microsoft Research) Yoichi Sato (The University of Tokyo)

New dataset with ground truth/evaluation toolkit

• 400 image pairs, 7 object categories

New joint model and inference technique

• Discrete-continuous labeling problem for flow and segmentation

estimation in a hierarchical MRF model.

• Joint inference of hierarchical structure and labeling via an energy

minimization framework using iterated graph cuts.

• Recovers layered structure of nested image regions.

• We propose a method to simultaneously recover

cosegmentation and correspondence (or flow) maps.

• Our joint formulation improves performance on both tasks;

more accurate than existing methods that solve either task.Why hierarchy ? We need powerful regularization to be robust against

significant appearance dissimilarity of different object instances.

Why not precompute hierarchical structure ? A good hierarchical structure

must respect object boundary and smoothness of the flow map. However, these

are not available a priori and thus, jointly inferred with the flow and segmentation.

Flow

data term

Cosegmentation

data term

Graph structure

term

Layer 1

(final result)

Layer 2

Layer K

𝐹 𝐺, 𝑓, 𝛼 = 𝐸graph 𝐺 + 𝐸flow 𝑓|𝐺 + 𝐸seg 𝛼|𝐺 + 𝐸reg 𝑓, 𝛼|𝐺

Pairwise

smoothness term

Applications

• 3D reconstruction from object categories [Vicente+ CVPR’14]

• Non-parametric scene parsing

[Liu+ TPAMI’11, Smith+ CVPR’13, Karsch+ TPAMI’14]

Pixel grid

Methods Flow Coseg. Regularization

Our method ✔ ✔ Hierarchical MRF

Our method (no hierarchy) ✔ ✔ 2D MRF

SIFT flow [Liu+ TPAMI’11] ✔ 2D MRF

DSP [Kim+ CVPR’13] ✔ Pyramid hierarchy

DAISY filter flow

[Yang+ CVPR’14]✔

No explicit

regularization

Faktor & Irani [ICCV’11] ✔ ―

Joulin+ [CVPR’10] ✔ ―

0.0

0.2

0.4

0.6

0.8

1.0

0 5 10 15

FG3DCar

0.0

0.2

0.4

0.6

0.8

0 5 10 15

JODS

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0 5 10 15

PASCAL

―――― Input ――――

Two images of semantically

related but different object

instances from similar views

――――――― Output ―――――――

Common object

cosegmentation

(binary mask) and

dense flow map

that aligns the

common region

in the images

Label transfer [Smith+ CVPR’13]

Database

images

• Node sparsity

• Color consistency

of superpixels

• HOG features for

appearance matching

• Similarity transform

• FG/BG color

likelihood (learned

during initialization)

• Spatial neighbors

• Parent child edges

Two-step optimization1) Bottom-up graph construction

Incrementally add layers from lower levels, while estimating flow and segmentation labels.

Initialize Add a layer by merging nodes

2) Top-down labeling refinement

Bottom layer nodes

Bottom layer labels

(final output)

Fixed

Update flow and segmentation labels, while keeping the graph structure fixed.

FG BG FlowsLabels

Update all labels

Fixed

Based on continuous MRF optimization technique (via graph cuts)

Taniai+. “Continuous Stereo Matching Using Local Expansion Moves” (arXiv 2016)

Update lower-layers labels

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

Ours Single layer

Faktor & Irani Joulin+

Flow accuracy

Acc

ura

te p

ixels

(%

)

Flow error threshold in a normalized scale (image width is 100 px)

Ours Single layer SIFT Flow DSP DFF

Cosegmentation accuracy

FG3DCar JODS PASCAL

Inte

rsect

ion

-over-

un

ion

rati

o

Input Flow (correspondence) Cosegmentation

FG

3D

Car

JOD

SPA

SC

AL

Dataset info

Images in our dataset are

grouped by their source

• FG3DCar [Lin+ ‘14]

• JODS [Rubinstein+ ‘13]

• PASCAL [Hariharan+ ‘11]

Input Ground truth Ours SIFT Flow DSP Ours Faktor&Irani Joulin+

http://taniai.space/...