Joint Recovery of Dense Correspondence and Cosegmentation ... · Ours Single layer SIFT Flow DSP...

1
Experiments Contributions Introduction Hierarchical (layered graph) model Joint Recovery of Dense Correspondence and Cosegmentation in Two Images Tatsunori Taniai (The University of Tokyo) Sudipta N. Sinha (Microsoft Research) Yoichi Sato (The University of Tokyo) New dataset with ground truth/evaluation toolkit 400 image pairs, 7 object categories New joint model and inference technique Discrete-continuous labeling problem for flow and segmentation estimation in a hierarchical MRF model. Joint inference of hierarchical structure and labeling via an energy minimization framework using iterated graph cuts. Recovers layered structure of nested image regions. We propose a method to simultaneously recover cosegmentation and correspondence (or flow) maps. Our joint formulation improves performance on both tasks; more accurate than existing methods that solve either task. Why hierarchy ? We need powerful regularization to be robust against significant appearance dissimilarity of different object instances. Why not precompute hierarchical structure ? A good hierarchical structure must respect object boundary and smoothness of the flow map. However, these are not available a priori and thus, jointly inferred with the flow and segmentation. Flow data term Cosegmentation data term Graph structure term Layer 1 (final result) Layer 2 Layer K ,, = graph + flow | + seg | + reg , | Pairwise smoothness term Applications 3D reconstruction from object categories [Vicente+ CVPR’14] Non-parametric scene parsing [Liu+ TPAMI’11, Smith+ CVPR’13, Karsch+ TPAMI’14] Pixel grid Methods Flow Coseg. Regularization Our method Hierarchical MRF Our method (no hierarchy) 2D MRF SIFT flow [Liu+ TPAMI’11] 2D MRF DSP [Kim+ CVPR’13] Pyramid hierarchy DAISY filter flow [Yang+ CVPR’14] No explicit regularization Faktor & Irani [ICCV’11] Joulin+ [CVPR’10] 0.0 0.2 0.4 0.6 0.8 1.0 0 5 10 15 FG3DCar 0.0 0.2 0.4 0.6 0.8 0 5 10 15 JODS 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0 5 10 15 PASCAL ―――― Input ―――― Two images of semantically related but different object instances from similar views ――――――― Output ――――――― Common object cosegmentation (binary mask) and dense flow map that aligns the common region in the images Label transfer [Smith+ CVPR’13] Database images Node sparsity Color consistency of superpixels HOG features for appearance matching Similarity transform FG/BG color likelihood (learned during initialization) Spatial neighbors Parent child edges Two-step optimization 1) Bottom-up graph construction Incrementally add layers from lower levels, while estimating flow and segmentation labels. Initialize Add a layer by merging nodes 2) Top-down labeling refinement Bottom layer nodes Bottom layer labels (final output) Fixed Update flow and segmentation labels, while keeping the graph structure fixed. FG BG Flows Labels Update all labels Fixed Based on continuous MRF optimization technique (via graph cuts) Taniai+. “Continuous Stereo Matching Using Local Expansion Moves” (arXiv 2016) Update lower-layers labels 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Ours Single layer Faktor & Irani Joulin+ Flow accuracy Accurate pixels (%) Flow error threshold in a normalized scale (image width is 100 px) Ours Single layer SIFT Flow DSP DFF Cosegmentation accuracy FG3DCar JODS PASCAL Intersection-over-union ratio Input Flow (correspondence) Cosegmentation FG3DCar JODS PASCAL Dataset info Images in our dataset are grouped by their source FG3DCar [Lin+ ‘14] JODS [Rubinstein+ ‘13] PASCAL [Hariharan+ ‘11] Input Ground truth Ours SIFT Flow DSP Ours Faktor&Irani Joulin+ http://taniai.space/...

Transcript of Joint Recovery of Dense Correspondence and Cosegmentation ... · Ours Single layer SIFT Flow DSP...

Page 1: Joint Recovery of Dense Correspondence and Cosegmentation ... · Ours Single layer SIFT Flow DSP DFF Cosegmentation accuracy FG3DCar JODS PASCAL n-ver-io Input Flow (correspondence)

Experiments

Contributions

Introduction Hierarchical (layered graph) model

Joint Recovery of Dense Correspondence and Cosegmentation in Two ImagesTatsunori Taniai (The University of Tokyo) Sudipta N. Sinha (Microsoft Research) Yoichi Sato (The University of Tokyo)

New dataset with ground truth/evaluation toolkit

• 400 image pairs, 7 object categories

New joint model and inference technique

• Discrete-continuous labeling problem for flow and segmentation

estimation in a hierarchical MRF model.

• Joint inference of hierarchical structure and labeling via an energy

minimization framework using iterated graph cuts.

• Recovers layered structure of nested image regions.

• We propose a method to simultaneously recover

cosegmentation and correspondence (or flow) maps.

• Our joint formulation improves performance on both tasks;

more accurate than existing methods that solve either task.Why hierarchy ? We need powerful regularization to be robust against

significant appearance dissimilarity of different object instances.

Why not precompute hierarchical structure ? A good hierarchical structure

must respect object boundary and smoothness of the flow map. However, these

are not available a priori and thus, jointly inferred with the flow and segmentation.

Flow

data term

Cosegmentation

data term

Graph structure

term

Layer 1

(final result)

Layer 2

Layer K

𝐹 𝐺, 𝑓, 𝛼 = 𝐸graph 𝐺 + 𝐸flow 𝑓|𝐺 + 𝐸seg 𝛼|𝐺 + 𝐸reg 𝑓, 𝛼|𝐺

Pairwise

smoothness term

Applications

• 3D reconstruction from object categories [Vicente+ CVPR’14]

• Non-parametric scene parsing

[Liu+ TPAMI’11, Smith+ CVPR’13, Karsch+ TPAMI’14]

Pixel grid

Methods Flow Coseg. Regularization

Our method ✔ ✔ Hierarchical MRF

Our method (no hierarchy) ✔ ✔ 2D MRF

SIFT flow [Liu+ TPAMI’11] ✔ 2D MRF

DSP [Kim+ CVPR’13] ✔ Pyramid hierarchy

DAISY filter flow

[Yang+ CVPR’14]✔

No explicit

regularization

Faktor & Irani [ICCV’11] ✔ ―

Joulin+ [CVPR’10] ✔ ―

0.0

0.2

0.4

0.6

0.8

1.0

0 5 10 15

FG3DCar

0.0

0.2

0.4

0.6

0.8

0 5 10 15

JODS

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0 5 10 15

PASCAL

―――― Input ――――

Two images of semantically

related but different object

instances from similar views

――――――― Output ―――――――

Common object

cosegmentation

(binary mask) and

dense flow map

that aligns the

common region

in the images

Label transfer [Smith+ CVPR’13]

Database

images

• Node sparsity

• Color consistency

of superpixels

• HOG features for

appearance matching

• Similarity transform

• FG/BG color

likelihood (learned

during initialization)

• Spatial neighbors

• Parent child edges

Two-step optimization1) Bottom-up graph construction

Incrementally add layers from lower levels, while estimating flow and segmentation labels.

Initialize Add a layer by merging nodes

2) Top-down labeling refinement

Bottom layer nodes

Bottom layer labels

(final output)

Fixed

Update flow and segmentation labels, while keeping the graph structure fixed.

FG BG FlowsLabels

Update all labels

Fixed

Based on continuous MRF optimization technique (via graph cuts)

Taniai+. “Continuous Stereo Matching Using Local Expansion Moves” (arXiv 2016)

Update lower-layers labels

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

Ours Single layer

Faktor & Irani Joulin+

Flow accuracy

Acc

ura

te p

ixels

(%

)

Flow error threshold in a normalized scale (image width is 100 px)

Ours Single layer SIFT Flow DSP DFF

Cosegmentation accuracy

FG3DCar JODS PASCAL

Inte

rsect

ion

-over-

un

ion

rati

o

Input Flow (correspondence) Cosegmentation

FG

3D

Car

JOD

SPA

SC

AL

Dataset info

Images in our dataset are

grouped by their source

• FG3DCar [Lin+ ‘14]

• JODS [Rubinstein+ ‘13]

• PASCAL [Hariharan+ ‘11]

Input Ground truth Ours SIFT Flow DSP Ours Faktor&Irani Joulin+

http://taniai.space/...