Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and...

68
Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu Cord and Dr. Jorge Stolfi Co-advisors: Profs. Dr. Nicolas Thome and Dr. Neucimar J. Leite Institute of Computing (IC) – University of Campinas (UNICAMP) Laboratoire d’Informatique de Paris 6 (LIP6) – Universit´ e Pierre et Marie Curie (UPMC) 19 March 2012

Transcript of Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and...

Page 1: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Text Recognition and 2D/3DObject Tracking

PhD. DefenseRodrigo Minetto

Advisors: Profs. Dr. Matthieu Cord and Dr. Jorge StolfiCo-advisors: Profs. Dr. Nicolas Thome and Dr. Neucimar J. Leite

Institute of Computing (IC) – University of Campinas (UNICAMP)

Laboratoire d’Informatique de Paris 6 (LIP6) – Universite Pierre et Marie Curie (UPMC)

19 March 2012

Page 2: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Overview

1 Text detection and recognition;

2 Text tracking;

3 Tracking of 3D rigid objects;

4 General conclusions;

5 Publications.

R. Minetto 2 / 56

Page 3: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Part I

Text detection and recognition

R. Minetto 3 / 56

Page 4: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Contributions

Description of a novel text classifier (T-HOG):

Region splitting into horizontal cells;

Cells weighting by fuzzy overlapping functions.

Comparison of the standard HOG (R-HOG) andT-HOG for text recognition;

Text filtering and detection with T-HOG;

Development of a full text detection system:

SnooperText;

SnooperText + T-HOG: on a real GIS.

Page 5: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Contributions

;

;

Page 6: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Contributions

Page 7: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problemText recognition (Text/Non-text classification problem)

Input data: sub-images of an arbitrary 3D scene.

Page 8: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problemOutput data: binaries decisions→ TRUE: if the sub-image contains a Roman-like text;→ FALSE: otherwise.

T-HOG True

True

True

True

TrueTrue

True

False

False

False

False

False

Page 9: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

R-HOG Idea (Histogram of Oriented Gradients)

Images of complex objects typically have 6= HOG’s in 6= parts;

Humans: 6= gradient orientation distributions (head/torso/legs):

Figure: Image from: Histograms of Oriented Gradients for Human Detection.Navneet Dalal and Bill Triggs. CVPR 2004

Page 10: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

HOGs of some isolated letters:

I ρ(∇I ) θ(∇I ) HOG

Page 11: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

T-HOG IdeaRoman-like text-lines: 6= HOG’s in the top/middle/bottom parts.Top/Bottom: Large proportion of horizontal strokes→ gradients pointing mostly in the vertical direction;

Middle: Large proportion of vertical strokes →gradients pointing mostly in the horizontal direction;All parts: Small amount of diagonal strokes

Page 12: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

F-HOG of text/non-text regions

Page 13: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Standard HOG blocks/cells ArrangementsWhat is the best region splitting for text recognition?

Region splitting into 3 HOG’s:

1x3

3x1

1x3h

3hx1

Page 14: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Decision error trade-off curves – 3 HOGs

1x3

3x1

Page 15: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Standard HOG blocks/cells ArrangementsRegion splitting into 6 HOG’s:

6x1 3x2 2x3

1x6 6fx1 3hx2

2x3h 1x6f

Page 16: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Decision error trade-off curves – 6 HOGs

Page 17: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Standard HOG blocks/cells ArrangementsConclusion: horizontal stripes are better!

What is the ideal number of horizontal stripes (ny)?

1x2? 1x3?

1x6?

Page 18: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Standard HOG blocks/cells ArrangementsConclusion: horizontal stripes are better!

What is the ideal number of horizontal stripes (ny)?

1x2? 1x3?

1x6?

Page 19: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Standard HOG blocks/cells Arrangements

Page 20: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

T-HOG cell weight functionsWhat is the importance of cell weight functions?

Sharp cells:

w0 w1 w2

Page 21: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

T-HOG cell weight functions

Fuzzy cells:

w0 w1 w2

Page 22: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Standard HOG cell weight functionsProblem: Sharp boundaries!

Figure: Single block: 1× 3 cells (σ/2).

Figure: Single block of 1× 3 cells (σ/4).

Figure: Overlapping 1× 3 single-cell blocks.

Page 23: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

T-HOG descriptor schemeText Object (original size)

//Resized and normalized image

&&LLLLxxrrrr

∇Ix

�� **VVVVVVVVVVVVVVV

∇Iy

��tthhhhhhhhhhhhhhh

θ∇I

�� &&LLLL

**VVVVVVVVVVVVVVV

|∇I |

��xxrrrrtthhhhhhhhhhhhhhh

w0

��

w1

��

w2

��top HOG middle HOG bottom HOG

Page 24: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

T-HOG x Standard HOG

Comparison T-HOG/R-HOG for text recognition:

Page 25: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

BuildPyramid

Input Image

SegmentationStep

RefiningStep

Fourier descriptors

Final letterdecision

Pseudo Zernickemoments

Our polardescriptors

SnooperText Scheme

Letter Classification Step

SVM

SVM

SVM

SVMGrouping

Step

Page 26: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Experiments

SnooperText Multi-resolution:

L0

L2

L1

R. Minetto 25 / 56

Page 27: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Experiments

SnooperText detection:

R. Minetto 26 / 56

Page 28: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Experiments

SnooperText + T-HOG as a post-filter:

R. Minetto 27 / 56

Page 29: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

ExperimentsPerformance - ICDAR Challenge

System p r fI fII

SnooperText + T-HOG 0.73 0.61 0.65 0.67Yi and Tian (TIP 2011) 0.71 0.62 0.62 0.66

H. Chen et al. (ICIP 2011) 0.73 0.60 — 0.66Epshtein et al. (CVPR 2010) 0.73 0.60 — 0.66

Hinnerk Becker† 0.62 0.67 0.62 0.64Alex Chen† 0.60 0.60 0.58 0.60

Ashida† 0.55 0.46 0.50 0.50HWDavid† 0.44 0.46 0.45 0.45

Wolf† 0.30 0.44 0.35 0.36Qiang Zhu† 0.33 0.40 0.33 0.36

. . .

Page 30: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

ExperimentsPerformance - Epshtein dataset

System p r fI fII

SnooperText + T-HOG 0.59 0.47 0.49 0.52Epshtein et al. (CVPR 2010) 0.54 0.42 — 0.47

Performance - iTowns dataset

System p r fI fII

SnooperText + T-HOG 0.72 0.50 0.56 0.59

Page 31: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

ExperimentsT-HOG as sliding window text detector:

Page 32: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

ExperimentsT-HOG as sliding window text detector:

Page 33: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

ExperimentsT-HOG as sliding window text detector:

Page 34: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Part II

Text tracking

R. Minetto 33 / 56

Page 35: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Contributions

Description of 4 text tracking strategies:

SnooperTrack.

Definition of special metrics for text tracking systems;

Description of a PF designed specifically for text:

T-HOG: contents similarity;

T-HOG: contents classification.

Benchmark with 6 videos of urban scenes (XML).

Page 36: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problem

Input data: a video V→ frames V(0),V(1), . . . ,V(n−1);

Text detection;

Text tracking;

V(i−2) V(i−1)V(i−1)

R. Minetto 35 / 56

Page 37: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problem

Input data: a video V→ frames V(0),V(1), . . . ,V(n−1);

Text detection;

Text tracking;

V(i−1) V(i)

R. Minetto 36 / 56

Page 38: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problem

Output data:

List of text regions T (0),T (1), . . . ,T (n−1);Tracking relations π(1), π(2), . . . , π(n−1);

. . . V(i−2) V(i−1) V(i) . . .

R. Minetto 37 / 56

Page 39: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problem

Output data:

List of text regions T (0),T (1), . . . ,T (n−1);

Tracking relations π(1), π(2), . . . , π(n−1);

. . . V(i−2) V(i−1) V(i) . . .

R. Minetto 37 / 56

Page 40: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problem

Output data:

List of text regions T (0),T (1), . . . ,T (n−1);Tracking relations π(1), π(2), . . . , π(n−1);

. . . V(i−2) V(i−1) V(i) . . .

R. Minetto 37 / 56

Page 41: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track1

Full-frame text detectionon every frame

Prediction and Matching

V(0)

Full-frame text detection

...

Prediction and Matching

Page 42: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track1Full-frame text detection

on every frame

Prediction and Matching

V(0)Full-frame text detection

...

Prediction and Matching

Page 43: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track1Full-frame text detection

on every frame

Prediction and Matching

V(0)

Full-frame text detection

...

Prediction and Matching

Page 44: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track2

Partial-frame

text detectionon every frame

Forward Tracking

V(0)

Partial-frame text detection

...

Forward Tracking

Page 45: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track2Partial-frame text detection

on every frame

Forward Tracking

V(0)Partial-frame text detection

...

Forward Tracking

Page 46: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track2Partial-frame text detection

on every frame

Forward Tracking

V(0)

Partial-frame text detection

...

Forward Tracking

Page 47: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track3 - SnooperTrackPartial-frame text detection

on key frames

Forward Tracking

V(0)

KeyFrame

KeyFrame

KeyFrame

Partial-frame text detection

...

Forward Tracking

Page 48: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track3 - SnooperTrackPartial-frame text detection

on key frames

Forward Tracking

V(0)

KeyFrame

KeyFrame

KeyFrame

Partial-frame text detection

...

Forward Tracking

Page 49: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track3 - SnooperTrackPartial-frame text detection

on key frames

Forward Tracking

V(0)

KeyFrame

KeyFrame

KeyFrame

Partial-frame text detection

...

Forward Tracking

Page 50: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track4Partial-frame text detection

on key frames

Bidirectional

Tracking

V(0)

...

Bidirectional Tracking

Page 51: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Tracking Strategies

Track4Partial-frame text detection

on key frames

Bidirectional Tracking

V(0)

...

Bidirectional Tracking

Page 52: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Experiments

Video dataset (6 outdoor videos);

Metrics:

Region detection accuracy;

Object detection accuracy;

Text tracking accuracy;

ICDAR detection score;

Text detector: SnooperText/Ideal.

Text tracker:

T1 (simple linear prediction);

T2–4 (particle filter + T-HOG).

Page 53: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Conclusions

Impact of text detection errors;

Impact of false positive detections;

Impact of over-segmentation:Impact of backward tracking:

↑ f -score improved 4% in all metrics.

Track1 with an ideal text detector was the best;

Track3(ST)–Track4 with a real text detector were thebest;

Track1–Track2 are more time consuming thanTrak3–Track4. All algorithms using the SnooperTextdetector.

Page 54: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Part III

Tracking of 3D objects

R. Minetto 44 / 56

Page 55: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Contributions

Development of a new algorithm – AFFTrack:

Camera calibration;

Feature tracking;

Development of a benchmark (publicly available).

Page 56: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problemInput:

→ A video V (V(0),V(1), . . . ,V(n−1));

→ Initial frame positions at V(0). Image V(0)

(593,0)

(582,0)

(130,0)(160,0)

(160,0)

(0,362)

(0,359)

(0,185)

(0,174)

Page 57: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problemInput:

→ Feature object’s geometry;

→ Canonical and masks images.Scene model

.YX

Z

qqcccccccccccccccccccccccccccccccccccccc

,,YYYYYYYYYYYYYYYYYYYYYYYY

OO````````````````````````````````````````]]]]]]]]]]]]]]]]]]]]]]]]]]

(593,0,638)

(582,0,123)

(1130,0,607)(1660,0,604)

(1660,0,123)

(0,362,603)

(0,359,124)

(0,1085,603)

(0,1074,124)

Page 58: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problemOutput:

→ Feature tracking.

Page 59: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problemOutput:

→ Feature tracking.

Page 60: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Statement of the problemOutput:

→ Camera parameters C (i) for each V(i).

Page 61: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

AFFTrack

FeatureFinder

Cameracalibrator

Adjusted positions

Confidence weights

Feature positions

R. Minetto 51 / 56

Page 62: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Experiments

8 real videos:

Optura (320× 240);

Sony (640× 480);

9 synthetic videos:

POV-Ray;

Page 63: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Conclusions

AFFTrack can track objects in the presence:

occlusions;

video noise;

tracking failures;

AFFTrack can handle videos with:

variable zoom;

distortion zoom;

AFFTrack is free from long-term drift;

Page 64: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

General conclusions and perspectives

We considered 3 computer vision problems;

We developed 8 new algorithms:

T-HOGSnooperTextTrack1Track2

Track3 - SnooperTrackTrack4Particle Filter with T-HOG

AFFTrack

Page 65: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

General conclusions and perspectives

SnooperText/T-HOG was used in the iTowns project;

We assembled 2 new benchmarks (publicly available);

We plan:

Compare T-HOG with other descriptors (LBP, etc);

Improve the SnooperText algorithm;

Handling of occlusions in text tracking.

Page 66: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Publications:

[Minetto et al. 2011] R. Minetto, N. Thome, M. Cord,J. Stolfi, F. Precioso, J. Guyomard and N. J. Leite.

Text Detection and Recognitionin Urban Scenes.

IEEE/ISPRS Workshop on Computer Vision forRemote Sensing of the Environment CVRS-ICCV .

[Minetto et al. 2011] R. Minetto, N. Thome, M. Cord,N.J. Leite and J. Stolfi.

SnooperTrack: Text Detection andTracking for Outdoor Videos.

IEEE International Conference on Image Processing (ICIP).

R. Minetto 55 / 56

Page 67: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

[Minetto et al. 2010] R. Minetto, N. Thome, M. Cord,J. Fabrizio and B. Marcotegui.

SnooperText: A Multiresolution System forText Detection in Complex Visual Scenes.

IEEE International Conference on Image Processing (ICIP).

[Minetto et al. 2009] R. Minetto, N.J. Leite and J. Stolfi.

AFFTrack: Robust Tracking ofFeatures in Variable-Zoom Videos.

IEEE International Conference on Image Processing (ICIP).

[Minetto et al. 2008] R. Minetto, N.J. Leite and J. Stolfi.

Integrating Tsai’s Camera CalibrationAlgorithm with KLT Feature Tracking.

Proceedings of IV Workshop de Visao Computacional.

R. Minetto 56 / 56

Page 68: Text Recognition and 2D/3D Object Tracking - DAINFrminetto/slides_phd.pdf · Text Recognition and 2D/3D Object Tracking PhD. Defense Rodrigo Minetto Advisors: Profs. Dr. Matthieu

Thank you for your attention!

QUESTIONS ?

R. Minetto 56 / 56