Reconnaissance d’objets et vision artificielle

41

description

Reconnaissance d’objets et vision artificielle. Jean Ponce ( [email protected]) http://www.di.ens.fr/~ponce Equipe-projet WILLOW ENS/INRIA/CNRS UMR 8548 Laboratoire d’Informatique Ecole Normale Sup érieure, Paris. Outline. Motivation: Making mosaics Perspetive and weak perspective - PowerPoint PPT Presentation

Transcript of Reconnaissance d’objets et vision artificielle

Page 1: Reconnaissance d’objets et vision artificielle
Page 2: Reconnaissance d’objets et vision artificielle

Reconnaissance d’objetset vision artificielle

Jean Ponce ([email protected])http://www.di.ens.fr/~ponce

Equipe-projet WILLOWENS/INRIA/CNRS UMR 8548Laboratoire d’Informatique

Ecole Normale Supérieure, Paris

Page 3: Reconnaissance d’objets et vision artificielle

Outline

• Motivation: Making mosaics• Perspetive and weak perspective• Coordinates changes• Intrinsic and extrensic parameters• Affine registration• Projective registration

Page 4: Reconnaissance d’objets et vision artificielle

Feature-based alignment outline

Page 5: Reconnaissance d’objets et vision artificielle

Feature-based alignment outline

Extract features

Page 6: Reconnaissance d’objets et vision artificielle

Feature-based alignment outline

Extract featuresCompute putative matches

Page 7: Reconnaissance d’objets et vision artificielle

Feature-based alignment outline

Extract featuresCompute putative matchesLoop:

• Hypothesize transformation T (small group of putative matches that are related by T)

Page 8: Reconnaissance d’objets et vision artificielle

Feature-based alignment outline

Extract featuresCompute putative matchesLoop:

• Hypothesize transformation T (small group of putative matches that are related by T)

• Verify transformation (search for other matches consistent with T)

Page 9: Reconnaissance d’objets et vision artificielle

Feature-based alignment outline

Extract featuresCompute putative matchesLoop:

• Hypothesize transformation T (small group of putative matches that are related by T)

• Verify transformation (search for other matches consistent with T)

Page 10: Reconnaissance d’objets et vision artificielle

2D transformation models

Similarity(translation, scale, rotation)

Affine

Projective(homography)

Why these transformations ???

Page 11: Reconnaissance d’objets et vision artificielle

Pinhole perspective equation

zyfy

zxfx

''

''NOTE: z is always negative..

Page 12: Reconnaissance d’objets et vision artificielle

Affine models: Weak perspective projection

0

'where''

zfm

myymxx

is the magnification.

When the scene relief is small compared its distance from theCamera, m can be taken constant: weak perspective projection.

Page 13: Reconnaissance d’objets et vision artificielle

Affine models: Orthographic projection

yyxx

'' When the camera is at a

(roughly constant) distancefrom the scene, take m=1.

Page 14: Reconnaissance d’objets et vision artificielle

Analytical camera geometry

Page 15: Reconnaissance d’objets et vision artificielle

Coordinate Changes: Pure Translations

OBP = OBOA + OAP , BP = AP + BOA

Page 16: Reconnaissance d’objets et vision artificielle

Coordinate Changes: Pure Rotations

BABABA

BABABA

BABABABAR

kkkjkijkjjjiikijii

.........

TB

A

TB

A

TB

A

kji

ABA

BA

B kji

Page 17: Reconnaissance d’objets et vision artificielle

Coordinate Changes: Rotations about

the z Axis

1000cossin0sincos

RBA

Page 18: Reconnaissance d’objets et vision artificielle

A rotation matrix is characterized by the following properties:

• Its inverse is equal to its transpose, and

• its determinant is equal to 1.

Or equivalently:

• Its rows (or columns) form a right-handedorthonormal coordinate system.

Page 19: Reconnaissance d’objets et vision artificielle

Coordinate changes: pure rotations

PRP

zyx

zyx

OP

ABA

B

B

B

B

BBBA

A

A

AAA

kjikji

Page 20: Reconnaissance d’objets et vision artificielle

Coordinate Changes: Rigid Transformations

ABAB

AB OPRP

11111P

TOPRPORP A

BA

ABAB

AA

TA

BBA

B

0

Page 21: Reconnaissance d’objets et vision artificielle

Pinhole perspective equation

zyfy

zxfx

''

''NOTE: z is always negative..

Page 22: Reconnaissance d’objets et vision artificielle

The intrinsic parameters of a camera

Normalized imagecoordinates

Physical image coordinates

Units:k,l : pixel/mf : ma,b : pixel

Page 23: Reconnaissance d’objets et vision artificielle

The intrinsic parameters of a camera

Calibration matrix

The perspectiveprojection equation

Page 24: Reconnaissance d’objets et vision artificielle

The extrinsic parameters of a camera

Page 25: Reconnaissance d’objets et vision artificielle

Perspective projections induce projective transformations between planes

Page 26: Reconnaissance d’objets et vision artificielle

Weak-perspective projection

Paraperspective projection

Affine cameras

Page 27: Reconnaissance d’objets et vision artificielle

Orthographic projection

Parallel projection

More affine cameras

Page 28: Reconnaissance d’objets et vision artificielle

Weak-perspective projection model

r(p and P are in homogeneous coordinates)

p = A P + b (neither p nor P is in hom. coordinates)

p = M P (P is in homogeneous coordinates)

Page 29: Reconnaissance d’objets et vision artificielle

Affine projections induce affine transformations from planes onto their images.

Page 30: Reconnaissance d’objets et vision artificielle

Affine transformationsAn affine transformation maps a parallelogram ontoanother parallelogram

11001''

22221

11211

vu

baabaa

vu

Page 31: Reconnaissance d’objets et vision artificielle

Fitting an affine transformationAssume we know the correspondences, how do we get

the transformation?

2

1

43

21

tt

yx

mmmm

yx

i

i

i

i

i

i

ii

ii

yx

ttmmmm

yxyx

2

1

4

3

2

1

10000100

),( ii yx ),( ii yx

Page 32: Reconnaissance d’objets et vision artificielle

Fitting an affine transformation

Linear system with six unknownsEach match gives us two linearly independent

equations: need at least three to solve for the transformation parameters

i

i

ii

ii

yx

ttmmmm

yxyx

2

1

4

3

2

1

10000100

Page 33: Reconnaissance d’objets et vision artificielle

Beyond affine transformationsWhat is the transformation between two views of a

planar surface?

What is the transformation between images from two cameras that share the same center?

Page 34: Reconnaissance d’objets et vision artificielle

Perspective projections induce projective transformations between planes

Page 35: Reconnaissance d’objets et vision artificielle

Beyond affine transformationsHomography: plane projective transformation

(transformation taking a quad to another arbitrary quad)

Page 36: Reconnaissance d’objets et vision artificielle

Fitting a homographyRecall: homogenenous coordinates

Converting to homogenenousimage coordinates

Converting from homogenenousimage coordinates

Page 37: Reconnaissance d’objets et vision artificielle

Fitting a homographyRecall: homogenenous coordinates

Equation for homography:

Converting to homogenenousimage coordinates

Converting from homogenenousimage coordinates

11 333231

232221

131211

yx

hhhhhhhhh

yx

Page 38: Reconnaissance d’objets et vision artificielle

Fitting a homographyEquation for homography:

iT

T

T

ii xhhh

xHx

3

2

1

11 333231

232221

131211

i

i

i

i

yx

hhhhhhhhh

yx

0 ii xHx

iT

iiT

i

iT

iiT

iT

iT

i

ii

yxx

y

xhxhxhxhxhxh

xHx

12

31

23

00

00

3

2

1

hhh

xxxxxx

TTii

Tii

Tii

TTi

Tii

Ti

T

xyxy

3 equations, only 2 linearly independent

9 entries, 8 degrees of freedom(scale is arbitrary)

Page 39: Reconnaissance d’objets et vision artificielle

Direct linear transform

H has 8 degrees of freedom (9 parameters, but scale is arbitrary)

One match gives us two linearly independent equationsFour matches needed for a minimal solution (null space

of 8x9 matrix)More than four: homogeneous least squares

0

00

00

3

2

1111

111

hhh

xxxx

xxxx

Tnn

TTn

Tnn

Tn

T

TTT

TTT

xy

xy

0hA

Page 40: Reconnaissance d’objets et vision artificielle

Application: Panorama stitching

Images courtesy of A. Zisserman.

Page 41: Reconnaissance d’objets et vision artificielle