Post on 17-Dec-2015
Algorithms and Applications in Computer Vision
Lihi Zelnik-Manor, lihi@ee.technion.ac.il
Physical parameters of image formation
• Geometric– Type of projection– Camera pose
• Optical– Sensor’s lens type– focal length, field of view, aperture
• Photometric– Type, direction, intensity of light reaching sensor– Surfaces’ reflectance properties
Image formation
• Let’s design a camera
– Idea 1: put a piece of film in front of an object
– Do we get a reasonable image?
Slide by Steve Seitz
Pinhole camera
Slide by Steve Seitz
• Add a barrier to block off most of the rays
– This reduces blurring
– The opening is known as the aperture
– How does this transform the image?
Pinhole camera
• The pinhole camera is a simple model that approximates the imaging process: perspective projection.
Fig from Forsyth and Ponce
If we treat pinhole as a point, only one ray from any given point can enter the camera.
Virtual image
pinhole
Image plane
Camera obscura
"Reinerus Gemma-Frisius, observed an eclipse of the sun at Louvain on January 24, 1544, and later he used this illustration of the event in his book De Radio Astronomica et Geometrica, 1545. It is thought to be the first published illustration of a camera obscura..." Hammond, John H., The Camera Obscura, A Chronicle
http://www.acmi.net.au/AIC/CAMERA_OBSCURA.html
In Latin, means ‘dark room’
Camera obscura
Jetty at Margate England, 1898.
Adapted from R. Duraiswami
http://brightbytes.com/cosite/collection2.html
Around the 1870s; an attraction in the late 19th century
Camera obscura at home
Sketch from http://www.funsci.com/fun3_en/sky/sky.htm
http://blog.makezine.com/archive/2006/02/how_to_room_sized_camera_obscu.html
Perspective effects
• Parallel lines in the scene intersect in the image
• They converge in the image on the horizon line
Image plane (virtual)
Scene
pinhole
Physical parameters of image formation
• Geometric– Type of projection– Camera pose
• Optical– Sensor’s lens type– focal length, field of view, aperture
• Photometric– Type, direction, intensity of light reaching sensor– Surfaces’ reflectance properties
• Sensor– sampling, etc.
Perspective and art
• Use of correct perspective projection indicated in 1st-century B.C. frescoes
• Skill resurfaces in the Renaissance: artists develop systematic methods to determine perspective projection (around 1480-1515)
Dürer, 1525; Raphael
K. Grauman
Perspective projection equations
• 3d world mapped to 2d projection in image plane
Forsyth and Ponce
Camera frame
Image plane
Optical axis
Focal length
Scene / world points
Perspective projection equations
• 3d world mapped to 2d projection in image plane
Forsyth and Ponce
By similar triangles: $\dfrac{x'}{f'} = \dfrac{x}{z}$, so $x' = f'\dfrac{x}{z}$
Perspective projection equations
• 3d world mapped to 2d projection in image plane
Forsyth and Ponce
$x' = f'\dfrac{x}{z} \qquad y' = f'\dfrac{y}{z}$
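These projection equations are easy to check numerically. A minimal NumPy sketch (the function name and values are my own, for illustration):

```python
import numpy as np

def project_perspective(points, f_prime):
    """Pinhole projection: x' = f' * x / z, y' = f' * y / z.

    points: (N, 3) array of camera-frame coordinates (x, y, z), z > 0.
    Returns an (N, 2) array of image-plane coordinates.
    """
    points = np.asarray(points, dtype=float)
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    return np.stack([f_prime * x / z, f_prime * y / z], axis=1)

# A point twice as far away projects half as large:
print(project_perspective([[1.0, 2.0, 4.0]], f_prime=2.0))  # (0.5, 1.0)
print(project_perspective([[1.0, 2.0, 8.0]], f_prime=2.0))  # (0.25, 0.5)
```

Doubling the depth z halves the image coordinates, which is exactly the perspective foreshortening the slides describe.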
Homogeneous coordinates
Is this a linear transformation? No: division by z is nonlinear.
Trick: add one more coordinate:
• homogeneous image coordinates: $(x, y) \to (x, y, 1)$
• homogeneous scene coordinates: $(x, y, z) \to (x, y, z, 1)$
Converting back from homogeneous coordinates: $(x, y, w) \to (x/w,\, y/w)$
Slide by Steve Seitz
Perspective Projection Matrix
• Projection is a matrix multiplication using homogeneous coordinates:

$$\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1/f' & 0 \end{bmatrix}\begin{bmatrix} x \\ y \\ z \\ 1 \end{bmatrix} = \begin{bmatrix} x \\ y \\ z/f' \end{bmatrix} \;\Rightarrow\; \left( f'\frac{x}{z},\; f'\frac{y}{z} \right)$$

• Divide by the third coordinate to convert back to non-homogeneous coordinates
Slide by Steve Seitz
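The matrix form above can be reproduced directly. A small sketch with an illustrative focal length:

```python
import numpy as np

f_prime = 2.0  # illustrative focal length

# 3x4 perspective projection matrix: the last row divides by f'.
P = np.array([[1.0, 0.0, 0.0,           0.0],
              [0.0, 1.0, 0.0,           0.0],
              [0.0, 0.0, 1.0 / f_prime, 0.0]])

X = np.array([1.0, 2.0, 4.0, 1.0])   # homogeneous scene point (x, y, z, 1)
x_h = P @ X                          # homogeneous image point (x, y, z/f')
x_img = x_h[:2] / x_h[2]             # divide by the third coordinate
print(x_img)                         # (f'x/z, f'y/z) = (0.5, 1.0)
```

The nonlinear division by z is pushed into the final dehomogenization step, so the projection itself is a single linear map.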
Complete mapping from world points to image pixel positions?
Perspective projection & calibration

Intrinsic: image coordinates relative to camera → pixel coordinates
Extrinsic: world frame → camera frame

2D point (3x1) = camera-to-pixel coord. trans. matrix (3x3) × perspective projection matrix (3x4) × world-to-camera coord. trans. matrix (4x4) × 3D point (4x1)

K. Grauman
So far we defined only the perspective projection matrix
World-to-camera transformation
K. Grauman
World-to-camera coord. trans. matrix (4x4): 3D camera-frame point (4x1) = matrix × 3D world point (4x1):

$$\begin{bmatrix} x \\ y \\ z \\ 1 \end{bmatrix}_{C} = \begin{bmatrix} R & t \\ 0\;0\;0 & 1 \end{bmatrix}\begin{bmatrix} x \\ y \\ z \\ 1 \end{bmatrix}_{W}$$
Extrinsic parameters: translation and rotation of camera frame

Non-homogeneous coordinates: $p_C = R_W^C\, p_W + t$

Homogeneous coordinates: $p_C = \begin{bmatrix} R_W^C & t \\ 0\;0\;0 & 1 \end{bmatrix} p_W$
W. Freeman
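The extrinsic transform can be sketched as follows; the rotation and translation values are hypothetical, chosen only so the result is easy to verify by hand:

```python
import numpy as np

def world_to_camera(R, t, p_world):
    """Apply p_C = R p_W + t via the homogeneous 4x4 matrix [R t; 0 0 0 1]."""
    T = np.eye(4)
    T[:3, :3] = R
    T[:3, 3] = t
    p_h = np.append(p_world, 1.0)    # homogeneous world point
    return (T @ p_h)[:3]

# Example: camera frame rotated 90 degrees about z, translated by (1, 0, 0).
R = np.array([[0.0, -1.0, 0.0],
              [1.0,  0.0, 0.0],
              [0.0,  0.0, 1.0]])
t = np.array([1.0, 0.0, 0.0])
print(world_to_camera(R, t, np.array([1.0, 0.0, 0.0])))  # (1, 1, 0)
```

R rotates (1, 0, 0) to (0, 1, 0), and adding t gives (1, 1, 0), matching the non-homogeneous form $p_C = R p_W + t$.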
Perspective projection & calibration
Camera frame
Intrinsic:Image coordinates relative to camera Pixel coordinates
Extrinsic:Camera frame World frame
World frame
World to camera coord. trans. matrix
(4x4)
Perspectiveprojection matrix
(3x4)
Camera to pixel coord.
trans. matrix (3x3)
=2D
point(3x1)
3Dpoint(4x1)
K. Grauman
Intrinsic parameters: from idealized world coordinates to pixel values
Forsyth&Ponce
$x' = f'\dfrac{x}{z} \qquad y' = f'\dfrac{y}{z}$
Perspective projection
W. Freeman
Intrinsic parameters
$x' = \alpha\dfrac{x}{z} + u_0 \qquad y' = \beta\dfrac{y}{z} + v_0$

We don’t know the origin of our camera pixel coordinates (offsets $u_0, v_0$)
W. Freeman
Intrinsic parameters
May be skew between camera pixel axes: angle $\theta$ between the $u$ and $v$ axes

$x' = \alpha\dfrac{x}{z} - \alpha\cot(\theta)\dfrac{y}{z} + u_0 \qquad y' = \dfrac{\beta}{\sin(\theta)}\dfrac{y}{z} + v_0$
W. Freeman
Intrinsic parameters, homogeneous coordinates

$x' = \alpha\dfrac{x}{z} - \alpha\cot(\theta)\dfrac{y}{z} + u_0 \qquad y' = \dfrac{\beta}{\sin(\theta)}\dfrac{y}{z} + v_0$

Using homogeneous coordinates, we can write this as:

$$z\begin{bmatrix} x' \\ y' \\ 1 \end{bmatrix} = \begin{bmatrix} \alpha & -\alpha\cot(\theta) & u_0 \\ 0 & \beta/\sin(\theta) & v_0 \\ 0 & 0 & 1 \end{bmatrix}\begin{bmatrix} x \\ y \\ z \end{bmatrix}$$

or: $p = \dfrac{1}{z} K\, \hat{p}$

($p$ in pixels; $\hat{p}$ in camera-based coords; $K$ is the 3x3 intrinsic matrix)
W. Freeman
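The intrinsic matrix K can be assembled and applied as below. All parameter values are illustrative, and $\theta = 90°$ recovers the usual skew-free case:

```python
import numpy as np

def intrinsic_matrix(alpha, beta, theta, u0, v0):
    """3x3 intrinsic matrix with skew angle theta between the pixel axes."""
    return np.array([[alpha, -alpha / np.tan(theta), u0],
                     [0.0,    beta / np.sin(theta),  v0],
                     [0.0,    0.0,                   1.0]])

# Illustrative values; theta = pi/2 means no skew.
K = intrinsic_matrix(alpha=500.0, beta=500.0, theta=np.pi / 2,
                     u0=320.0, v0=240.0)

p_cam = np.array([0.2, -0.1, 1.0])   # camera-frame point (x, y, z)
p_h = K @ p_cam                      # homogeneous pixel coordinates
print(p_h[:2] / p_h[2])              # (500*0.2+320, 500*(-0.1)+240) = (420, 190)
```

With no skew, K simply scales by the focal lengths in pixels and shifts by the principal point $(u_0, v_0)$.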
Combining extrinsic and intrinsic calibration parameters, in homogeneous coordinates
Forsyth & Ponce

World coordinates → camera coordinates → pixels:

Extrinsic: $p_C = \begin{bmatrix} R_W^C & t \\ 0\;0\;0 & 1 \end{bmatrix} p_W$

Intrinsic: $p = \dfrac{1}{z} K \begin{bmatrix} R_W^C & t \end{bmatrix} p_W$

so $p = \dfrac{1}{z} M\, p_W$, with $M = K\begin{bmatrix} R_W^C & t \end{bmatrix}$ (3x4)
W. Freeman
Other ways to write the same equation

$$z\begin{bmatrix} x' \\ y' \\ 1 \end{bmatrix} = \begin{bmatrix} m_1^T \\ m_2^T \\ m_3^T \end{bmatrix} p_W$$

where $m_1^T, m_2^T, m_3^T$ are the rows of $M$, $(x', y')$ are pixel coordinates, and $P = p_W$ holds the world coordinates. Conversion back from homogeneous coordinates leads to:

$x' = \dfrac{m_1 \cdot P}{m_3 \cdot P} \qquad y' = \dfrac{m_2 \cdot P}{m_3 \cdot P}$
W. Freeman
Calibration target
http://www.kinetic.bc.ca/CompVision/opti-CAL.html
Find the position, ui and vi, in pixels, of each calibration object feature point.
Camera calibration

From before, we had these equations relating image positions to points at 3-d positions P (in homogeneous coordinates):

$x'_i = \dfrac{m_1 \cdot P_i}{m_3 \cdot P_i} \qquad y'_i = \dfrac{m_2 \cdot P_i}{m_3 \cdot P_i}$

So for each feature point, i, we have:

$(m_1 - x'_i\, m_3)\cdot P_i = 0 \qquad (m_2 - y'_i\, m_3)\cdot P_i = 0$
W. Freeman
Camera calibration

Stack all these measurements of i = 1…n points into a big matrix:

$$\begin{bmatrix} P_1^T & 0^T & -x'_1 P_1^T \\ 0^T & P_1^T & -y'_1 P_1^T \\ & \vdots & \\ P_n^T & 0^T & -x'_n P_n^T \\ 0^T & P_n^T & -y'_n P_n^T \end{bmatrix}\begin{bmatrix} m_1 \\ m_2 \\ m_3 \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \\ \vdots \\ 0 \\ 0 \end{bmatrix}$$
W. Freeman
Camera calibration

Showing all the elements:

$$\begin{bmatrix}
P_{1x} & P_{1y} & P_{1z} & 1 & 0 & 0 & 0 & 0 & -x'_1 P_{1x} & -x'_1 P_{1y} & -x'_1 P_{1z} & -x'_1 \\
0 & 0 & 0 & 0 & P_{1x} & P_{1y} & P_{1z} & 1 & -y'_1 P_{1x} & -y'_1 P_{1y} & -y'_1 P_{1z} & -y'_1 \\
 & & & & & \vdots & & & & & & \\
P_{nx} & P_{ny} & P_{nz} & 1 & 0 & 0 & 0 & 0 & -x'_n P_{nx} & -x'_n P_{ny} & -x'_n P_{nz} & -x'_n \\
0 & 0 & 0 & 0 & P_{nx} & P_{ny} & P_{nz} & 1 & -y'_n P_{nx} & -y'_n P_{ny} & -y'_n P_{nz} & -y'_n
\end{bmatrix}
\begin{bmatrix} m_{11} \\ m_{12} \\ m_{13} \\ m_{14} \\ m_{21} \\ m_{22} \\ m_{23} \\ m_{24} \\ m_{31} \\ m_{32} \\ m_{33} \\ m_{34} \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \\ \vdots \\ 0 \\ 0 \end{bmatrix}$$

In vector form: $Q\,m = 0$
W. Freeman
We want to solve for the unit vector m (the stacked one) that minimizes $\|Q m\|^2$ subject to $Q\,m = 0$ and $\|m\| = 1$.

The minimum eigenvector of the matrix $Q^T Q$ gives us that (see Forsyth & Ponce, 3.1), because it is the unit vector x that minimizes $x^T Q^T Q\, x$.
Camera calibration
W. Freeman
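The whole calibration recipe (build Q from point correspondences, then take the singular vector of Q with the smallest singular value, which is the minimum eigenvector of $Q^T Q$) can be tested on synthetic data. The camera matrix and points below are invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical ground-truth camera matrix M = K [R | t] (values illustrative).
M_true = np.array([[500.0,   0.0, 320.0, 100.0],
                   [  0.0, 500.0, 240.0,  50.0],
                   [  0.0,   0.0,   1.0,   2.0]])

# n = 8 random (generically non-coplanar) 3-D calibration points, homogeneous.
P = np.hstack([rng.uniform(1.0, 5.0, (8, 3)), np.ones((8, 1))])

proj = P @ M_true.T                       # homogeneous image points
x_p = proj[:, 0] / proj[:, 2]
y_p = proj[:, 1] / proj[:, 2]

# Build the 2n x 12 matrix Q: rows (P_i, 0, -x'_i P_i) and (0, P_i, -y'_i P_i).
rows = []
for Pi, xi, yi in zip(P, x_p, y_p):
    rows.append(np.concatenate([Pi, np.zeros(4), -xi * Pi]))
    rows.append(np.concatenate([np.zeros(4), Pi, -yi * Pi]))
Q = np.array(rows)

# Unit vector minimizing ||Q m||^2: right singular vector of Q with the
# smallest singular value (equivalently, minimum eigenvector of Q^T Q).
m = np.linalg.svd(Q)[2][-1]
M_est = m.reshape(3, 4)

# M is recovered up to scale; normalize and compare.
print(np.allclose(M_est / M_est[2, 2], M_true, atol=1e-6))
```

Because the synthetic measurements are noise-free, the smallest singular value is essentially zero and the estimate matches the true matrix up to scale; with real, noisy feature locations the same procedure gives the least-squares estimate.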
Once you have the M matrix, you can recover the intrinsic and extrinsic parameters as in Forsyth & Ponce, sect. 3.2.2.
Camera calibration
W. Freeman
Perspective projection & calibration

Intrinsic: image coordinates relative to camera → pixel coordinates
Extrinsic: world frame → camera frame

2D point (3x1) = camera-to-pixel coord. trans. matrix (3x3) × perspective projection matrix (3x4) × world-to-camera coord. trans. matrix (4x4) × 3D point (4x1)

K. Grauman
Projection properties
• Many-to-one: all points along the same ray map to the same point in the image
• Points? → points
• Lines? → lines (collinearity preserved)
• Distances and angles? → are not preserved
• Degenerate cases:
– A line through the focal point projects to a point
– A plane through the focal point projects to a line
– A plane perpendicular to the image plane projects to part of the image
Weak perspective• Approximation: treat magnification as constant• Assumes scene depth << average distance to camera
World points
Image plane
$$\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & z_0/f' \end{bmatrix}\begin{bmatrix} x \\ y \\ z \\ 1 \end{bmatrix} = \begin{bmatrix} x \\ y \\ z_0/f' \end{bmatrix} \;\Rightarrow\; \left( \frac{f'}{z_0}x,\; \frac{f'}{z_0}y \right)$$
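The quality of the weak-perspective approximation is easy to probe numerically. A sketch with assumed focal length f' and average scene depth z0:

```python
import numpy as np

f_prime, z0 = 2.0, 10.0          # illustrative focal length and average depth

def perspective(p):
    """Exact pinhole projection: divide by the point's own depth z."""
    x, y, z = p
    return np.array([f_prime * x / z, f_prime * y / z])

def weak_perspective(p):
    """Weak perspective: constant magnification f'/z0 regardless of depth."""
    x, y, _ = p
    return (f_prime / z0) * np.array([x, y])

# Depth close to z0, so the approximation error is small:
p = np.array([1.0, 1.0, 10.5])
print(perspective(p))            # about (0.190, 0.190)
print(weak_perspective(p))       # (0.2, 0.2)
```

The two agree to within a few percent here because the depth variation (0.5) is small relative to z0; moving the point much nearer or farther makes the gap grow, which is why the approximation needs scene depth << distance to camera.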
Orthographic projection
• Given camera at constant distance from scene
• World points projected along rays parallel to the optical axis
Other types of projection
• Lots of intriguing variants…• (I’ll just mention a few fun ones)
S. Seitz
360 degree field of view…
• Basic approach
– Take a photo of a parabolic mirror with an orthographic lens (Nayar)
– Or buy a lens from a variety of omnicam manufacturers…
• See http://www.cis.upenn.edu/~kostas/omni.html
S. Seitz
Tilt-shift
Tilt-shift images from Olivo Barbieri and Photoshop imitations
http://www.northlight-images.co.uk/article_pages/tilt_and_shift_ts-e.html
S. Seitz
tilt, shift
http://en.wikipedia.org/wiki/Tilt-shift_photography
Tilt-shift perspective correction
http://en.wikipedia.org/wiki/Tilt-shift_photography
normal lens tilt-shift lens
http://www.northlight-images.co.uk/article_pages/tilt_and_shift_ts-e.html
Rollout Photographs © Justin Kerr
http://research.famsi.org/kerrmaya.html
Rotating sensor (or object)
Also known as “cyclographs”, “peripheral images”
S. Seitz
Photofinish
S. Seitz
1. A single vertical slit instead of a shutter: the film is advanced continuously at a speed similar to that of the racers’ images
2. A high speed camera takes a continuous series of partial frame photos at a fast rate
Physical parameters of image formation
• Geometric– Type of projection– Camera pose
• Optical– Sensor’s lens type– focal length, field of view, aperture
• Photometric– Type, direction, intensity of light reaching sensor– Surfaces’ reflectance properties
• Sensor– sampling, etc.
Pinhole size / aperture
Smaller
Larger
How does the size of the aperture affect the image we’d get?
K. Grauman
Adding a lens
• A lens focuses light onto the film
– Rays passing through the center are not deviated
– All parallel rays converge to one point on a plane located at the focal length f
Slide by Steve Seitz
focal point
f
Cameras with lenses
focal point
F
optical center(Center Of Projection)
• A lens focuses parallel rays onto a single focal point
• Gathers more light, while keeping focus; makes pinhole perspective projection practical
K. Grauman
Human eye
Fig from Shapiro and Stockman
Pupil/iris – controls the amount of light passing through the lens
Retina – contains sensor cells, where the image is formed
Fovea – highest concentration of cones
Rough analogy with human visual system:
Thin lens
Rays entering parallel on one side go through the focus on the other, and vice versa.
In the ideal case, all rays from P are imaged at P’.
Left focus Right focus
Focal length fLens diameter d
K. Grauman
Focus and depth of field
• Depth of field: distance between image planes where blur is tolerable
Thin lens: scene points at distinct depths come in focus at different image planes.
(Real camera lens systems have greater depth of field.)
Shapiro and Stockman
“circles of confusion”
Focus and depth of field
Images from Wikipedia http://en.wikipedia.org/wiki/Depth_of_field
1. Blurred
2. In focus
3. Blurred
Focus and depth of field
• How does the aperture affect the depth of field?
• A smaller aperture increases the range in which the object is approximately in focus
Flower images from Wikipedia http://en.wikipedia.org/wiki/Depth_of_field Slide from S. Seitz
Depth from focus
[figs from H. Jin and P. Favaro, 2002]
Images from same point of view, different camera parameters
3d shape / depth estimates
Field of view
• Angular measure of portion of 3d space seen by the camera
Images from http://en.wikipedia.org/wiki/Angle_of_view K. Grauman
• As f gets smaller, the image becomes more wide-angle: more world points project onto the finite image plane
• As f gets larger, the image becomes more telescopic: a smaller part of the world projects onto the finite image plane
Field of view depends on focal length
from R. Duraiswami
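The relation between focal length and field of view can be written as $\mathrm{fov} = 2\arctan\!\left(\frac{d}{2f}\right)$ for a sensor of extent d. A small sketch (the sensor width is my own assumption, not from the slides):

```python
import numpy as np

def field_of_view_deg(sensor_size_mm, focal_length_mm):
    """Angular field of view: fov = 2 * arctan(d / (2 f))."""
    return np.degrees(2.0 * np.arctan(sensor_size_mm /
                                      (2.0 * focal_length_mm)))

# Assuming a 36 mm sensor width (full-frame):
# smaller f -> wider angle, larger f -> more telescopic.
for f in (20.0, 50.0, 200.0):
    print(f, round(field_of_view_deg(36.0, f), 1))
```

For a 50 mm lens on a 36 mm sensor this gives roughly a 40 degree horizontal field of view, consistent with the trend the slides describe.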
Vignetting
http://www.ptgui.com/examples/vigntutorial.html
http://www.tlucretius.net/Photo/eHolga.html
Chromatic aberration
Physical parameters of image formation
• Geometric– Type of projection– Camera pose
• Optical– Sensor’s lens type– focal length, field of view, aperture
• Photometric– Type, direction, intensity of light reaching sensor– Surfaces’ reflectance properties
• Sensor– sampling, etc.
Specular reflection
Ideal reflector: the specular reflection is visible only when the line of sight coincides with the reflected ray.
Physical parameters of image formation
• Geometric– Type of projection– Camera pose
• Optical– Sensor’s lens type– focal length, field of view, aperture
• Photometric– Type, direction, intensity of light reaching sensor– Surfaces’ reflectance properties
• Sensor– sampling, etc.
Digital cameras
• Film replaced by a sensor array
• Often an array of charge-coupled devices (CCDs)
• Each CCD cell is a light-sensitive diode that converts photons (light energy) to electrons
cameraCCD array
optics frame grabber computer
K. Grauman
Historical context
• Pinhole model: Mozi (470-390 BCE), Aristotle (384-322 BCE)
• Principles of optics (including lenses): Alhacen (965-1039 CE)
• Camera obscura: Leonardo da Vinci (1452-1519), Johann Zahn (1631-1707)
• First photo: Joseph Nicéphore Niépce (1822)
• Daguerréotypes (1839)
• Photographic film (Eastman, 1889)
• Cinema (Lumière Brothers, 1895)
• Color photography (Lumière Brothers, 1908)
• Television (Baird, Farnsworth, Zworykin, 1920s)
• First consumer camera with CCD: Sony Mavica (1981)
• First fully digital camera: Kodak DCS100 (1990)
Niepce, “La Table Servie,” 1822
CCD chip
Alhacen’s notes
Slide credit: L. Lazebnik K. Grauman
Resolution
• Sensor: size of the real-world scene element that images to a single pixel
• Image: number of pixels
• Implications:
– what analysis is feasible
– affects best representation choice
[fig from Mori et al]
Example pixel values: im[176][201] has value 164; im[194][203] has value 37. Width 520, height 500; i and j index pixel rows and columns; intensity values in [0, 255].
Digital images
K. Grauman
Color sensing in digital cameras
Source: Steve Seitz
Estimate missing components from neighboring values(demosaicing)
Bayer grid
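Demosaicing can be illustrated for a single channel. A deliberately simple sketch assuming an RGGB layout (the function and the layout choice are mine, not from the slides):

```python
import numpy as np

def demosaic_green(raw):
    """Estimate green everywhere on an RGGB Bayer mosaic (a minimal sketch).

    Green is measured where (i + j) is odd; at red/blue sites we average
    the up-to-4 measured green neighbors (simple bilinear interpolation).
    """
    h, w = raw.shape
    green = raw.astype(float).copy()
    for i in range(h):
        for j in range(w):
            if (i + j) % 2 == 0:             # red or blue site in RGGB
                nbrs = [raw[a, b] for a, b in
                        ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1))
                        if 0 <= a < h and 0 <= b < w]
                green[i, j] = sum(nbrs) / len(nbrs)
    return green
```

Real cameras interpolate all three channels, usually with edge-aware methods that avoid color fringing; this sketch only shows the basic idea of estimating missing components from neighboring values.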
Physical parameters of image formation
• Geometric– Type of projection– Camera pose
• Optical– Sensor’s lens type– focal length, field of view, aperture
• Photometric– Type, direction, intensity of light reaching sensor– Surfaces’ reflectance properties
• Sensor– sampling, etc.
Summary
• Image formation affected by geometry, photometry, and optics
• Projection equations express how world points are mapped to the 2d image
• Homogeneous coordinates allow a linear system for the projection equations
• Lenses make the pinhole model practical
• Photometry models: Lambertian, BRDF
• Digital imagers, Bayer demosaicing
• Parameters (focal length, aperture, lens diameter, sensor sampling…) strongly affect the image obtained
K. Grauman