Stereo and 3D Vision

Lecture 08

How do we get 3D from Stereo Images?

[Figure: left image and right image of the same 3D point]

disparity: the difference in image location of the same 3D point when projected under perspective to two different cameras

d = xleft - xright

Perception of depth arises from “disparity” of a given 3D point in your right and left retinal images

Recall (from Lecture 2): Perspective Projection

[Figure: perspective projection through the camera lens. O is the center of projection. One axis belongs to the real image plane, behind O; the other belongs to the front image plane, which we use. The 3D object point B projects to the real image point and to the image of point B in the front image.]

xi / f = x / z (from similar triangles)

(Note: For convenience, we orient the Z axis as above and use f instead of –f as in Lecture 2)

Projection for Stereo Images

Simple Model: Optic axes of 2 cameras are parallel

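[Figure: two cameras separated by the baseline b, with image planes at distance f along the z axis, viewing the point P=(x,z). The Y-axis is perpendicular to the page.]

xl / f = x / z

xr / f = (x - b) / z

yl / f = yr / f = y / z

(from similar triangles)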
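The similar-triangle relations for the parallel-axis model can be checked numerically. The sketch below assumes the left camera center is at the origin and the right camera center is at (b, 0, 0); the function name `project_stereo` and the sample values are illustrative and do not come from the lecture.

```python
def project_stereo(x, y, z, f, b):
    """Project the 3D point (x, y, z) into two parallel-axis cameras.

    Assumed setup: left camera center at the origin, right camera center
    at (b, 0, 0), both with focal length f and optic axes along +z.
    Returns ((xl, yl), (xr, yr)).
    """
    if z <= 0:
        raise ValueError("point must be in front of the cameras (z > 0)")
    xl = f * x / z          # xl / f = x / z
    xr = f * (x - b) / z    # xr / f = (x - b) / z
    yl = yr = f * y / z     # yl / f = yr / f = y / z
    return (xl, yl), (xr, yr)


# Illustrative values: f in pixels, lengths in meters.
(xl, yl), (xr, yr) = project_stereo(x=0.5, y=0.2, z=4.0, f=800.0, b=0.1)
print("disparity:", xl - xr, "expected f*b/z:", 800.0 * 0.1 / 4.0)
```

Subtracting the two x equations gives xl - xr = f*b/z, which is the relation the triangulation formula inverts.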

3D from Stereo Images: Triangulation

For stereo cameras with parallel optical axes, focal length f, baseline b, corresponding image points (xl,yl) and (xr,yr), the location of the 3D point can be derived from the previous slide’s equations:

Depth z = f*b / (xl - xr) = f*b/d

x = xl*z/f or b + xr*z/f

y = yl*z/f or yr*z/f

This method of determining depth from disparity d is called triangulation.

Note that depth is inversely proportional to disparity
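As a minimal sketch of triangulation under the parallel-axis model, the function below applies the three equations directly. The name `triangulate` and the numeric values are assumptions chosen for illustration; f is taken in pixels and b in meters.

```python
def triangulate(xl, yl, xr, yr, f, b):
    """Recover (x, y, z) from corresponding points in a parallel-axis stereo pair.

    Uses z = f*b/d with d = xl - xr, then x = xl*z/f and y = yl*z/f.
    yr is accepted for symmetry; in this model yr equals yl.
    """
    d = xl - xr
    if d <= 0:
        raise ValueError("disparity must be positive for a point in front of the cameras")
    z = f * b / d
    x = xl * z / f
    y = yl * z / f
    return x, y, z


# Illustrative values: a disparity of 20 pixels, f = 800 pixels, b = 0.1 m.
print(triangulate(100.0, 40.0, 80.0, 40.0, f=800.0, b=0.1))  # roughly (0.5, 0.2, 4.0)

# Halving the disparity doubles the depth (inverse proportionality).
print(triangulate(100.0, 40.0, 90.0, 40.0, f=800.0, b=0.1)[2])  # roughly 8.0
```

Because z varies as 1/d, a fixed one-pixel matching error costs little depth accuracy for near points (large d) and a lot for far points (small d).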

Two main problems:

1. Need to know focal length f, baseline b
   - use prior knowledge or camera calibration

2. Need to find corresponding point (xr,yr) for each (xl,yl) ⇒ Correspondence problem


Solving the stereo correspondence problem

1. Cross correlation or SSD using small windows.

2. Symbolic feature matching, usually using segments/corners.

3. Use the newer interest operators, e.g., SIFT.

(Feature-based methods such as 2 and 3 give sparse correspondences.)

Given a point in the left image, do you need to search the entire right image for the corresponding point?

Epipolar Constraint for Correspondence

[Figure: camera centers C1 and C2 separated by baseline b, the 3D point P, and the epipolar plane]

Epipolar plane = plane connecting C1, C2, and point P

Epipolar plane cuts through image planes, forming an epipolar line in each plane

Match for P1 (or P2) in the other image must lie on epipolar line

Epipolar Constraint for Correspondence

[Figure: camera centers C1 and C2 separated by baseline b, with the epipolar line marked]

Match for P1 in the other image must lie on epipolar line

So need search only along this line

What if the optical axes of the 2 cameras are not parallel to each other?

Epipolar constraint still holds…

e1 and e2 are the epipolar lines for point P

But the epipolar lines may no longer be horizontal

Java demo: http://www.ai.sri.com/~luong/research/Meta3DViewer/EpipolarGeo.html

Example

Yellow epipolar lines for the three points shown on the left image

Given a point P1 in the left image on epipolar line e1, we can find epipolar line e2 provided we know the relative orientations of the cameras ⇒ requires camera calibration (see Lecture 2)

(from a slide by Pascal Fua)
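One common way to turn known relative orientation into an epipolar line is the fundamental matrix, which the slides do not name; the sketch below is offered under that assumption. It takes calibrated intrinsics K1 and K2 and a relative pose (R, t) such that a point with camera-1 coordinates X1 has camera-2 coordinates X2 = R @ X1 + t. All function names are illustrative.

```python
import numpy as np


def skew(t):
    """3x3 matrix such that skew(t) @ v equals the cross product t x v."""
    tx, ty, tz = t
    return np.array([[0.0, -tz, ty],
                     [tz, 0.0, -tx],
                     [-ty, tx, 0.0]])


def fundamental_matrix(K1, K2, R, t):
    """F with p2^T F p1 = 0 for corresponding homogeneous pixels p1, p2.

    Assumes X2 = R @ X1 + t maps camera-1 coordinates to camera-2 coordinates.
    """
    E = skew(t) @ R  # essential matrix
    return np.linalg.inv(K2).T @ E @ np.linalg.inv(K1)


def epipolar_line(F, p1):
    """Line (a, b, c) in image 2, a*u + b*v + c = 0, for pixel p1 = (u, v) in image 1.

    Scaled so that (a, b) has unit length, which makes a*u + b*v + c a
    distance in pixels.
    """
    line = F @ np.array([p1[0], p1[1], 1.0])
    return line / np.linalg.norm(line[:2])


# Parallel cameras with baseline 0.1: R = I and t = (-0.1, 0, 0),
# since x in the right camera frame is x - b.
K = np.array([[800.0, 0.0, 320.0],
              [0.0, 800.0, 240.0],
              [0.0, 0.0, 1.0]])
F = fundamental_matrix(K, K, np.eye(3), np.array([-0.1, 0.0, 0.0]))
print(epipolar_line(F, (350.0, 200.0)))  # a is 0: a horizontal line on row v = 200
```

For the parallel-axis case the line is horizontal and on the same row as P1; with a rotated second camera the same code returns a slanted line.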

Alternate approach: Stereo image rectification

• Reproject image planes onto a common plane parallel to the line between optical centers

• Epipolar line is horizontal after this transformation

• Two homographies (3x3 transforms), one for each input image reprojection, are computed. See: C. Loop and Z. Zhang. Computing Rectifying Homographies for Stereo Vision. IEEE Conf. Computer Vision and Pattern Recognition, 1999.

Example

After rectification, need only search for matches along horizontal scan line

(adapted from slide by Pascal Fua)

Your basic stereo algorithm

For each epipolar line
  For each pixel in the left image
    • compare with every pixel on same epipolar line in right image
    • pick pixel with minimum match cost

Improvement: match windows

A good survey and evaluation: http://vision.middlebury.edu/stereo/

Matching using Sum of Squared Differences (SSD)

[Figure: stereo matching based on SSD. The SSD cost is evaluated as a function of disparity d; its minimum gives the best matching disparity.]
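The basic algorithm with window matching can be sketched as a brute-force search. This is a slow reference version, assuming a rectified grayscale pair so that epipolar lines are image rows; the name `ssd_disparity` and its parameters `max_disp` and `window` are illustrative.

```python
import numpy as np


def ssd_disparity(left, right, max_disp, window=5):
    """Dense disparity map by SSD window matching on a rectified pair.

    For each left pixel, compares its window with windows on the same row
    of the right image and keeps the disparity with the minimum SSD.
    Border pixels without a full window are left at 0.
    """
    left = np.asarray(left, dtype=np.float64)
    right = np.asarray(right, dtype=np.float64)
    h, w = left.shape
    r = window // 2
    disparity = np.zeros((h, w), dtype=np.int64)
    for row in range(r, h - r):
        for col in range(r, w - r):
            patch_l = left[row - r:row + r + 1, col - r:col + r + 1]
            best_cost, best_d = np.inf, 0
            # d = xl - xr >= 0, so candidates lie at or to the left of col.
            for d in range(0, min(max_disp, col - r) + 1):
                patch_r = right[row - r:row + r + 1,
                                col - d - r:col - d + r + 1]
                cost = np.sum((patch_l - patch_r) ** 2)
                if cost < best_cost:
                    best_cost, best_d = cost, d
            disparity[row, col] = best_d
    return disparity


# Synthetic check: the left image is the right image shifted by 3 pixels.
rng = np.random.default_rng(0)
right_img = rng.random((12, 30))
left_img = np.roll(right_img, 3, axis=1)
print(ssd_disparity(left_img, right_img, max_disp=8, window=5)[6, 10:20])
```

Setting `window=1` reduces this to the per-pixel comparison of the basic algorithm; larger windows average over more pixels before picking the minimum.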

Problems with window size

• Smaller window
  + Good precision, more detail
  – Sensitive to noise

• Larger window
  + Robust to noise
  – Reduced precision, less detail

[Figure: effect of window size W; result shown for W = 20]

Example depth from stereo results

[Figure: input stereo pair of the scene, with ground truth]

• Data from University of Tsukuba

Results with window-based stereo matching

[Figure: window-based matching (best window size) compared with ground truth]

Better methods exist...

State of the art method: Boykov et al., Fast Approximate Energy Minimization via Graph Cuts, International Conference on Computer Vision, 1999

[Figure: result compared with ground truth]

For the latest and greatest: http://www.middlebury.edu/stereo/

What will cause errors?

• Camera calibration errors
• Poor image resolution
• Occlusions
• Violations of brightness constancy (specular reflections)
• Large motions
• Low-contrast image regions

Stereo reconstruction pipeline

Steps

• Calibrate cameras
• Rectify images
• Compute disparity
• Estimate depth
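The last pipeline step, going from a disparity map to a depth map, can be sketched as follows. It assumes a rectified parallel-axis pair, f in pixels, and b in the desired depth unit; marking non-positive disparities as infinitely far is one possible convention, not something the lecture prescribes.

```python
import numpy as np


def disparity_to_depth(disparity, f, b):
    """Convert a disparity map to a depth map with z = f*b/d.

    Pixels with disparity <= 0 (no valid match, or a point at infinity)
    are assigned a depth of infinity.
    """
    disparity = np.asarray(disparity, dtype=np.float64)
    depth = np.full(disparity.shape, np.inf)
    valid = disparity > 0
    depth[valid] = f * b / disparity[valid]
    return depth


# Illustrative values: f = 800 pixels, b = 0.1 m.
print(disparity_to_depth([[20, 10], [0, 40]], f=800.0, b=0.1))
```

Calibration supplies f and b, rectification makes the row-wise search valid, and the matcher supplies the disparity array passed in here.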

What if the 3D object has little or no texture?

Matching points might be difficult or impossible

Can we still recover depth information?

Idea: Use structured light!

Disparity between laser points on the same scanline in the images determines the 3-D coordinates of the laser point on object

http://www.cs.wright.edu/~agoshtas/stereorange.html

Recovered 3D Model

http://www.cs.wright.edu/~agoshtas/stereorange.html

Actually, we can make do with just 1 camera

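[Figure: a single camera and a laser projector; the laser beam leaves the projector at angle θ]

Virtual “image” plane intersecting the laser beam at the same distance as the camera’s image plane

From calibration of both camera and light projector, we can compute the 3D coordinates of laser points on the surface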
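The single-camera case can be sketched by treating the laser as a second "camera" whose virtual image coordinate follows from the beam angle. The geometry here is an assumption made for illustration: camera center at the origin looking along +z, laser source at (b, 0), and θ measured from the z axis with positive angles tilting toward +x. Under that convention the beam crosses the virtual image plane at f*tan(θ), and the stereo depth formula applies unchanged.

```python
import math


def laser_depth(xl, f, b, theta):
    """Depth of a laser spot observed at image coordinate xl by one camera.

    Assumed geometry: camera at the origin, laser source at (b, 0), beam
    tilted by theta radians from the z axis (positive toward +x).
    """
    xr_virtual = f * math.tan(theta)  # beam position on the virtual image plane
    d = xl - xr_virtual
    if d <= 0:
        raise ValueError("beam and viewing ray do not intersect in front of the camera")
    return f * b / d


# Illustrative check: a spot at (x, z) = (0.5, 4.0) with the laser at b = 1.0.
theta = math.atan((0.5 - 1.0) / 4.0)  # beam direction from source to spot
xl = 800.0 * 0.5 / 4.0                # where the camera sees the spot
print(laser_depth(xl, f=800.0, b=1.0, theta=theta))  # roughly 4.0
```

The angle θ replaces the correspondence search: calibration of the projector says where the beam goes, so only the camera observation xl has to be measured.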

The Digital Michelangelo Project

Optical triangulation

• Project a single stripe of laser light
• Scan it across the surface of the object

[Figure: a laser with a cylindrical lens produces a laser sheet on the object; the stripe is imaged on a CCD image plane; an arrow marks the direction of travel]

http://graphics.stanford.edu/projects/mich/

Laser scanned models

The Digital Michelangelo Project, Levoy et al.

