Components of a computer vision system Lighting Scene Camera Computer Scene Interpretation Srinivasa...

download Components of a computer vision system Lighting Scene Camera Computer Scene Interpretation Srinivasa Narasimhan’s slide.

If you can't read please download the document

Transcript of Components of a computer vision system Lighting Scene Camera Computer Scene Interpretation Srinivasa...

  • Slide 1
  • Components of a computer vision system Lighting Scene Camera Computer Scene Interpretation Srinivasa Narasimhans slide
  • Slide 2
  • Computer vision vs Human Vision What we seeWhat a computer sees Srinivasa Narasimhans slide
  • Slide 3
  • A little story about Computer Vision In 1966, Marvin Minsky at MIT asked his undergraduate student Gerald Jay Sussman to spend the summer linking a camera to a computer and getting the computer to describe what it saw. We now know that the problem is slightly more difficult than that. (Szeliski 2009, Computer Vision)
  • Slide 4
  • A little story about Computer Vision In 1966, Marvin Minsky at MIT asked his undergraduate student Gerald Jay Sussman to spend the summer linking a camera to a computer and getting the computer to describe what it saw. We now know that the problem is slightly more difficult than that. (Szeliski 2009, Computer Vision) Founder, MIT AI project
  • Slide 5
  • A little story about Computer Vision In 1966, Marvin Minsky at MIT asked his undergraduate student Gerald Jay Sussman to spend the summer linking a camera to a computer and getting the computer to describe what it saw. We now know that the problem is slightly more difficult than that. (Szeliski 2009, Computer Vision) Founder, MIT AI project Professor of Electrical Engineering, MIT
  • Slide 6
  • A little story about Computer Vision In 1966, Marvin Minsky at MIT asked his undergraduate student Gerald Jay Sussman to spend the summer linking a camera to a computer and getting the computer to describe what it saw. We now know that the problem is slightly more difficult than that. (Szeliski 2009, Computer Vision) Image Understanding
  • Slide 7
  • A little story about Computer Vision In 1966, Marvin Minsky at MIT asked his undergraduate student Gerald Jay Sussman to spend the summer linking a camera to a computer and getting the computer to describe what it saw. We now know that the problem is slightly more difficult than that. (Szeliski 2009, Computer Vision) Image Understanding Image Sensing
  • Slide 8
  • Continue on CAPTCHA CAPTCHA stands for "Completely Automated Public Turing test to Tell Computers and Humans Apart". Picture of a CAPTCHA in use at Yahoo. http://www.cs.sfu.ca/~mori/research/gimpy/
  • Slide 9
  • Breaking a Visual CAPTCHA http://www.cs.sfu.ca/~mori/research/gimpy/ On EZ-Gimpy: a success rate of 176/191=92%! Other examples http://www.cs.sfu.ca/~mori/research/gimpy/ez /
  • Slide 10
  • Breaking a Visual CAPTCHA http://www.cs.sfu.ca/~mori/research/gimpy/ On more difficult Gimpy: a success rate of 33%! Other examples http://www.cs.sfu.ca/~mori/research/gimpy/ha rd/
  • Slide 11
  • Breaking a Visual CAPTCHA YAHOOs current CAPTCHA format http://en.wikipedia.org/wiki/CAPTCHA
  • Slide 12
  • Face Detection and Recognition Applications: Security, Law Enforcement, Surveillance
  • Slide 13
  • Face Detection and Recognition Smart cameras: auto focus, red eye removal, auto color correction
  • Slide 14
  • Face Detection and Tracking
  • Slide 15
  • Slide 16
  • Lexus LS600 Driver Monitor System
  • Slide 17
  • General Motion Tracking Hidden Dragon Crouching Tiger
  • Slide 18
  • General Motion Tracking Application Andy Serkis, Gollum, Lord of the Rings
  • Slide 19
  • Segmentation http://www.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/
  • Slide 20
  • Segmentation using Graph Cuts Application Medical Image Processing
  • Slide 21
  • Segmentation using Graph Cuts Input Matting: Soft Segmentation Composition
  • Slide 22
  • Segmentation using Graph Cuts State-of-the-art Tool (videosnapcut.mp4) http://juew.org/projects/SnapCut/snapcut.htm
  • Slide 23
  • From 2D to 3D http://www.eecs.harvard.edu/~zickler/helmholtz.html
  • Slide 24
  • Projective Geometry
  • Slide 25
  • Single View Metrology http://research.microsoft.com/vision/cambridg e/3d/default.htm
  • Slide 26
  • Single View Metrology http://research.microsoft.com/vision/cambridg e/3d/default.htm
  • Slide 27
  • Stereo scene point optical center image plane
  • Slide 28
  • Stereo Basic Principle: Triangulation Gives reconstruction as intersection of two rays Requires Camera positions point correspondence
  • Slide 29
  • Using 3D structure to organize photos http://phototour.cs.washington.edu/
  • Slide 30
  • Using 3D structure to organize photos http://photosynth.net/
  • Slide 31
  • Reconstructing detailed 3D models example input imagerendered model
  • Slide 32
  • Reconstructing detailed 3D models example input imagerendered model
  • Slide 33
  • Reconstructing detailed 3D models example input imagerendered model http://grail.cs.washington.edu/ projects/mvscpc/
  • Slide 34
  • Reconstructing detailed 3D models example input imagerendered model
  • Slide 35
  • Reconstructing detailed 3D models example input imagerendered model
  • Slide 36
  • Application: View morphing
  • Slide 37
  • Slide 38
  • From Static Statues to Dynamic Targets http://research.microsoft.com/~larryz/videoviewinterpolation.htm || MSR Image based Reality Project
  • Slide 39
  • Video Projectors Color Cameras Black & White Cameras Spacetime Face Capture System
  • Slide 40
  • System in Action
  • Slide 41
  • Input Videos (640 480, 60fps)
  • Slide 42
  • Spacetime Stereo Reconstruction
  • Slide 43
  • Applications Entertainment: Games & Movies Medical Practice: Prosthetics
  • Slide 44
  • Computational Photography High Dynamic Range Conventional ImageHigh Dynamic Range Image Nayar et al 2002
  • Slide 45
  • Computational Photography High Dynamic Range High Dynamic Range Image Nayar et al 2002 Sensor Optics Modulator Assorted-pixel camera
  • Slide 46
  • Computational Photography High Dynamic Range Digital Gain Adjustment Handheld camera
  • Slide 47
  • Computational Photography High Dynamic Range High Dynamic Range Image Zhang et al 2010 Handheld camera
  • Slide 48
  • Summary Recognize things Reconstruct 3D structures Enhance Photography
  • Slide 49
  • If you are interested in, Courses: CS766 Computer Vision CS638 Special Topics Computational Photography CS638 Special Topics Computational Methods in Medical Image Analysis Faculty: Chuck Dyer, Vikas Singh, Li Zhang Major Conferences: Computer Vision and Pattern Recognition (CVPR) International Conference on Computer Vision (ICCV) European Conference on Computer Vision (ECCV) ACM SIGGRAPH Conference (SIGGRAPH)