A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al...

23
A Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal – UMR6602, CNRS, Université Clermont Auvergne, SIGMA CHU de Clermont-Ferrand, Departments of Gynecologic Surgery, HPB Surgery, Hepatogastroenterology and Radiology

Transcript of A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al...

Page 1: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

A Brief Introduction to Computer Vision

Adrien Bartoli et al

Endoscopy and Computer Vision group (EnCoV)

Institut Pascal – UMR6602, CNRS, Université Clermont Auvergne, SIGMACHU de Clermont-Ferrand, Departments of Gynecologic Surgery, HPB Surgery, Hepatogastroenterology and Radiology

Page 2: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

What is it?

Page 3: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Computer Vision

Image classification

LANDSCAPE

LANDSCAPE

? ?

Page 4: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Computer Vision

Image classification

LANDSCAPE

PERSON

Object detection

LANDSCAPEEmma

Rocky

LANDSCAPE

Object recognition

Page 5: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Computer Vision

Image classification

LANDSCAPE

PERSON

Object detection

LANDSCAPEEmma

Rocky

LANDSCAPE

Object recognition

Scene description

Page 6: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

How Far is the Tower?

Page 7: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Computer Vision

Image classification

LANDSCAPE

PERSON

Object detection

LANDSCAPEEmma

Rocky

LANDSCAPE

Object recognition

Size measurementDepth measurement Dynamics measurement

39.9 ± 0.1 m

92.2 ± 0.3 m

5.2 ± 0.6 km/h

Scene description

Page 8: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Depth measurement

Computer Vision

Image classification

LANDSCAPE

PERSON

Object detection

LANDSCAPEEmma

Rocky

LANDSCAPE

Object recognition

Size measurement Dynamics measurement

39.9 ± 0.1 m

92.2 ± 0.3 m

5.2 ± 0.6 km

Scene description

Scene measurement

Page 9: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Computer Vision

PERSON

CAR

5.3 ± 0.1 m

5.2 ± 0.6 km/h

dynamics

distance

Page 10: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Analogies between Computer Vision and Biological Vision

camera

memoryprocessor

PACS

Page 11: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Computer Vision

… is more concerned with the processing than with the physical camera

… does not study biological vision

… attempts to replicate some capabilities of human vision, but not only

Page 12: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Related Fields, Application Examples

Computer VisionPhysics

Artificial IntelligenceMathematics Computer Science Medical Imaging

Computational Geometry

Computer Graphics

Self-driving carsImage retrieval Augmented Reality

Scene description Scene measurement

?• adenoma• serrated adenoma• hyperplastic

Scene description

Scene measurement

9,5 mm2

Page 13: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Image Retrieval

?

Page 14: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Image Retrieval from Descriptors

Hand-crafted Learnt

Page 15: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Computer-Aided Colonoscopy: Polyp Type Recognition

Classifications: Paris, Kudo, Sano, Hiroshima, etc

?• adenoma• serrated adenoma• hyperplastic

2 - analysis

virtualbiopsy

1 - learning

Juniors (between 0 and 4 years practice): 72.2%Seniors (more than 8 years practice): 76.5%Machine: 82.4%

+ biopsy + biopsy + biopsy + biopsy + biopsy + biopsy

+ biopsy + biopsy + biopsy + biopsy + biopsy + biopsy

+ biopsy + biopsy + biopsy + biopsy + biopsy + biopsy

76 polyps

Page 16: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Scene Measurement

Agarwal et al, University of Washington

Page 17: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Visual Cues

Escaping Criticism by del Caso, 1874

Stereo Blur Shadow Silhouette Occlusions Shading

Johnson et al, MIT Cate et al, Virginia Tech

Page 18: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Computer-Aided Colonoscopy: Polyp Measurement

Escaping Criticism by del Caso, 1874

Stereo Blur Shadow Silhouette Occlusions Shading

Distance (mm)

Qu

anti

tyo

f b

lur2 - estimate scale

from focus blur

3,3 mm

9,5 mm2 3,8 mm 3,2 mm

Precision: about 0.3 mm(evaluated on a porcine model)

1 - estimate 3D shape from multiple images

Page 19: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Computer-Aided Laparoscopy: Augmented Reality

Escaping Criticism by del Caso, 1874

Stereo Blur Shadow Silhouette Occlusions Shading

Page 20: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Current and Future Trends

Computer Vision has been a combination of modeling, hand-crafting/tuning and machine learning…

…with Deep Neural Networks, Computer Vision is now emphasizingmachine learning and displacing the modeling and hand-crafting

Page 21: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

Current and Future Trends

Computer Vision has been a combination of modeling, hand-crafting/tuning and machine learning…

…with Deep Neural Networks, Computer Vision is now emphasizingmachine learning and displacing the modeling and hand-crafting

Page 22: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

A Brief Introduction to Computer Vision

Adrien Bartoli et al

Endoscopy and Computer Vision group (EnCoV)

Institut Pascal – UMR6602, CNRS, Université Clermont Auvergne, SIGMACHU de Clermont-Ferrand, Departments of Gynecologic Surgery, HPB Surgery, Hepatogastroenterology and Radiology

Page 23: A Brief Introduction to Computer VisionA Brief Introduction to Computer Vision Adrien Bartoli et al Endoscopy and Computer Vision group (EnCoV) Institut Pascal –UMR6602, CNRS, Université

We live in a world where digital visual data are ubiquitous. These data are very diverse in imaging modality and contents,ranging from one’s holiday pictures to medical radiological images. As humans, we naturally use visual data to inferinformation. For instance, we are extremely good at recognizing people and places from conventional images orunderstanding a preoperative CT scan. Over the last few decades, a fundamental scientific question has emerged: can wetransfer the sense of vision to a computer? In other words, can we program a computer to see by understanding visualdata? This question lies at the heart of computer vision.

Computer vision is a scientific discipline which studies the automated interpretation of digital visual data. It is primarily abranch of computer science but is also strongly interdisciplinary, as it uses physics, geometry, optimization and artificialintelligence, to name but a few. Computer vision achieves results by modeling the visual cues and understanding theirrelationship to the target task or by learning from data.

Broadly speaking, the typical tasks in computer vision fall in the categories of scene description and 3D perception. Theformer concerns object detection and recognition: who was in this picture? where was it taken from? which organs areshown in this CT? and so on. The latter concerns 3D localization and measurements: what was the 3D shape of thatobject? how much did the camera move in this video? how big was this lesion as seen in this endoscopy image? and soon. As humans, we are typically doing very well in scene description but much worse in 3D perception. For instance, canyou tell quantitatively how big a tumor is just by looking at your laparoscopy screen? Under some circumstances, acomputer can, and will do it accurately. Interestingly, there is a number of tasks at which the computer may nowadaysoutperform the humans.

In this presentation, I will review the original and recent approaches to computer vision and focus on describing themodel based approach to some relevant tasks in 3D perception. I will show how one can make accurate quantitative 3Dmeasurements from images and illustrate this by examples in laparoscopy and colonoscopy.

Abstract