High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for...

80
High Level Computer Vision Deep Learning for Computer Vision Part 2 Bernt Schiele - [email protected] Mario Fritz - [email protected] https://www.mpi-inf.mpg.de/hlcv

Transcript of High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for...

Page 1: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

High Level Computer Vision

Deep Learning for Computer Vision Part 2

Bernt Schiele - [email protected] Mario Fritz - [email protected]

https://www.mpi-inf.mpg.de/hlcv

Page 2: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

High Level Computer Vision - June 16, 2o16

Overview Today

• Feature Generalization ‣ “pre-training” on large dataset,

“fine-tuning” on target dataset

• Object Detection ‣ from image classification to object detection

• R-CNN - Regions with CNN features ‣ Region-based Convolutional Networks for Accurate Object Detection and Semantic

Segmentation, R. Girshick, J. Donahue, T. Darrell, J. Malik (CVPR’14, accepted in May’15 for PAMI)

‣ Region Proposal Method: Selective Search for Object Recognition, J.R.R. Uijlings, K.E.A. van de Sande, T. Gevers, A. W. M. SmeuldersIn IJCV’13.

• VGG-network - alternative to AlexNet ‣ Very Deep Convolutional Networks for Large-Scale Image Recognition,

K. Simonyan, A. Zisserman, ICLR’15

2

Page 3: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Feature Generalization

slides from: Rob Fergus, NIPS’13 tutorial

Page 4: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Training Features on Other Datasets

• Train model on ImageNet 2012 training set

• Re-train classifier on new dataset– Just the softmax layer

• Classify test set of new dataset

Page 5: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

6 training examples

Caltech 256Zeiler & Fergus, Visualizing and Understanding Convolutional Networks, arXiv 1311.2901, 2013

Page 6: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Caltech 256

[3] L. Bo, X. Ren, and D. Fox. Multipath sparse coding using hierarchical matching pursuit. In CVPR, 2013.[16] K. Sohn, D. Jung, H. Lee, and A. Hero III. Efficient learning of sparse, distributed, convolutional feature representations for object recognition. In ICCV, 2011.

Zeiler & Fergus, Visualizing and Understanding Convolutional Networks, arXiv 1311.2901, 2013

Page 7: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Object Detection

slides from: Rob Fergus, NIPS’13 tutorial

Page 8: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Detection with ConvNets

• So far, all aboutclassification

• What aboutlocalizing objectswithin the scene?

Page 9: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Sliding Window with ConvNetConv Conv Conv Conv Conv Full Full

Page 10: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Sliding Window with ConvNet

Input Window

224

224

66

256C

classes

Conv Conv Conv Conv Conv Full Full

Feature Extractor Classifier

Page 11: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Sliding Window with ConvNet

Input Window

224 6256

Conv Conv Conv Conv Conv Full Full

Feature Extractor167

240

1

No need to compute two separate windowsJust one big input window, computed in a single pass

C classes

Page 12: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

ConvNets for Detection

Feature Extractor

256

256

256

256

FeatureMaps

C=1000

Class Maps

C=1000

C=1000

Classifier

C=1000

Page 13: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

ConvNets for Detection

Feature Extractor

256

256

256

256

FeatureMaps

Class Maps

Classifier

Boat

Boat

Boat

Boat

Page 14: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

ConvNets for Detection

Feature Extractor

256

256

256

256

FeatureMaps

Class Maps

Classifier

Boat

Boat

Boat

Boat

Page 15: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

ConvNets for Detection

Feature Extractor

256

256

256

256

FeatureMaps

256

RegressionNetwork

PredictedBounding

Box

GroundTruth

Output: [X,Y,W,H]

Page 16: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Bounding Box prediction example[Sermanet et al. CVPR’14]

Page 17: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Detection Results[Sermanet et al. CVPR’14]

Page 18: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

Detection Results[Sermanet et al. CVPR’14]

Page 19: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

slides from: Ross Girschick - CVPR’14 talk

Page 20: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

20

Page 21: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

21

Page 22: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

22

Page 23: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

23

Page 24: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

24

Page 25: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

25

Page 26: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

26

Page 27: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

27

Page 28: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

28

Page 29: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

29

Page 30: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

30

Page 31: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

31

Page 32: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

32

Page 33: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

33

Page 34: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

34

Page 35: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

35

Page 36: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

High Level Computer Vision - June 16, 2o16

Region Proposal Step

36

Page 37: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

37

Page 38: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

38

Page 39: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

39

Page 40: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

40

Page 41: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

High Level Computer Vision - June 16, 2o16

Method

• compute similarity measure between all adjacent region pairs a and b (e.g.) as:

‣ withencourages small regions to merge early

‣ and are color histograms, encouraging “similar” regions to merge

‣ for slightly more elaborated similarities see their IJCV-paper

41

Ssize(a, b) = 1� size(a) + size(b)

size(image)

ak, bk

Scolor

(a, b) =nX

k=1

min(ak, bk)

S(a, b) = ↵Szize

(a, b) + �Scolor

(a, b)

Page 42: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

42

Page 43: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

43

Page 44: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

44

Page 45: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

45

Page 46: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

46

Page 47: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

High Level Computer Vision - June 16, 2o16

An Evaluation of Region Proposal Methods

• Hosang, Benenson, Dollar, Schiele @ Pami’15 • Recall (of ground truth bounding boxes) as a function of

‣ proposal method

‣ IoU (intersection over union)

‣ number of proposals per image

47

Page 48: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

48

Page 49: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

49

Page 50: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

50

Page 51: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

51

Page 52: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

52

Page 53: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

53

Page 54: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

54

Page 55: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

55

Page 56: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

56

Page 57: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

57

Page 58: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

58

Page 59: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

59

Page 60: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

60

Page 61: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

61

Page 62: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

62

Page 63: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

63

Page 64: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

64

Page 65: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

High Level Computer Vision - June 16, 2o16

Overview Today

• Feature Generalization ‣ “pre-training” on large dataset,

“fine-tuning” on target dataset

• Object Detection ‣ from image classification to object detection

• R-CNN - Regions with CNN features ‣ Region-based Convolutional Networks for Accurate Object Detection and Semantic

Segmentation, R. Girshick, J. Donahue, T. Darrell, J. Malik, PAMI’15 (accepted May 15)

‣ Region Proposal Method: Selective Search for Object Recognition, J.R.R. Uijlings, K.E.A. van de Sande, T. Gevers, A. W. M. SmeuldersIn IJCV’13.

• VGG-network - alternative to AlexNet ‣ Very Deep Convolutional Networks for Large-Scale Image Recognition,

K. Simonyan, A. Zisserman, ICLR’15

65

Page 66: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

66

Page 67: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

67

Page 68: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

68

Page 69: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

69

Page 70: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

70

Page 71: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

71

Page 72: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

72

Page 73: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

73

Page 74: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

74

Page 75: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

75

Page 76: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

76

Page 77: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

77

Page 78: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

78

Page 79: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

79

Page 80: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection ...

80