
UC Berkeley in CVPR'15, PAMI'16

A Fuller Understanding of Fully Convolutional Networks

Evan Shelhamer*, Jonathan Long*, Trevor Darrell

pixels in, pixels out

semantic segmentation


monocular depth + normals Eigen & Fergus 2015

boundary prediction Xie & Tu 2015

optical flow Fischer et al. 2015

colorization Zhang et al. 2016


“tabby cat”

1000-dim vector

< 1 millisecond

convnets perform classification

end-to-end learning


~1/10 second

end-to-end learning

???

lots of pixels, little time?

“tabby cat”


a classification network


becoming fully convolutional


becoming fully convolutional

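To make the "becoming fully convolutional" step concrete, here is a minimal PyTorch sketch (the released FCN code used Caffe; torchvision's VGG-16 stands in here as an assumption): the classifier's fully connected layers are reshaped into convolutions, so the net accepts inputs of any size and emits a grid of class scores instead of a single vector.

```python
# Sketch: turn a classifier's fully connected layers into convolutions.
import torch
import torch.nn as nn
from torchvision.models import vgg16

vgg = vgg16()                        # load pretrained weights in practice
features = vgg.features              # conv/pool layers are kept unchanged
fc6, fc7, fc8 = vgg.classifier[0], vgg.classifier[3], vgg.classifier[6]

conv6 = nn.Conv2d(512, 4096, kernel_size=7)      # fc6 weights viewed as a 7x7 conv
conv6.weight.data.copy_(fc6.weight.data.view(4096, 512, 7, 7))
conv6.bias.data.copy_(fc6.bias.data)

conv7 = nn.Conv2d(4096, 4096, kernel_size=1)     # fc7 becomes a 1x1 conv
conv7.weight.data.copy_(fc7.weight.data.view(4096, 4096, 1, 1))
conv7.bias.data.copy_(fc7.bias.data)

score = nn.Conv2d(4096, 1000, kernel_size=1)     # fc8 becomes a 1x1 scoring conv
score.weight.data.copy_(fc8.weight.data.view(1000, 4096, 1, 1))
score.bias.data.copy_(fc8.bias.data)

fully_conv = nn.Sequential(features, conv6, nn.ReLU(inplace=True),
                           conv7, nn.ReLU(inplace=True), score)

# a 500x500 image now yields a coarse spatial map of scores, not one vector
out = fully_conv(torch.randn(1, 3, 500, 500))
print(out.shape)   # torch.Size([1, 1000, 9, 9])
```

The weights themselves are untouched; only their interpretation changes, which is why a pretrained classifier converts directly.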

upsampling output

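One way to realize the upsampling step, following the paper's recipe, is a transposed ("deconvolution") layer whose kernel is initialized to bilinear interpolation and can then be learned end to end. A hedged PyTorch sketch (layer names and the 21-class setting are assumptions):

```python
# Sketch: in-network upsampling as a bilinear-initialized transposed convolution.
import numpy as np
import torch
import torch.nn as nn

def bilinear_kernel(channels, size):
    """Per-channel bilinear upsampling weights for a transposed convolution."""
    factor = (size + 1) // 2
    center = factor - 1 if size % 2 == 1 else factor - 0.5
    og = np.ogrid[:size, :size]
    filt = (1 - abs(og[0] - center) / factor) * (1 - abs(og[1] - center) / factor)
    weight = np.zeros((channels, channels, size, size), dtype=np.float32)
    weight[range(channels), range(channels), :, :] = filt
    return torch.from_numpy(weight)

n_classes = 21   # PASCAL VOC: 20 classes + background
# upsample stride-32 score maps by 32x: kernel 64, stride 32
upscore = nn.ConvTranspose2d(n_classes, n_classes, kernel_size=64, stride=32, bias=False)
upscore.weight.data.copy_(bilinear_kernel(n_classes, 64))
# the kernel can stay fixed (pure interpolation) or be refined by backprop
```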

end-to-end, pixels-to-pixels network

conv, pool, nonlinearity

upsampling

pixelwise output + loss


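With dense output in hand, end-to-end training reduces to an ordinary classification loss applied at every pixel. A small sketch, assuming PASCAL-style labels where 255 marks unlabeled pixels:

```python
# Sketch: the pixelwise loss that makes the whole pipeline trainable end to end.
import torch
import torch.nn.functional as F

scores = torch.randn(1, 21, 500, 500, requires_grad=True)  # upsampled class scores
labels = torch.randint(0, 21, (1, 500, 500))               # per-pixel ground truth
loss = F.cross_entropy(scores, labels, ignore_index=255)   # 255 = unlabeled (VOC convention)
loss.backward()   # gradients flow back through upsampling, conv, and pooling
```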

spectrum of deep features

combine where (local, shallow) with what (global, deep)

fuse features into deep jet

(cf. Hariharan et al. CVPR'15 “hypercolumn”)

skip layers

skip to fuse layers!

interp + sum

interp + sum

dense output
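A sketch of one such skip in the FCN-16s pattern (layer names and kernel sizes are illustrative, not the exact released definitions): score the finer pool4 features with a 1×1 convolution, upsample the coarser stride-32 scores 2×, sum, then upsample the fused map 16×.

```python
# Sketch of "interp + sum" skip fusion (FCN-16s style).
import torch
import torch.nn as nn

n_classes = 21
score_pool4 = nn.Conv2d(512, n_classes, kernel_size=1)                                      # "where": finer, shallower
upscore2 = nn.ConvTranspose2d(n_classes, n_classes, kernel_size=4, stride=2, bias=False)    # interp x2
upscore16 = nn.ConvTranspose2d(n_classes, n_classes, kernel_size=32, stride=16, bias=False) # interp x16

def fuse(score32, pool4):
    """interp + sum: combine deep, coarse scores with shallow, fine scores."""
    coarse = upscore2(score32)        # stride 32 scores -> stride 16
    fine = score_pool4(pool4)         # stride-16 scores from pool4
    # crop to a common size before summing (the real net uses exact offsets)
    h = min(coarse.size(2), fine.size(2))
    w = min(coarse.size(3), fine.size(3))
    return upscore16(coarse[:, :, :h, :w] + fine[:, :, :h, :w])
```

Adding a second skip from pool3 in the same way gives the stride-8 variant.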

end-to-end, joint learning of semantics and location

[figure: outputs at stride 32 (no skips), stride 16 (1 skip), and stride 8 (2 skips), alongside the input image and ground truth]

skip layer refinement


skip FCN computation: Stage 1 (60.0 ms), Stage 2 (18.7 ms), Stage 3 (23.0 ms)

A multi-stream network that fuses features/predictions across layers

[figure: example results; columns are FCN, SDS*, ground truth, and input]


Relative to prior state-of-the-art SDS:

- 30% relative improvement for mean IoU

- 286× faster

*Simultaneous Detection and Segmentation, Hariharan et al. ECCV'14

leaderboard

== segmentation with Caffe


[leaderboard figure: FCN marked beside nearly every entry]


care and feeding of fully convolutional networks


- train full image at a time without sampling

- reshape network to take input of any size

- forward time is ~100 ms for 500 × 500 × 21 output (on a Titan X GPU)

usage

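The "input of any size" point is a direct consequence of every layer being convolutional: the output grid simply resizes with the input. A toy sketch (a stand-in network, not the real model):

```python
# Sketch: a fully convolutional net produces dense output at whatever size it is given,
# so whole images are used directly, one per batch, without patch sampling.
import torch
import torch.nn as nn

fcn = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Conv2d(16, 21, kernel_size=1),
    nn.Upsample(scale_factor=2, mode='bilinear', align_corners=False),
)

for h, w in [(256, 256), (384, 512), (500, 500)]:
    scores = fcn(torch.randn(1, 3, h, w))
    print(scores.shape)   # (1, 21, h, w): per-pixel scores sized to each input
```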

image-to-image optimization


momentum and batch size

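One rough way to relate the two (a back-of-the-envelope sketch): with momentum p and batch size k, each example's gradient is decayed by a factor of p once per k examples seen, so batch size 1 with momentum p' ≈ p^(1/k) averages gradients over roughly the same span of data. For example, batch 20 at momentum 0.9 corresponds to batch 1 at about 0.9^(1/20) ≈ 0.995, which is why whole-image, batch-size-1 training pairs naturally with unusually high momentum.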

sampling images? no need! no improvement from sampling across images


sampling pixels? no need! no improvement from (partially) decorrelating pixels

[figure: uniform vs. Poisson pixel sampling]

context?


- do FCNs incorporate contextual cues?

- loses 3-4 percentage points when the background is masked

- can learn from BG/shape alone if forced to!

- standard: 85 IU; BG alone: 38 IU; shape alone: 29 IU

past and future history of fully convolutional networks


history

Convolutional Locator Network, Wolf & Platt 1994

Space Displacement Network, Matan & LeCun 1992


Scale Pyramid, Burt & Adelson ‘83

pyramids


The scale pyramid is a classic multi-resolution representation

Fusing multi-resolution network layers is a learned, nonlinear counterpart


Jet, Koenderink & Van Doorn ‘87

jets

The local jet collects the partial derivatives at a point for a rich local description

The deep jet collects layer compositions for a rich, learned description


extensions

- detection + instances
- structured output
- weak supervision


detection: fully conv. proposals

Fast R-CNN, Girshick ICCV'15

Faster R-CNN, Ren et al. NIPS'15

end-to-end detection by proposal FCN + RoI classification

fully conv. nets + structured output

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs. Chen* & Papandreou* et al. ICLR 2015.

fully conv. nets + structured output

Conditional Random Fields as Recurrent Neural Networks. Zheng* & Jayasumana* et al. ICCV 2015.

dilation for structured output

Multi-Scale Context Aggregation by Dilated Convolutions. Yu & Koltun. ICLR 2016.

- enlarge effective receptive field with the same no. of params

- raise resolution

- convolutional context model: similar accuracy to CRF but non-probabilistic

[ comparison credit: CRF as RNN, Zheng* & Jayasumana* et al. ICCV 2015 ]

DeepLab: Chen* & Papandreou* et al. ICLR 2015. CRF-RNN: Zheng* & Jayasumana* et al. ICCV 2015.
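A hedged sketch of the mechanism: dilated (à trous) convolution spreads the kernel taps apart, enlarging the field of view without extra parameters and without reducing resolution. In PyTorch this is just the dilation argument of Conv2d:

```python
# Sketch: same 3x3 kernel, growing receptive field, fixed parameter count and output size.
import torch
import torch.nn as nn

x = torch.randn(1, 64, 64, 64)
for d in (1, 2, 4):
    conv = nn.Conv2d(64, 64, kernel_size=3, padding=d, dilation=d)
    n_params = sum(p.numel() for p in conv.parameters())
    # field of view grows (3x3 -> 5x5 -> 9x9) while weights and resolution stay fixed
    print(d, conv(x).shape, n_params)
```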

fully conv. nets + weak supervision

Constrained Convolutional Neural Networks for Weakly Supervised Segmentation. Pathak et al. arXiv 2015.

FCNs expose a spatial loss map to guide learning: segment from tags by MIL or pixelwise constraints


fully conv. nets + weak supervision

BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation. Dai et al. 2015.

FCNs expose a spatial loss map to guide learning: mine boxes + feedback to refine masks


fully conv. nets + weak supervision

FCNs can learn from sparse annotations == sampling the loss

What's the Point? Semantic Segmentation with Point Supervision. Bearman et al. ECCV 2016.
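Sparse point labels fit the same machinery: they simply sample the pixelwise loss, with unlabeled pixels masked out so they contribute no gradient. A small sketch, reusing the 255 = "unlabeled" convention assumed earlier:

```python
# Sketch: point supervision == sampling the pixelwise loss at the labeled pixels only.
import torch
import torch.nn.functional as F

scores = torch.randn(1, 21, 500, 500, requires_grad=True)
labels = torch.full((1, 500, 500), 255, dtype=torch.long)   # 255 = no annotation
labels[0, 120, 300] = 12                                    # one annotated point (hypothetical class)
labels[0, 400, 80] = 7                                      # another point label
loss = F.cross_entropy(scores, labels, ignore_index=255)    # averaged over labeled pixels only
loss.backward()
```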