Deep Residual Learning for Image Recognition*
Wei-Pang Jan, Xuanqing Liu
* Most of the figures/tables are credited to He et al. Deep Residual Learning for Image Recognition, in CVPR 2016.
Motivation
Revolution of Depth and Complexity
Is a deeper network better at learning?
Gradient Vanishing/Exploding: as networks get deeper, backpropagated gradients can shrink or blow up exponentially.
http://neuralnetworksanddeeplearning.com/chap5.html
Batch Normalization
Prevents the gradient at each iteration from becoming too large or too small.
Ioffe et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, in ICML 2015.
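As a sketch of how it is used in practice, batch normalization typically sits between a convolution and its activation (PyTorch shown; the channel count is illustrative):

```python
import torch.nn as nn

# Conv -> BatchNorm -> ReLU: BN normalizes each channel over the batch,
# keeping activations (and hence gradients) in a reasonable range.
block = nn.Sequential(
    nn.Conv2d(64, 64, kernel_size=3, padding=1, bias=False),  # bias is redundant before BN
    nn.BatchNorm2d(64),
    nn.ReLU(inplace=True),
)
```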
Is a deeper network better at learning?
Even with batch normalization, deeper plain networks can show higher training error than shallower ones (the degradation problem).
ResNet Intuitions
Identity Mapping
If the “extra” layers are identity functions, the deeper network should perform at least as well as the shallower one.
Residual Learning (Plain Net)
A plain network learns the desired underlying mapping H(x) directly.
Residual Learning
Instead of fitting H(x) directly, the stacked layers fit the residual F(x) = H(x) - x, and the block outputs y = F(x) + x.
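A minimal sketch of such a block in PyTorch, assuming the paper's basic two-layer 3x3 design (class and variable names are illustrative):

```python
import torch.nn as nn

class BasicBlock(nn.Module):
    """Residual block: output = F(x) + x, so the layers fit F(x) = H(x) - x."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)  # identity shortcut, added before the final ReLU
```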
Residual Learning - Matching Dimensions
The shortcut requires x and F(x) to have the same shape. When the input/output channels don't match, the shortcut applies a linear transform Wx (a projection): y = F(x) + Wx.
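When shapes differ (e.g., a stage that halves the spatial size and doubles the channels), a strided 1x1 convolution is the usual choice for W; a sketch under those assumptions:

```python
import torch.nn as nn

in_ch, out_ch, stride = 64, 128, 2  # illustrative sizes

# Projection shortcut: a strided 1x1 conv matches both the channel count
# and the spatial resolution of F(x), implementing y = F(x) + Wx.
projection = nn.Sequential(
    nn.Conv2d(in_ch, out_ch, kernel_size=1, stride=stride, bias=False),
    nn.BatchNorm2d(out_ch),
)
```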
Shortcuts
- Feed low-level features forward to deeper layers
  - Feature reuse
  - Reduces the number of parameters
- Resolve vanishing gradients
  - y = f(x) vs. y = f(x) + x
Resolving the Gradient Vanishing Problem
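A one-line sketch of why the identity path helps (L denotes the loss, I the identity matrix):

```latex
y = f(x) + x
\quad\Longrightarrow\quad
\frac{\partial L}{\partial x}
  = \frac{\partial L}{\partial y}\left(\frac{\partial f}{\partial x} + I\right)
  = \frac{\partial L}{\partial y}\,\frac{\partial f}{\partial x} + \frac{\partial L}{\partial y}
```

The second term passes the upstream gradient through unchanged, so the gradient cannot vanish even when ∂f/∂x is small.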
Bottleneck Architectures
Compress and then expand the channels through 1x1 convolutions, so that the expensive 3x3 convolution runs on fewer channels.
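A sketch of the bottleneck layout, assuming the paper's 1x1 → 3x3 → 1x1 design with a 4x reduction (the residual addition happens outside this stack):

```python
import torch.nn as nn

def bottleneck(channels, reduced):
    """1x1 reduce -> 3x3 -> 1x1 expand: the 3x3 conv sees only `reduced` channels."""
    return nn.Sequential(
        nn.Conv2d(channels, reduced, 1, bias=False),            # compress
        nn.BatchNorm2d(reduced), nn.ReLU(inplace=True),
        nn.Conv2d(reduced, reduced, 3, padding=1, bias=False),  # spatial conv
        nn.BatchNorm2d(reduced), nn.ReLU(inplace=True),
        nn.Conv2d(reduced, channels, 1, bias=False),            # expand
        nn.BatchNorm2d(channels),
    )

# e.g. bottleneck(256, 64): the 3x3 conv runs on 64 channels instead of 256
```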
Experiments
Architecture
ImageNet Experiment Results
CIFAR-10 Experiment Results
Identity vs. Projection Shortcuts
Result Comparison on ImageNet
Model Size
Strength & Weakness● Make super deep networks possible to train and generalize well ☺● Speed-up convergence ☺● Only consider about the depth, ignoring width
Extension - ResNeXt
Replaces the bottleneck's 3x3 convolution with a grouped convolution, aggregating many parallel residual paths and introducing cardinality as a dimension besides depth and width.
Xie et al. Aggregated Residual Transformations for Deep Neural Networks, in CVPR 2017.
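A minimal sketch of the change, assuming cardinality 32 as in the paper's ResNeXt-50 (32x4d) setting:

```python
import torch.nn as nn

# Grouped 3x3 conv inside the bottleneck: 32 parallel transformations
# of 4 channels each, i.e. 32 aggregated residual paths.
grouped = nn.Conv2d(128, 128, kernel_size=3, padding=1, groups=32, bias=False)
```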
Extension - DenseNet
Connects each layer to every later layer by concatenating feature maps, pushing shortcut-based feature reuse to the extreme.
Huang et al. Densely Connected Convolutional Networks, in CVPR 2017.
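Where ResNet adds the shortcut, DenseNet concatenates; a sketch of one step inside a dense block (the growth rate of 32 is illustrative):

```python
import torch
import torch.nn as nn

growth = 32  # channels each layer contributes ("growth rate")

def dense_layer(in_ch):
    return nn.Sequential(
        nn.BatchNorm2d(in_ch), nn.ReLU(inplace=True),
        nn.Conv2d(in_ch, growth, 3, padding=1, bias=False),
    )

def dense_step(x, layer):
    # New features are concatenated onto the input along the channel axis,
    # so every later layer sees all earlier feature maps directly.
    return torch.cat([x, layer(x)], dim=1)
```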