SSD: Single Shot MultiBox Detector

Presenters: Hongjing Zhang, Chen Zhang

Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg

Table of contents

1. Review of Faster R-CNN

2. Model of SSD
   a. Multi-scale feature map
   b. Bounding boxes
   c. Network architecture

3. Training of SSD
   a. Matching strategy
   b. Loss function
   c. Hard negative mining
   d. Data augmentation

4. Experiments

Performance On VOC2007

Source: http://www.cs.unc.edu/~wliu/papers/ssd_eccv2016_slide.pdf

Quick Review of Faster R-CNN

Faster R-CNN = Fast R-CNN + Region Proposal Network

Source: Faster R-CNN paper by S. Ren

Region Proposal Network: k anchor boxes of different scales and aspect ratios at each position.

SSD: Multi-scale Feature Map

Feature maps from different conv layers of different sizes.

conv kernel

default box

Effects on mAP when using different feature maps.

Anchor box setting comparison: SSD vs. Faster R-CNN

Anchor box setting: aspect ratios

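Default Bounding Boxes - scale and shape

Scale of default boxes computed as a linear function of the feature map index $k$, for $m$ feature maps:

$s_k = s_{min} + \frac{s_{max} - s_{min}}{m - 1}(k - 1), \quad k \in [1, m]$

$s_{min} = 0.2$, $s_{max} = 0.9$

Aspect ratio: $a_r \in \{1, 2, 3, \frac{1}{2}, \frac{1}{3}\}$, giving width $w_k^a = s_k \sqrt{a_r}$ and height $h_k^a = s_k / \sqrt{a_r}$

One extra default case: for $a_r = 1$, an additional box with scale $s'_k = \sqrt{s_k s_{k+1}}$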
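The scale and shape rules for default boxes can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function names are hypothetical, the aspect-ratio set and the extra box of scale sqrt(s_k * s_{k+1}) follow the SSD paper, and m = 6 feature maps is assumed only for the example.

```python
import math

def default_box_scales(m, s_min=0.2, s_max=0.9):
    """Scales s_1..s_m, spaced linearly between s_min and s_max."""
    if m == 1:
        return [s_min]
    step = (s_max - s_min) / (m - 1)
    return [s_min + step * (k - 1) for k in range(1, m + 1)]

def default_box_shapes(s_k, s_next,
                       aspect_ratios=(1.0, 2.0, 3.0, 1.0 / 2.0, 1.0 / 3.0)):
    """(width, height) pairs relative to the image size for one feature map."""
    # One box per aspect ratio a: width = s_k * sqrt(a), height = s_k / sqrt(a).
    shapes = [(s_k * math.sqrt(a), s_k / math.sqrt(a)) for a in aspect_ratios]
    # Extra square box for aspect ratio 1, with scale sqrt(s_k * s_{k+1}).
    s_extra = math.sqrt(s_k * s_next)
    shapes.append((s_extra, s_extra))
    return shapes

# Example with m = 6 feature maps (an assumption for illustration).
scales = default_box_scales(6)
shapes = default_box_shapes(scales[0], scales[1])
```

How s_{k+1} is chosen for the last feature map is left to the caller here, since the slides do not say.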

Default Bounding Boxes: Why small boxes in large feature maps?

Source: https://towardsdatascience.com/understanding-ssd-multibox-real-time-object-detection-in-deep-learning-495ef744fab

Default Bounding Boxes: Why small boxes in large feature maps?

● Large feature map: small receptive field, suited to small objects

● Small feature map: large receptive field, suited to large objects

Convolutional predictors for detection

3 x 3 x p kernel

Each box:

● cls: # classes (C1, C2, …, Cp)

● reg: 4 parameters delta(cx, cy, w, h)

# conv kernels: (Classes+4) x (# Default Boxes)
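As a rough check of the kernel count, the sketch below computes the number of 3 x 3 filters per feature map location and the total number of default boxes. The function names are hypothetical. The feature map sizes and boxes per location are the SSD300 values reported in the SSD paper, not figures from these slides, and 21 classes assumes the 20 VOC categories plus background.

```python
def num_predictor_filters(num_classes, boxes_per_location):
    """Filters needed at one feature map: (classes + 4) per default box."""
    return (num_classes + 4) * boxes_per_location

def total_default_boxes(feature_map_sizes, boxes_per_location):
    """Total default boxes over all square feature maps."""
    return sum(f * f * k for f, k in zip(feature_map_sizes, boxes_per_location))

# SSD300 configuration as reported in the SSD paper (assumed here).
sizes = [38, 19, 10, 5, 3, 1]   # spatial size of each output feature map
boxes = [4, 6, 6, 6, 4, 4]      # default boxes per location

filters_for_six_boxes = num_predictor_filters(21, 6)   # 21 = 20 VOC classes + background
total_boxes = total_default_boxes(sizes, boxes)
```

With these values the total comes to 8732 default boxes per image, which matches the figure quoted in the paper for SSD300.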

SSD Network Structure

SSD architecture taken from the original paper

VGG-16 network: by Oxford's Visual Geometry Group (VGG)

Source: https://blog.heuritech.com/2016/02/29/a-brief-report-of-the-heuritech-deep-learning-meetup-5/

● 3*3 conv kernel

● 2*2 pooling with stride = 2

Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." arXiv preprint arXiv:1409.1556 (2014).

Bounding Box Matching Strategy

● Jaccard Overlap (Intersection over Union)

● Matching default boxes to ground truth boxes with IoU > threshold(0.5)
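The threshold rule can be sketched as follows. This is a simplified illustration with hypothetical function names: boxes are assumed to be (xmin, ymin, xmax, ymax) tuples, and when several ground truth boxes exceed the threshold the sketch keeps the one with the highest overlap. It covers only the threshold rule on the slide and leaves out the paper's preliminary step of matching each ground truth box to its best-overlapping default box.

```python
def iou(box_a, box_b):
    """Jaccard overlap of two boxes given as (xmin, ymin, xmax, ymax)."""
    inter_w = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    inter_h = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    inter = inter_w * inter_h
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def match_default_boxes(default_boxes, gt_boxes, threshold=0.5):
    """For each default box, the index of its matched ground truth box, or -1."""
    matches = []
    for d in default_boxes:
        best_j, best_iou = -1, threshold
        for j, g in enumerate(gt_boxes):
            overlap = iou(d, g)
            if overlap > best_iou:   # must exceed the threshold to match
                best_j, best_iou = j, overlap
        matches.append(best_j)
    return matches

defaults = [(0.0, 0.0, 1.0, 1.0), (0.5, 0.5, 1.5, 1.5), (0.0, 0.0, 0.9, 1.0)]
ground_truth = [(0.0, 0.0, 1.0, 1.0)]
matches = match_default_boxes(defaults, ground_truth)
```

Default boxes with a match index of -1 are treated as negatives during training.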

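Training Objective

● After pairing ground-truth and default boxes, we can write the objective function:

$L(x, c, l, g) = \frac{1}{N}\left(L_{conf}(x, c) + \alpha L_{loc}(x, l, g)\right)$

$x_{ij}^p \in \{1, 0\}$: indicator for matching the i-th default box to the j-th ground truth box of category p.

● N: number of matched default boxes

● c: class confidences

● l: predicted bounding box

● g: ground truth bounding box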

Training Objective
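For reference, the two terms of the objective, the localization loss and the confidence loss, can be written out as below. This follows the definitions in the SSD paper rather than anything recoverable from the slide text. The symbol $d$ for the default box and the encoded targets $\hat{g}$ do not appear on the slides and are taken from the paper.

```latex
% Localization loss: Smooth L1 between predicted offsets l and encoded targets \hat{g}
L_{loc}(x, l, g) = \sum_{i \in Pos}^{N} \sum_{m \in \{cx, cy, w, h\}}
    x_{ij}^{k} \, \mathrm{smooth}_{L1}\!\left(l_i^m - \hat{g}_j^m\right)

% Targets are encoded relative to the default box d
\hat{g}_j^{cx} = \frac{g_j^{cx} - d_i^{cx}}{d_i^{w}}, \qquad
\hat{g}_j^{cy} = \frac{g_j^{cy} - d_i^{cy}}{d_i^{h}}, \qquad
\hat{g}_j^{w} = \log\frac{g_j^{w}}{d_i^{w}}, \qquad
\hat{g}_j^{h} = \log\frac{g_j^{h}}{d_i^{h}}

% Confidence loss: softmax loss over class confidences c (class 0 is background)
L_{conf}(x, c) = - \sum_{i \in Pos}^{N} x_{ij}^{p} \log\left(\hat{c}_i^{p}\right)
                 - \sum_{i \in Neg} \log\left(\hat{c}_i^{0}\right),
\qquad
\hat{c}_i^{p} = \frac{\exp(c_i^{p})}{\sum_{p} \exp(c_i^{p})}
```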

Hard Negative Mining

● Instead of using all the negative examples, we sort them by the confidence loss of each default box, highest first, and pick the top ones.

● The ratio between negative examples and positive examples is at most 3:1.

● This leads to faster optimization and more stable training.
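The selection step can be sketched in plain Python. This is an illustrative version with a hypothetical function name, assuming a per-box confidence loss has already been computed and that positives are flagged with booleans. Real implementations usually do this per image and on tensors.

```python
def hard_negative_mining(conf_loss, is_positive, neg_pos_ratio=3):
    """Indices of the negatives kept for the loss, hardest first."""
    num_pos = sum(is_positive)
    negatives = [i for i, pos in enumerate(is_positive) if not pos]
    # Keep at most neg_pos_ratio negatives per positive.
    num_neg = min(neg_pos_ratio * num_pos, len(negatives))
    # Sort negatives by confidence loss, highest (hardest) first.
    negatives.sort(key=lambda i: conf_loss[i], reverse=True)
    return negatives[:num_neg]

conf_loss = [0.1, 2.0, 0.5, 3.0, 0.2, 1.0]
is_positive = [True, False, False, False, False, False]
kept = hard_negative_mining(conf_loss, is_positive)
```

With one positive and a 3:1 ratio, the three negatives with the largest confidence loss are kept and the rest are ignored for this step.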

Data Augmentation

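● To make the model more robust to various input object sizes and shapes, each training image is randomly sampled by one of the following options:

1. Use the entire original input image.

2. Sample a patch so that the minimum Jaccard overlap with the objects is 0.1, 0.3, 0.5, 0.7, or 0.9.

3. Randomly sample a patch.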
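A rough sketch of this sampling scheme follows. Several details are assumptions rather than something stated on the slide: the options are drawn uniformly from a flat list, patch side lengths are sampled in [0.1, 1] of the image with aspect ratio between 1/2 and 2 (the ranges given in the SSD paper), coordinates are relative to the image, and the function falls back to the whole image after a fixed number of failed trials. The function names are hypothetical.

```python
import random

def iou(box_a, box_b):
    """Jaccard overlap of two boxes given as (xmin, ymin, xmax, ymax)."""
    inter_w = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    inter_h = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    inter = inter_w * inter_h
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def sample_patch(gt_boxes, rng, max_trials=50):
    """Return a crop (xmin, ymin, xmax, ymax) in relative image coordinates."""
    whole_image = (0.0, 0.0, 1.0, 1.0)
    # None: keep the original image. 0.0: unconstrained random patch.
    # Other values: minimum Jaccard overlap required with every object.
    min_overlap = rng.choice([None, 0.1, 0.3, 0.5, 0.7, 0.9, 0.0])
    if min_overlap is None:
        return whole_image
    for _ in range(max_trials):
        w = rng.uniform(0.1, 1.0)
        h = rng.uniform(0.1, 1.0)
        if not 0.5 <= w / h <= 2.0:
            continue  # reject patches outside the allowed aspect ratio range
        x = rng.uniform(0.0, 1.0 - w)
        y = rng.uniform(0.0, 1.0 - h)
        patch = (x, y, x + w, y + h)
        if all(iou(patch, g) >= min_overlap for g in gt_boxes):
            return patch
    return whole_image  # fallback when no valid patch was found

rng = random.Random(0)
patches = [sample_patch([(0.3, 0.3, 0.7, 0.7)], rng) for _ in range(200)]
```

After cropping, the patch would still need to be resized to the network input size and the ground truth boxes clipped to it, which this sketch leaves out.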

Experiments

Table from paper SSD

PASCAL VOC2007 test detection results.

● mAP: SSD > Faster R-CNN > Fast R-CNN

● Resolution: mAP of SSD512 > SSD300

● More training data is better (07+12+COCO)

Experiments

● Data augmentation

● More default box shapes are better

Table from Paper SSD

Effects of various design choices and components on SSD performance.

Experiments

Effects of using multiple output layers.

Multiple Output Layers Are Better

Experiments

Plot from paper SSD

Visualization of performance for SSD512 on animals, vehicles and furniture from VOC2007 test.

Experiments

Visualization of performance for SSD512 on animals: the cumulative fraction of detections.

Plot from paper SSD

Experiments

Visualization of performance for SSD512 on animals: the distribution of top-ranked false positive types.

Plot from paper SSD

Experiments

Inference time (results on VOC2007 test).

Detection Results

Images from paper SSD

Strengths

● High speed

● High accuracy

● Simple training (single shot)

Drawbacks

● The classification task for small objects is relatively hard for SSD.

Questions

Atrous Algorithm (Dilated Convolution)

Receptive Field

L2 Normalization on Conv4

Figure from paper ParseNet

Why Conv6, Conv7?

About 1x1 conv kernels

In the GoogLeNet architecture, 1x1 convolutions are used for two purposes:

● To reduce the dimensions inside the "inception module".

● To add more non-linearity by having a ReLU immediately after every 1x1 convolution.

Experiments

Sensitivity and impact of different object characteristics on VOC2007 test set.

Plot from paper SSD

Object Detection Leaderboard (VOC2012)
