Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale...

16
Hierarchical Bilinear Network for High Performance Face Detec7on Jiangjing Lv, Xiaohu Shao, Junliang Xing, Pengcheng Liu, Xiangdong Zhou, Xi Zhou

Transcript of Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale...

Page 1: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

HierarchicalBilinearNetworkforHighPerformanceFaceDetec7on

JiangjingLv,XiaohuShao,JunliangXing,PengchengLiu,XiangdongZhou,XiZhou

Page 2: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Background

•  Tradi7onalfacedetec7on– Real-7me– Limitedtosevererviewvaria7ons

•  DeepConvolu7onalNetwork(DCN)basedfacedetec7on– Highperformance– Sufferfromhighcomplexityandlargesizeofmodel

Page 3: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Mo7va7on

•  BackboneNetwork– Arbitraryresolu7onasinput– Featurepyramidformul7-scalefacedetec7on

•  BilinearNetwork– Confidencesub-networkpredictsfaceconfidence– Localiza7onsub-networkregressesthefaceboundingboxes

– Weightssharing(7nymodelsize)

Page 4: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Conv1 Pool1

SubNetConf3 SubNetLoc3

···

BackboneNetwork

BilinearNetwork

Conv2 Pool2

Conv3 Pool3 Conv4 Pool4 ConvN

SubNetConf4 SubNetLoc4

BilinearNetwork

SubNetConfN SubNetLocN

BilinearNetwork

···

···

input

output

TheframeworkofHBN

Page 5: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

BackboneNetwork

•  CH is channel number, default set to 48.

•  Total 45K parameters.

•  Spatial resolution is gradually reduced.

Layer Name

Filter Size Stride Pad Parameter Number

Conv1 16 × 5 × 5 2 2 1.2K

Pool1 2×2 2 0 0

Conv2 24 × 3 × 3 1 1 3.4K

Pool2 2×2 2 0 0

Conv3 CH × 3 × 3 1 1 10.1K

Pool3 2×2 2 0 0

Conv4 CH × 3 × 3 1 1 10.1K

Pool4 2×2 2 0 0

Conv5 CH × 3 × 3 1 1 10.1K

Pool5 2×2 2 0 0

Conv6 CH × 3 × 3 1 1 10.1K

Page 6: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Layer Conv3 Conv4 Conv5 Conv6

Scale 122 242 482 962 1442 1922 2882 3842

Proposal

12×12 24×24 48×48 96×96 144×144 192×192 288×288 384×384

8×17 17×34 34×68 68×136 102×204 136×272 204×407 272×543

17×8 34×17 68×34 136×68 204×102 272×136 407×204 543×272

BackboneNetwork

•  Referenceboundingboxes– aspectra7os{1,1/2,2}

•  FaceSize:– From8pixelsto543pixels

Page 7: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

BilinearNetwork

•  Confidencesub-networks–  Classifica7on–  Incep&on:Contextualinforma7onwithdifferentresolu7ons

–  SoZMaxloss

•  LocalizaAonSub-network–  3×3convolu7onallayerisusedtopredictoffsetinforma7on

–  SmoothL1loss

Page 8: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Experiments

•  ExperimentalseEngs– Caffepla]orm– WIDERFACEtrainingdataset– Hardnega7vemining

•  Thera7obetweenthenega7vesandposi7vesis3:1– Dataaugmenta7on

•  Scale,Blur,Noises,Mirrorflip

–  etc.

Page 9: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

ModelAnalysesModel Conv3 Conv4 Conv5 Conv6

Baseline C C C C

HBN-1 I+C C C C

HBN-2 I+C I+C C C

HBN-3 I+C I+C I+C C

HBN-4 I+C I+C I+C I+C

The configuration of different confidence sub-networks, I and C

represent the Inception module and the convolutional layer

respectively

Page 10: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

0 200 400 600 800 1000 1200 1400 1600 1800 20000.7

0.75

0.8

0.85

0.9

0.95

1

False positive

True

pos

itive

rate

BaselineHBN−1HBN−2HBN−3HBN−4

0 200 400 600 800 1000 1200 1400 1600 1800 20000.7

0.75

0.8

0.85

0.9

0.95

1

False positive

True

pos

itive

rate

BaselineHBN−1HBN−2HBN−3HBN−4

(a)CH=48 (b)CH=64

Baseline HBN-1 HBN-2 HBN-3 HBN-4

Page 11: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Modelsizesandspeeds Model Baseline HBN-1 HBN-2 HBN-3 HBN-4

CH=48

CS(MB) 0.6 1.3 2.0 2.7 3.7

TS(MB) 0.4 1.1 1.1 1.1 1.1

Speed(FPS) 110 79 72 67 62

CH=64

CS(MB) 0.8 1.5 2.2 3.0 3.7

TS(MB) 0.6 1.3 1.3 1.3 1.3

Speed(FPS) 97 71 69 65 58

CSandTSrepresentthesizeofmodelstoragedbytheCaffepla]ormandthetheore7calsizeofmodelrespec7vely.

Page 12: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Comparisonwithothermethods

0 200 400 600 800 1000 1200 1400 1600 1800 20000.7

0.75

0.8

0.85

0.9

0.95

1

False positive

True

pos

itive

rate

HBNDP2MFDFacenessCascadeCNNCCFACF

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10

0.10.20.30.40.50.60.70.80.9

1

Recall

Prec

ision

Multitask Cascade CNN−0.848HBN−0.794Faceness−WIDER−0.713Multiscale Cascade CNN−0.691Two−stage CNN−0.681ACF−WIDER−0.659

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10

0.10.20.30.40.50.60.70.80.9

1

Recall

Prec

ision

Multitask Cascade CNN−0.825HBN−0.739Multiscale Cascade CNN−0.664Faceness−WIDER−0.634Two−stage CNN−0.618ACF−WIDER−0.541

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10

0.10.20.30.40.50.60.70.80.9

1

Recall

Prec

ision

Multitask Cascade CNN−0.598HBN−0.554Multiscale Cascade CNN−0.424Faceness−WIDER−0.345Two−stage CNN−0.323ACF−WIDER−0.273

(a)FDDB (b)EasySet

(c)MediumSet (d)HardSet

Page 13: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Examplesofourfacedetec7onmethod

Page 14: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Examplesofourfacedetec7onmethod

Page 15: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Conclusion

•  End-to-endfacedetec7onmethodfordifferentscalefaces.

•  Tinymodelsizebyweightssharing.•  Fastfacedetec7on.•  Promisingresults.

Page 16: Hierarchical Bilinear Network for High Performance Face ... · – Feature pyramid for mul7-scale face detec7on • Bilinear Network – Confidence sub-network predicts face confidence

Thank You!

[email protected]