Probing the Structure of Deep Neural Networks with ...

29
Probing the Structure of Deep Neural Networks with Universal Adversarial Perturbations Sanjit Bhat (Acton-Boxborough RHS), Mentor: Dimitris Tsipras (MIT) PRIMES Conference, May 18, 2019 1

Transcript of Probing the Structure of Deep Neural Networks with ...

Page 1: Probing the Structure of Deep Neural Networks with ...

Probing the Structure of Deep Neural Networks with Universal Adversarial PerturbationsSanjit Bhat (Acton-Boxborough RHS), Mentor: Dimitris Tsipras (MIT)PRIMES Conference, May 18, 2019

1

Page 2: Probing the Structure of Deep Neural Networks with ...

Acknowledgements

Thank you to:

● My parents● Dimitris Tsipras, for the useful discussions and guidance● The Madry Lab, for making me feel at home at MIT● Prof. Srini Devadas, for the PRIMES CS track● Dr. Slava Gerovitch, for the PRIMES program

2

Page 3: Probing the Structure of Deep Neural Networks with ...

Introduction

3

Page 4: Probing the Structure of Deep Neural Networks with ...

Deep Learning (DL) can surpass humans

4

Page 5: Probing the Structure of Deep Neural Networks with ...

DL in security-critical applications

5

Page 6: Probing the Structure of Deep Neural Networks with ...

Is DL ready for this?

6

Page 7: Probing the Structure of Deep Neural Networks with ...

Deep Neural Network (DNN) - Natural Setting

7V. Fischer, M. Kumar, J. Metzen, T. Brox“Adversarial Examples for Semantic Image Segmentation”

Page 8: Probing the Structure of Deep Neural Networks with ...

DNN - Adversarial Setting

8

Page 9: Probing the Structure of Deep Neural Networks with ...

Why do we need robust DNNs?

Alignment with human intelligence

● Goal of ML: Make intelligent systems

● Most humans wouldn’t get fooled, but these systems do

9

Robustness to real-world perturbs

● Some natural perturbations (e.g., rain) can trick classifiers

● Train models that are more reliable in the natural world

Page 10: Probing the Structure of Deep Neural Networks with ...

Background

10

Page 11: Probing the Structure of Deep Neural Networks with ...

How do we train robust DNNs?

11

Page 12: Probing the Structure of Deep Neural Networks with ...

Adversarial Training - A robust training methodNatural Training Set

Model Parameters12

Page 13: Probing the Structure of Deep Neural Networks with ...

Adversarial Training - A robust training methodAdversarial Training Set

Model Parameters13

Page 14: Probing the Structure of Deep Neural Networks with ...

Universal Adversarial Perturbations (UAPs)

14

Regular Adversarial Perturbations

● Image-specific (one perturbation per image)

● Stronger, more targeted

UAPs

● Class-specific (one perturbation for all images in a particular class)

● More general● Location-invariant

Page 15: Probing the Structure of Deep Neural Networks with ...

Goal: Use UAPs to study the general dynamics of Adversarial Training

15

Page 16: Probing the Structure of Deep Neural Networks with ...

Methodology

16

Page 17: Probing the Structure of Deep Neural Networks with ...

UAP Generation

Averaging

● Simplest, most obvious method

Singular Value Decomposition (SVD)

● Goal: Explain away variance● Inputs: Data● Outputs: Vectors that explain the

most variance in data (eigenvectors) and their associated eigenvalues

17

Page 18: Probing the Structure of Deep Neural Networks with ...

UAP Generation Cont.

18

● Pre-trained natural and adversarial models from Madry et al.● UAPs generated and evaluated on MNIST (handwriting recognition)

and CIFAR-10 (image recognition) test sets● Focus on adversaries bounded in L2 norm - more interpretable

perturbations

Page 19: Probing the Structure of Deep Neural Networks with ...

Experiments

19

Page 20: Probing the Structure of Deep Neural Networks with ...

Adversarial Training Induces More Human-Interpretable Features

20

Page 21: Probing the Structure of Deep Neural Networks with ...

MNIST

21

Naturally Trained Adversarially Trained

Page 22: Probing the Structure of Deep Neural Networks with ...

CIFAR-10

22

Naturally Trained Adversarially Trained

Page 23: Probing the Structure of Deep Neural Networks with ...

Multiple UAP Directions Exist for MNIST

23

Page 24: Probing the Structure of Deep Neural Networks with ...

The Eigenvalue Spectra

24

MNIST

● No large drop● Multiple universal directions● Cause: Linear separability

CIFAR-10

● Order of magnitude drop● One main universal direction● Cause: No linear separability,

images mesh together

2.7 2.5 2.1 2.0 1.8 54.4 4.5 4.1 3.4 3.0

Page 25: Probing the Structure of Deep Neural Networks with ...

Adversarial Training Causes Local Loss Landscape Smoothening

25

Page 26: Probing the Structure of Deep Neural Networks with ...

Optimization Trajectories - MNIST

26

Naturally Trained Adversarially Trained

Page 27: Probing the Structure of Deep Neural Networks with ...

Optimization Trajectories - CIFAR-10

27

Naturally Trained Adversarially Trained

Page 28: Probing the Structure of Deep Neural Networks with ...

Distribution of Cos Similarities for Single Image

Naturally Trained Adversarially Trained

28

Page 29: Probing the Structure of Deep Neural Networks with ...

Thank You!

Questions?

29