
With Great Training Comes Great Vulnerability: Practical Attacks against Transfer Learning

Bolun Wang*, Yuanshun Yao, Bimal Viswanath§

Haitao Zheng, Ben Y. Zhao

University of Chicago, * UC Santa Barbara, § Virginia Tech

bolunwang@cs.ucsb.edu

Deep Learning is Data Hungry

• High-quality models are trained using large labeled datasets
• Vision domain: ImageNet contains over 14 million labeled images


Where do small companies get such large datasets?

A Prevailing Solution: Transfer Learning


[Diagram] Company X: limited training data + a highly-trained Teacher model → a high-quality Student model (Students A, B, C)

• Teacher → Student: transfer and re-use the pre-trained model
• Recommended by Google, Microsoft, and Facebook DL frameworks

Deep Learning 101

Photo credit: Google

Transfer Learning: Details


[Diagram] Teacher: Input → N layers → Output, ending in a Teacher-specific classification layer. Student: Input → the same first N − 1 layers → a Student-specific classification layer → Output.

• In general, the first K layers can be directly transferred (K = N − 1)
• Insight: high-quality features can be re-used
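As a concrete illustration of this transfer step, here is a minimal tf.keras sketch. It uses VGG16 as a stand-in Teacher (the talk's example uses VGG-Face, which is not bundled with Keras); the 65-class Student head, image size, and training settings are illustrative assumptions rather than the paper's exact configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

NUM_CLASSES = 65  # Student task in the example slide: recognize 65 people

# Teacher: a pre-trained network whose first K layers are re-used as-is.
# (VGG16 stands in here for VGG-Face.)
teacher = tf.keras.applications.VGG16(
    weights="imagenet", include_top=False, input_shape=(224, 224, 3))
teacher.trainable = False  # freeze the transferred layers (K = N - 1)

# Student: transferred Teacher layers + a new Student-specific classification layer.
student = models.Sequential([
    teacher,
    layers.Flatten(),
    layers.Dense(NUM_CLASSES, activation="softmax"),  # Student-specific head
])

student.compile(optimizer="adam",
                loss="categorical_crossentropy",
                metrics=["accuracy"])
# student.fit(x_train, y_train, epochs=10)  # only the new head is trained
```

Freezing the copied layers corresponds to the "transfer the first K = N − 1 layers" setup above; only the new classification layer is trained on the Student's small dataset.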

Transfer Learning: Example

• Face recognition: recognize faces of 65 people


• Teacher (VGG-Face): 2,622 people, 900 images/person
• Student (Company X): 65 people, 10 images/person; transfer 15 out of 16 layers

Classification Accuracy
Without Transfer Learning: 1%
With Transfer Learning: 93.47%

Is Transfer Learning Safe?

• Transfer Learning lacks diversity
• Users have very limited choices of Teacher models


[Diagram] Company A and Company B both build Students from the same Teacher; knowledge of that shared Teacher helps the attacker exploit all Student models.

In This Talk

• Adversarial attack in the context of Transfer Learning

• Impact on real DL services

• Defense solutions


Background: Adversarial Attack

• Adversarial attack: misclassify inputs by adding a carefully engineered perturbation


[Figure] Original image + imperceptible perturbation → misclassified as the target

Attack Models of Prior Adversarial Attacks

• White-box attack: assumes full access to model internals
• Find the optimal perturbation offline
• → Not practical

• Black-box attack: assumes no access to model internals
• Repeated queries to reverse engineer the victim
• Test intermediate results and improve
• → Easily detected

Our Attack Model

• We propose a new adversarial attack targeting Transfer Learning

• Attack model


• Teacher: white-box (model internals are known to the attacker)
• Student: black-box (model internals are hidden and kept secure)

Default access model today:
• Teachers are made public by popular DL services
• Students are trained offline and kept secret

Attack Methodology: Neuron Mimicry


[Diagram] A perturbation ∆ is added to the source image so that its representation at layer K of the Teacher matches the target image's; since the Student's first K layers are the same as the Teacher's, the Student then classifies the perturbed source as the target.

Key insight: if two inputs match at layer K, they produce the same result regardless of changes above layer K.

How to Compute Perturbation?

• Compute the perturbation (∆) by solving an optimization problem
• Goal: mimic the hidden-layer representation
• Constraint: the perturbation should be indistinguishable by humans

x_s: source image, x_t: target image

$$\min_{\Delta}\; D\!\left(T_K(x_s + \Delta),\, T_K(x_t)\right) \qquad \text{s.t.}\quad \mathrm{perturb\_magnitude}(x_s + \Delta,\, x_s) < P_{budget}$$

• Minimize the L2 distance D between internal representations
• Constrain the perturbation; DSSIM: an objective measure for image distortion
• T_K(x): internal representation at layer K of image x
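As a rough illustration of how this optimization might be solved, here is a tf.keras sketch that enforces the DSSIM budget with a penalty term instead of a hard constraint. `teacher_cutoff` is assumed to be a model exposing the Teacher's layer-K representation T_K; the optimizer choice, budget, penalty weight, and step count are illustrative and not the paper's exact settings.

```python
import tensorflow as tf

def craft_adversarial(teacher_cutoff, x_source, x_target,
                      budget=0.003, weight=1e4, steps=500, lr=0.01):
    """Return x_source + Delta whose layer-K representation mimics x_target's."""
    target_repr = teacher_cutoff(x_target)         # T_K(x_t), held fixed
    delta = tf.Variable(tf.zeros_like(x_source))   # perturbation Delta
    opt = tf.keras.optimizers.Adam(learning_rate=lr)

    for _ in range(steps):
        with tf.GradientTape() as tape:
            x_adv = tf.clip_by_value(x_source + delta, 0.0, 1.0)
            # Mimic the Teacher's internal representation at layer K (L2 distance).
            mimic = tf.reduce_sum(tf.square(teacher_cutoff(x_adv) - target_repr))
            # DSSIM(a, b) = (1 - SSIM(a, b)) / 2; penalize exceeding the budget.
            dssim = (1.0 - tf.image.ssim(x_adv, x_source, max_val=1.0)) / 2.0
            loss = mimic + weight * tf.reduce_sum(tf.maximum(0.0, dssim - budget))
        grads = tape.gradient(loss, [delta])
        opt.apply_gradients(zip(grads, [delta]))

    return tf.clip_by_value(x_source + delta, 0.0, 1.0)
```

The penalty term only activates once the DSSIM between the perturbed and original source image exceeds the budget, which approximates the hard constraint in the formulation above.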

Attack Effectiveness

• Targeted attack: randomly select 1,000 (source, target) image pairs

• Attack success rate: percentage of images successfully misclassified into the target


[Figure] Example source, adversarial, and target images for each task.

• Face recognition: 92.6% attack success rate
• Iris recognition: 95.9% attack success rate

Attack in the Wild

• Q1: given a Student, how to determine the Teacher?
• Craft a "fingerprint" input for each Teacher candidate (a sketch follows the diagram below)
• Query the Student to identify the Teacher among the candidates

• Q2: would the attack work on Students trained by real DL services?
• Follow tutorials to build Students using three popular DL services

• Attack achieves >88.0% success rate for all three services


[Diagram] The attacker sends a crafted fingerprint input to the Student and uses the response to determine which candidate (Teacher A or Teacher B) the Student was built from.
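The fingerprinting procedure itself is deferred to the paper; below is a conceptual sketch of the query loop, under the assumption that a helper `craft_fingerprint` produces an input tailored to one candidate Teacher (for example, one whose layer-K features are near zero, so a Student built on that Teacher responds with a near-uniform distribution) and that `query_student` returns the black-box Student's class probabilities. All names and the decision heuristic are hypothetical.

```python
import numpy as np

def identify_teacher(candidates, craft_fingerprint, query_student, x_seed):
    """Guess which candidate Teacher a black-box Student was built from.

    candidates:        dict of name -> layer-K feature extractor for each candidate
    craft_fingerprint: fn(candidate, x_seed) -> fingerprint image (hypothetical helper)
    query_student:     fn(image) -> class-probability vector from the Student
    """
    scores = {}
    for name, candidate in candidates.items():
        x_fp = craft_fingerprint(candidate, x_seed)      # tailored to this candidate
        probs = np.asarray(query_student(x_fp)).ravel()
        uniform = np.full_like(probs, 1.0 / probs.size)
        # Assumed heuristic: a Student built on this Teacher responds near-uniformly.
        scores[name] = np.abs(probs - uniform).sum()
    return min(scores, key=scores.get)
```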

In This Talk

• Adversarial attack in the context of Transfer Learning

• Impact on real DL services

• Defense solutions


Intuition: Make Student Unpredictable

• Modify the Student so its internal representation deviates from the Teacher's
• The modification should be unpredictable by the attacker → no countermeasure
• Without impacting classification accuracy


$$\min\; \mathrm{CrossEntropyLoss}(y_{true},\, y_{pred}) \qquad \text{s.t.}\quad \lVert S_K(x) - T_K(x) \rVert > D_{thresh} \;\;\text{for } x \in X_{train}$$

• Objective: maintain classification accuracy
• Constraint: guarantee a difference between the Teacher's and the Student's internal representations
• S_K(x): the Student's internal representation at layer K of image x

[Diagram] Teacher → (transfer using the updated objective function) → Robust Student
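A minimal sketch of how a Student could be trained against this updated objective, assuming tf.keras and a penalty (soft-constraint) formulation: `student_cutoff` and `teacher_cutoff` are assumed to expose the layer-K representations S_K and T_K, with `student_cutoff` sharing weights with `student`; the threshold, penalty weight, and learning rate are illustrative.

```python
import tensorflow as tf

def defense_loss(student, student_cutoff, teacher_cutoff, x, y,
                 d_thresh=1.0, weight=0.01):
    """Cross-entropy plus a penalty whenever S_K(x) stays within d_thresh of T_K(x)."""
    ce = tf.keras.losses.categorical_crossentropy(y, student(x))  # maintain accuracy
    diff = student_cutoff(x) - teacher_cutoff(x)                  # S_K(x) - T_K(x)
    diff = tf.reshape(diff, [tf.shape(diff)[0], -1])
    dist = tf.norm(diff, axis=1)
    # Soft version of the constraint: penalize representations that are still too close.
    penalty = weight * tf.maximum(0.0, d_thresh - dist)
    return tf.reduce_mean(ce + penalty)

optimizer = tf.keras.optimizers.Adam(1e-4)

def train_step(student, student_cutoff, teacher_cutoff, x, y):
    with tf.GradientTape() as tape:
        loss = defense_loss(student, student_cutoff, teacher_cutoff, x, y)
    grads = tape.gradient(loss, student.trainable_variables)
    optimizer.apply_gradients(zip(grads, student.trainable_variables))
    return loss
```

The penalty term pushes the Student's layer-K features at least `d_thresh` away from the Teacher's while the cross-entropy term keeps classification accuracy intact.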

Effectiveness of Defense

Model                                    Face Recognition    Iris Recognition
Attack Success Rate (Before Patching)    92.6%               100%
Attack Success Rate (After Patching)     30.87%              12.6%
Change of Classification Accuracy        ↓ 2.86%             ↑ 2.73%


One More Thing

• Findings disclosed to Google, Microsoft, and Facebook

• What’s not included in the talk:
• Impact of Transfer Learning approaches
• Impact of attack configurations
• Fingerprinting the Teacher
• …


Code, models, and datasets are available at https://github.com/bolunwang/translearn


Thank you!