Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the...

18
Xudong Pan 1 , Mi Zhang 1 , Duocai Wu 1 , Qifan Xiao 1 , Shouling Ji 2,3 and Min Yang 1 1. Fudan University 2. Zhejiang University 3. Ant Financial Justinian’s GAAvernor: Robust Distributed Learning with Gradient Aggregation Agent

Transcript of Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the...

Page 1: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Xudong Pan1, Mi Zhang1, Duocai Wu1, Qifan Xiao1, Shouling Ji2,3 and Min Yang1

1. Fudan University 2. Zhejiang University 3. Ant Financial

Justinian’s GAAvernor: Robust Distributed Learning with Gradient

Aggregation Agent

Page 2: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

I want to train an Enemy Detector for my

palace with Deep Learning techniques, yet I lack

training data.

G

Take 1. Justinian, a Byzantine emperor with Deep Learning knowledge, ponders.

🤪 🙂Enemy Friend

Page 3: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

I send my 3 generals to each of my borderlands

and train the enemy detector with local

data.

V0

V2

V1

Take 2. Justinian seems know the paradigm of distributed learning.

G

Page 4: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Every 10 days, they 1report me with their latest

local model updates. I 2aggregate the updates by

average and 3update my palace’s detector. They

4copy the latest params and 5resume the

training.

V0 V2V1

Take 3-A: (Naive Technical Option). Gradient Aggregation by Average

3

+ +G

Page 5: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Finally, I get my palace an effective enemy

detector, because of the ultimate loyalty of all my

generals.

Take 4-GE: Following the intuition ideally brings Justinian an effective enemy detector.

V0

3

V1

+ +G

V2

🤪

EnemyEffective!

Yes, you are an EMEMY!

Page 6: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Finally, my empire was under fire. One of my generals must have

betrayed me and reported a malicious update :(

V0 V2

G

=3

V1

+ +

🤪

Friendmisclassification

but, I’m an EMEMY!

😭

Take 4-BE: Byzantium was under fire.

Page 7: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

I aggregate the updates by “majority

voting” and update my palace’s detector! It

eliminates the malicious minority.

V0 V2

?G

V1

Take 3-B: (Mainstream Technical Options). Majority Voting

Algorithmic Voting

• Vector-wise Voting (e.g., Krum, GeoMed)Eliminate the updates which are less close to the majority in vector distance.

• Coordinate-wise Voting (e.g., Bulyan, Coord-Med)Do elimination and aggregation respectively for each coordinate.

Aggregate

Page 8: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Finally, I get my palace an effective enemy

detector, even when I later knows one of my

general is a traitor!

Take 4-GE: Majority Voting usually brings me an effective enemy detector when I have no more than 49.9% traitors.

V0 V2

G

V1

Algorithmic Voting

🤪

EnemyEffective!

Yes, you are an EMEMY!

V1/V2Aggregate

Page 9: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Finally, my empire was under fire. A

Majority of my generals all betrayed me and provided

me with a malicious voting result :(😭

Take 4-BE: Byzantium was under fire.V0 V2

G

V1

Algorithmic Voting

🤪

Friendmisclassification

No, but I’m an EMEMY!

V0/V1Aggregate

Malicious Majority can happen in e.g., Federated Learning systems.

Page 10: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

I hired a learnable GAAvernor to help

me deal with the aggregation. He adjusts the “credit” on each general

depending the situation.

V0 V2G

=V1

+ +

Take 3-C: (Our Contribution). Gradient Aggregation Agent (GAA)

Initially, 0.33 0.34 0.33

x x x

0.01 0.01 0.98

x x xLater on,

V0 V2G

=V1

+ +

How GAA becomes clever? Reinforcement Learning

Page 11: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Emperor’s GAA Spec.: Action• Parametrize Linear GAR:

(0,1,0)

(1,0,0)

(0,0,1)

•Action on Environment

“credit”

Page 12: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Emperor’s GAA Spec.: State & Policy Network

avg. pooling

Page 13: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Emperor’s GAA Spec.: Quasi-Validation (QV) Set

•Definition: GAA has a small set of data samples, which respects a similar distribution to that of the generals (i.i.d.).

•A Quasi-Validation set can be

A. An existing validation set or training set, for classical distributed learning systems (e.g., Train ResNet18 on ImageNet).

B. A public dataset from similar application domains as that of the client (e.g., in an FL Sys. among hospitals, a QV set can be medical data published for research.)

Page 14: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Emperor’s GAA Spec.: Reward

•Reward := the relative change of loss on the QV set (usually 10 samples)

•Cumulative Reward (as the learning objective):

Page 15: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Overview of the Byzantine Game

The whole system is differentiable, and thus learns effectively with back-prop.

Page 16: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Take 4-GE: GAA brings me an effective enemy detector in more cases. • 4 Benchmarks under static attacks

GAA defends static attacks, where a β ratio of generals report random updates.

Finally, I get my palace an effective enemy

detector, even when I later know only one of my

generals is loyal!

• Time Consumption for 100 Rounds

Page 17: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

More Applicable Scenarios

• Limitations: GAA may be limited when the QV set is fully known by the adversary.

A. Randomized Attack: The role of all but one generals randomly reverses at a period.

B. Pretense Attack: A β ratio of generals pretend to be loyal until the riot.

C. Adaptive Attack: The traitors have partial knowledge about the QV set and report carefully-crafted malicious gradients.

Random sampling, N-version scheme, …

Page 18: Justinian’s GAAvernor: Robust Distributed Learning …V2 V1 Take 2. Justinian seems know the paradigm of distributed learning. G Every 10 days, they 1report me with their latest

Thank you for Audience!For more details, welcome to read our paper!