A General Introduction to Tomography & Link Delay Inference with EM Algorithm

54
1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

description

A General Introduction to Tomography & Link Delay Inference with EM Algorithm. Presented by Joe, Wenjie Jiang 21/02/2004. Outline of Talk. Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link delay using EM algorithm - PowerPoint PPT Presentation

Transcript of A General Introduction to Tomography & Link Delay Inference with EM Algorithm

Page 1: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

1

A General Introduction to Tomography & Link Delay

Inference with EM Algorithm

Presented by Joe, Wenjie Jiang21/02/2004

Page 2: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

2

Outline of Talk Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link

delay using EM algorithm Conclusion

Page 3: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

3

Terminology “Tomography”

Brain Tomography

Access is difficult!

Network Tomography

Access is difficult!Vardi 1996

Page 4: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

4

Why tomography?What is the: Bandwidth? Loss rate? Link Delay? Traffic demands? Connectivity of links

in the network? (Topology Inference)

Path: a connection between two end nodes, each consisting of several links.

Link: a direct connection with no intermediate routes/hosts.

Page 5: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

5

Motivation Identify congestion points and

performance bottlenecks Dynamic routing Optimized service providing Security: detection of

anomalous/malicious behavior Capacity planning

Page 6: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

6

Why tomography - Difficulty Decentralized, heterogeneous and unregulated

nature of the internal network. No incentive for individuals to collect and

distribute these info freely. Collecting all statistics impose an impracticable

overhead expense ISP regards the statistics highly confidential Relaying measurements to decision-making

point consumes bandwidth.

Page 7: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

7

Why tomography - Solution Widespread internal

network monitoring is expensive and infeasible

Edge-based measurement and statistical analysis is practical and scalable

Page 8: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

8

Brain Tomography

Page 9: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

9

Network Tomography

Page 10: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

10

Where are you? Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link

delay using EM algorithm Conclusion

Page 11: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

11

Introduction to tomography Use a limited number of measurements to

infer network (link) performance parameters, using:-- Maximum Likelihood Estimator -- Estimation Maximization-- Bayesian Inference

and assuming a prior model. Categories of problems:

-- Link level parameter estimation-- Sender-Receiver traffic intensity.-- Topology Inference

Page 12: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

12

Introduction to tomography (2) Two forms of network tomography:

-- link-level metric estimation based on end-to-end, traffic measurements (counts of sent/received packets, time delays between sent/received packets)-- path level (sender-receiver path) traffic intensity estimation based on link-level measurements (counts of packets through nodes)

Passive or Active measurements? Multicast or Unicast?

Page 13: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

13

Problem Description To solve the linear system:

A, ө and εhave special structures. Goal: to maximize the likelihood function

Page 14: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

14

Problem Description (2)

A = routing matrix (graph) ө = packet queuing delays

for each link y = packet delays measu

red at the edge ε= noise, inherent rando

mness in traffic measurements

Statistical likelihood function

Page 15: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

15

Problem Description (3)

An virtual multicast tree with four receivers

l1

l2 l3

l4 l5 l6 l7

l1 l2 l3 l4 l5 l6 l7

Y1 Y2 Y3 Y4

Y1=X1+X2+X4

Page 16: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

16

Where are you? Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link del

ay using EM algorithm Conclusion

Page 17: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

17

Physical Topology

Measure end-to-end (from sender to receiver) delays

Page 18: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

18

Logical Topology

Logical topology is formed by considering only the branching points in the physical topology

Infer the logical link-level queuing delay distributions!

Page 19: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

19

The basic idea of internal link delay tomography

Send a back-to-back packet pair from a sender, each packet heading to a different receiver

Use the fact that delays are highly correlated on shared links

Queuing delay difference between these two end can be attributed to the unshared links

Page 20: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

20

Delay Estimation Measure end-to-end delay of packet

pairsPackets experience the same delay on link1

d2=dmin=0 d3>0 Extra delay on link 3!

Page 21: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

21

Packet-pair measurements

)()2( nx

)()2( ny )()1( ny

Key Assumptions• Fixed known routes

• Temporal independence

• Spatial independence

• Packet-pair delays are identical on share links.

N delay measurements in all

Page 22: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

22

Parameters

α1

α2α3

α4 α5

α6 α7 α8α9

αi = parameter of delay pmf on link i

Page 23: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

23

Link delay model• αi = delay pmf on link i• Link delay model could be m

ultinomial• quantized delay model: dela

y= {0, 1, 2, 3,…,L,∞}

• αi= {αi0,αi1,αi2,...,αiL,αi ∞ }

• αij=P{ delay(link i) = j }

• αi0+αi1+αi2,...,αiL+αi ∞=10

0. 020. 040. 060. 080. 10. 120. 140. 160. 180. 2

0 1 2 3 4 … … L

probabi l i ty

Page 24: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

24

Goal

);( YL

N

n

nypYL1

)( );();(

);( )( nyp is the probability of the event of n-th measurement

is the probability of the event of all measurements

Our goal: find );(maxarg

YL

Page 25: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

25

Where are you? Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link

delay using EM algorithm Conclusion

Page 26: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

26

Review of MLE (Maximum Likelihood Estimation)

Page 27: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

27

Review of MLE (Maximum Likelihood Estimation) The basic idea of MLE: God always let the event

with the biggest probability happen the most likely -- The MLE of ө is to make the sample occur the most likely

Note we assume X={x1,…xN} to be i.i.d The solution could be easy or hard depending on

the form of p(ө|X) e.g. p(ө|X) is a single Gaussian ө=(μ, σ2), we can

set the derivative of logL(ө|X) to zero and solve it directly.

Page 28: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

28

Complete Data The sample X={x1,…xN} together with the

missing (or latent) data Y is called complete data.

The complete likelihood is

where p(x, y|ө) is the joint density of X and Y given the parameter ө.

The complete log-likelihood is

Page 29: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

29

Complete MLE By the definition of conditional density,

where p(y|x,ө) is the conditional density of Y given X=x and ө

The complete MLE

Page 30: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

30

Basic idea of EM Given X=x and ө= өt-1, where өt-1 is the current estim

ates the unknown parameters log p(x,Y| ө) is a function of Y whose unique best Me

an Squared Error (MSE) predicator is

Page 31: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

31

EM steps

Page 32: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

32

The magic of EM the direct MLE of

is relatively hard to solve But the MLE of complete log-likelihood is

relatively easier to obtain since is a function of x and y, (y is

hidden), we use the expectation of y under x and

So E-step

M-step

Page 33: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

33

Where are you? Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link del

ay using EM algorithm Conclusion

Page 34: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

34

EM in link delay inference

α1

α2α3

α4 α5

α6 α7 α8α9

x1

x2 x3

x4 x5x6 x7

x8

x9

Note that here notation x and y have opposite meaning of x, y stated in previous EM algorithm

Page 35: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

35

EM in link delay inference (2) Complete data Z=(X,Y) the complete data log-likelihood:

Pα[Y|X] has nothing to do with α

mi,j is the total number of packets experience a delay j on link i over N measurements.

][log]|[log][]|[log],[log);,( XPXYPXPXYPYXPYXL

Liii mi

mi

mii

M

ii

LXPXPXPXP

XPXPYXL

,1,0, ][]1[]0[][

][log][log);,(1

Page 36: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

36

EM in link delay inference (3)

Liii

Liii

mLi

mi

mi

mi

mi

mii LXPXPXPXP

,1,0,

,1,0,

,1,0,

][]1[]0[][

The MLE of αwould be

L

jji

jiji

m

m

1,

,,

Page 37: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

37

EM in link delay inference (4)nm

nmmmp 210

210

n

ii mmm

m

21

MLE

which is the frequency of event mi1i

i

A simple example is that we toss a die, P( the result i)=αi

(i=1,2…6) mi= how many times we see result i

Page 38: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

38

EM in link delay inference (5) We notice that is similar to

only different that should be replaced by

So the MLE

);,( YXL

jim ,

],|[ )1(,,

ijiji YmEm

L

jji

jiji

m

m

1,

,,

Page 39: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

39

EM in link delay inference (6)

],|)([

],|1[],|1[

1

],|[

)1(1

)1()(11

)1()(,

1 )(,

)1(,,

iNn

ijidelay

Nn

Nn

ijidelayji

Nn jidelayji

ijiji

YjidelayP

YEYEm

m

YmEm

Probability Propagation

Page 40: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

40

A simple example

delay on each link fall into {0,1,2,3}

y1 y2

x1

x2 x3

0

1

2 3

}41,

41,

41,

41{},,,{

}41,

41,

41,

41{},,,{

}41,

41,

41,

41{},,,{

333231303

232221202

131211101

αij=P{ delay (link i) = j }

Page 41: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

41

A simple example (2)Suppose there are 5 measurements:{ (3,2), (4,2), (6,5), (0,0), (4,1)}

y1 y2

x1

x2 x3

0

1

2 3

)](),(|0[

],|)([

2115

10,1

)1(1,

)0( nynyxPm

YjidelayPm

n

iNnji

Page 42: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

42

A simple example (3)

y1 y2

x1

x2 x3

0

1

2 3

41

41

41

]0[]2[]3[]0[]2,3[

]0[]0|2,3[

][]|2,3[

]0[]0|2,3[

]2,3|0[)]1(),1(|0[

132

132

1121

3

01121

1121

211211 )0()0(

xPxPxPxPxxP

xPxyyP

jxPjxyyP

xPxyyP

yyxPyyxP

j

Bayes Formula

Page 43: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

43

A simple example (4)

y1 y2

x1

x2 x3

0

1

2 3

31

64/164/164/164/1]2,3|0[

0410

41

]3[]3|2,3[41

41

41

]2[]2|2,3[41

41

41

]1[]1|2,3[

211

1121

1121

1121

yyxP

xPxyyP

xPxyyP

xPxyyP

Page 44: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

44

A simple example (5)

y1 y2

x1

x2 x3

0

1

2 3

340100

31

0]1,4|0[1]0,0|0[0]5,6|0[0]2,4|0[31]2,3|0[

0,1

211

211

211

211

211

m

yyxPyyxPyyxPyyxP

yyxPsimilarly:

Page 45: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

45

A simple example (6) ji

0 1 2 3

1 4/3 11/6 5/6 1

2 1 1/3 5/6 17/6

3 17/6 5/6 4/3 0

jim ,

mi,j computed in the first iteration.

Page 46: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

46

A simple example (7)

154

16/56/113/43/4

3,12,11,10,1

0,10,1

1,

,,

mmmmm

m

mL

jji

jiji

the physical meaning of α1,0 is that: the number of packets that experience delay 0 on link i divided by the total number of packets that travel through link i

Page 47: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

47

A simple example (8) ji

0 1 2 3

1 4/15 11/30 1/6 1/5

2 1/5 1/15 1/6 17/30

3 17/30 1/6 4/15 0

ji ,

αi,j computed in the first iteration

Page 48: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

48

A simple example (9)

j i

0 1 2 3

1 0.4 0.4 0 0.22 0.2 0 0 0.83 0.4 0.2 0.4 0

ji ,

Iteration: iterate E-step and M-step, until some termination criteria is satisfied!

After 6 iterations, αi,j converges to a fixed value.

Page 49: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

49

A simple example (9){ (3,2), (4,2), (6,5), (0,0), (4,1)}

y1 y2

x1

x2 x3

0

1

2 3

00. 10. 20. 30. 40. 50. 60. 70. 8

0 1 2 3

l i nk1l i nk2l i nk3

Page 50: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

50

Complexity

Page 51: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

51

Where are you? Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link

delay using EM algorithm Conclusion

Page 52: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

52

Conclusion+

The field is just emerging. Deploying measurement/probing schemes and inference

algorithms in larger networks is the next key step.

Page 53: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

53

Problems The spatial-temporally stationary and

independent traffic model has limitations, especially in heavily loaded networks.

A trend for highly uncooperative environment for active probing – passive traffic monitoring techniques, for example based on sampling TCP traffic streams

Page 54: A General Introduction to Tomography & Link Delay Inference with EM Algorithm

54

Thank you!

The End