013-Telkamp-How Full is Full

28
www.cariden.com 2009 © Cariden Technologies How full is full? Engineer your network to meet SLA’s Thomas Telkamp DENOG, Nov. 5th 2009 Frankfurt/Main, Germany

description

A

Transcript of 013-Telkamp-How Full is Full

Page 1: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

How full is full?

Engineer your network to meet SLA’s

Thomas Telkamp

DENOG, Nov. 5th 2009 Frankfurt/Main, Germany

Page 2: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Network Planning Methodology 2. The relationship between SLAs and network planning targets ...

2

Simulation

Traffic Demand Matrix

Demand Forecast

Service Subscription

Logical Topology

Connectivity Model

(IGP, MPLS TE)

Physical Topology

SLA Reqts (Overprovision-

ing Factors)

Step 1 Step 2 Step 3

Capacity Requirements

QOS Model

IP / MPLS Network

Optimisation

Network Provisioning

Step 4 Step 5 Step 6

Network Operations

Network Configuration (changeover)

Page 3: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Demand Characterization

•  Long-Term –  Measured Traffic

•  E.g. P95 (day/week) –  “unforeseen” events and growth

•  Short-Term –  Critical scale for queuing –  Determine over-provisioning factor

that will prevent queue buildup against micro-bursts

100%

0%

micro-bursts long-term variation

measured traffic

24 hours

Page 4: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Relevant Timescales

•  Long-Term: > 5 minutes •  Short-Term: < 5 minutes

100ms 1sec 1h 0 10sec 1min

Aggregate Flows Intra-Flow

Users/Applications

TCP (RTT) Flow Sizes/Durations Diurnal variation

Timescale

Dynamics

Characteristics

Page 5: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Fiber Tap (Gigabit Ethernet)

Tap

Analyzer

Page 6: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Raw Results 30 sec of data, 1ms scale

•  Mean = 950 Mbps •  Max. = 2033 Mbps •  Min. = 509 Mbps

•  95-percentile: 1183 Mbps •  5-percentile: 737 Mbps

•  (around 250 packets per 1ms interval)

Page 7: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Traffic Distribution Histogram (1ms scale)

•  Fits normal probability distribution very well (Std. dev. = 138 Mbps)

•  No Heavy-Tails •  Suggests small

overprovisioning factor

Page 8: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Autocorrelation Lag Plot (1ms scale)

•  Scatterplot for consecutive samples

•  Are periods of high usage followed by other periods of high usage?

•  Autocorrelation at 1ms is 0.13 (=uncorrelated)

Page 9: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Utilization 30 sec of data, 1 sec scale

•  Mean = 950 Mbps •  Max. = 978 Mbps •  Min. = 927 Mbps

•  Clearly longer derivations from the mean

•  High autocorrelation at 1 sec. (0.68)

Page 10: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Relevant Timescales

100ms 1sec 1h 0 10sec 1min

Aggregate Flows Intra-Flow

Users/Applications

TCP (<RTT) Flow Sizes/Durations Diurnal variation

White Noise LRD Smooth Behavior

Timescale

Dynamics

Characteristics

Page 11: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Traffic Behavior

•  < 150ms: Gaussian White Noise •  I.I.D, Normal distribution, no correlation •  Short-Range Dependent (H=0.5)

•  150ms – 30sec: Long-Range Dependence •  Non-summable autocorrelation •  Statistically Self-Similar •  X =d m1-H X(m), “Clustering”, Fractional Brownian Motion (fBM)

•  > 30 sec: Smooth •  Degenerate case: autocorrelation = 1 •  Almost constant, diurnal variation

Page 12: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Queuing Models/Simulation

•  M/M/1 queuing formula

•  Markovian –  Poisson-process –  Infinite number of sources

•  “Circuits can be operated at over 99% utilization, with delay and jitter well below 1ms” [2] [3]

•  Self-Similarity

•  Traffic is bursty at many or all timescales

•  “Scale-invariant burstiness (i.e. self-similarity) introduces new complexities into optimization of network performance and makes the task of providing QoS together with achieving high utilization difficult” [4]

Page 13: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Telephony Traffic (inter-city on 6/3/2002)

0

5000

10000

15000

20000

25000

30000

35000

40000

0:001:453:305:157:008:4510:3012:1514:0015:4517:3019:1521:0022:45

Centi-Erlang

Page 14: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Voice Capacity Allocation

•  Erlang traffic model(s) •  1 Erlang = 1 hour of calls

–  Average numbers of calls in an hour

•  Busy Hour Traffic: about 330 Erlang •  Erlang B formula (for 330 Erlang):

–  Blocking 1% -> 354 lines required –  Blocking 0.1% -> 376 lines required

•  Reverse calculation: –  100 lines available, blocking 0.1%:

•  75 Erlang (=average of 75 calls in an hour)

Page 15: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

IP Capacity Allocation

•  Measurement data –  E.g. 5-min average utilization

•  Performance objectives –  E.g. packet loss = 0%, jitter < 10ms –  End-to-end: convert to per-hop objective

•  But we don’t have an “Erlang formula”… •  Two paths towards a solution:

–  1) Model the traffic, and fit parameters –  2) Empirically derive guidelines

Page 16: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Queuing Simulation

FIFO Queue

Sampled Traffic

Fixed Service Rate

Monitor Queuing Delay

Sampled Traffic

Sampled Traffic

•  Feed multiplexed sampled traffic data into FIFO queue •  Measure amount of traffic that violates the delay bound

622 Mbps 572 Mbps

126 Mbps

240 Mbps

206 Mbps

Example: 92% Utilization

Page 17: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Queuing Simulation: Results

Page 18: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Scaled Delay Function (1G)

Page 19: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

P99.9 Delay Function

Page 20: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

P99.9 Delay Function

Page 21: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Cao, Cleveland, Sun

•  Bandwidth Estimation for Best-Effort Internet Traffic –  Jin Cao, William S. Cleveland, and Don X. Sun –  Statist. Sci. Volume 19, Number 3 (2004), 518-543.

•  Data: –  BELL, AIX, MFN, NZIX

•  Model depends on: –  #connections, traffic rate, delay target –  Avg. bandwidth per connection: 16384 bps

•  Best-Effort Delay Formula:

Page 22: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Best-Effort Delay Model

Page 23: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Queuing Simulation: Numeric Results

•  1 Gbps (Gigabit Ethernet) –  1 ms delay bound for 999 out of 1000 packets (99.9-

percentile): •  90% maximum utilization

•  622 Mbps (STM-4c/OC-12c) –  1 ms delay bound for 999 out of 1000 packets (99.9-

percentile): •  85% maximum utilization

Page 24: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Multi-hop Queueing

1 hop

Avg: 0.23 ms P99.9: 2.02 ms

2 hops

Avg: 0.46 ms P99.9: 2.68 ms

Page 25: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Multi-hop Queueing (1-8 hops)

Page 26: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Queueing: Summary

•  Queueing Simulation: –  Gigabit Ethernet (backbone) link

•  overprovisioning percentage in the order of 10% is required to bound delay/jitter to less than 1 ms

–  Lower speeds (<1G) •  overprovisioning factor is significant,

–  Higher speeds (2.5G/10G) •  overprovisioning factor becomes very small

•  P99.9 multi-hop delay/jitter is not additive

Page 27: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

Future work

•  Use more conservative delay formula •  Estimate avg. bandwidth per connection

(NetFlow?) •  Investigate VoIP traffic (ongoing) •  Integrate numbers in end-to-end planning and

engineering process –  Include DiffServ –  E.g.using Cariden MATE

Page 28: 013-Telkamp-How Full is Full

www.cariden.com 2009 © Cariden Technologies

•  Web: http://www.cariden.com

•  Phone: +1 650 564 9200

•  Fax: +1 650 564 9500

•  Address: 888 Villa Street, Suite 500 Mountain View, CA 94041

USA