Evaluating Potential Routing Diversity for Internet Failure Recovery
![Page 1: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/1.jpg)
1/27
Evaluating Potential Routing Diversity for Internet Failure Recovery
*Chengchen Hu, +Kai Chen, +Yan Chen, *Bin Liu
*Tsinghua University, +Northwestern University
![Page 2: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/2.jpg)
2/27
Internet Failures
Failure is part of everyday life in IP networks
  e.g., 675,000 excavation accidents in 2004 [Common Ground Alliance]
  Network cable cuts every few days …
Real-world emergencies or disasters can lead to substantial Internet disruption
  Earthquakes
  Storms
  Terrorist incident: 9.11 event
  …
![Page 3: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/3.jpg)
3/27
Example: Taiwan earthquake incident
Large earthquakes hit south of Taiwan on 26 December 2006
Only two of nine cross-sea cables were not affected
Abundant physical-level connectivity remained, but it took ISPs too long to find and use it.
figures cited from "Aftershocks from the Taiwan Earthquakes: Shaking up Internet transit in Asia, NANOG42"
![Page 4: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/4.jpg)
4/27
How reliable is the Internet?
The Internet is not as reliable as people expect! [Wu, CoNEXT’07]
  32% of ASes are vulnerable to a single critical customer-provider link cut
  93.7% of a Tier-1 ISP’s single-homed customers are lost from the peered ISP due to Tier-1 depeering
Our question: can we find more resources to increase Internet reliability, especially when an Internet emergency happens?
![Page 5: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/5.jpg)
5/27
Roadmap
Background
Where are the potential resources?
How many potential resources are there?
How to use the potential resources?
![Page 6: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/6.jpg)
6/27
Basic Idea
Two places where we can find more routing diversity:
Internet eXchange Points (IXPs)
  Co-location where multiple ASes exchange their traffic
  Participant ASes in an IXP may not be connected via BGP
Internet valley-free routing policy
  AS relationships: customer-provider, peering, sibling
  Peering relaxation (PR): allow one AS to carry traffic from the other to its provider
  Mentioned in [Wu, CoNEXT’07], but no evaluation
Our main focus: How much can we gain from these two potential resources, i.e., IXP and PR?
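To make the valley-free rule and peering relaxation concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes one plausible reading of PR: a relaxed peer link is treated like a customer-provider link, so the helper peer may carry the traffic up to its own provider. The names `is_valley_free`, `rel`, and `relaxed_peerings`, and the AS names V, H, P, D, are hypothetical.

```python
from typing import Dict, FrozenSet, List, Tuple

# Relationship of the directed link a -> b, seen from a:
#   "c2p": b is a's provider      "p2c": b is a's customer
#   "p2p": a and b are peers      "sib": a and b are siblings
Rel = Dict[Tuple[str, str], str]


def is_valley_free(
    path: List[str],
    rel: Rel,
    relaxed_peerings: FrozenSet[Tuple[str, str]] = frozenset(),
) -> bool:
    """Check an AS path against the valley-free policy.

    Assumption (not from the slides): a peer link listed in
    relaxed_peerings is treated as a customer-to-provider link, which
    models the helper carrying its peer's traffic to its provider.
    """
    downhill = False  # True once a peer link or a p2c link has been crossed
    for a, b in zip(path, path[1:]):
        kind = rel[(a, b)]
        if kind == "p2p" and (a, b) in relaxed_peerings:
            kind = "c2p"  # peering relaxation applied to this link
        if kind == "sib":
            continue  # sibling links are allowed anywhere
        if kind == "c2p":
            if downhill:
                return False  # climbing again after descending is a valley
        elif kind == "p2p":
            if downhill:
                return False  # at most one peer link, and only at the top
            downhill = True
        elif kind == "p2c":
            downhill = True
        else:
            raise ValueError(f"unknown relationship: {kind}")
    return True


# Hypothetical topology: victim V peers with H, H buys transit from P,
# and P is the provider of destination D.
rel = {("V", "H"): "p2p", ("H", "P"): "c2p", ("P", "D"): "p2c"}
path = ["V", "H", "P", "D"]
print(is_valley_free(path, rel))                                # strict policy
print(is_valley_free(path, rel, frozenset({("V", "H")})))       # with PR on V-H
```

Under the strict policy the path is rejected because H would carry peer traffic to its provider; relaxing the V-H peering admits it.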
![Page 7: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/7.jpg)
7/27
Roadmap
Background
Where are the potential resources?
How much help could they provide?
How to use the potential resources?
![Page 8: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/8.jpg)
8/27
Dataset for Evaluation
Most complete AS topology graph
  BGP data: Route Views, RIPE/RIS, Abilene, CERNET BGP View
  P2P traceroute: traceroute data from 992,000 IPs in over 3,700 ASes
  In total, 120K AS links with AS relationships
  http://aqualab.cs.northwestern.edu/projects/SidewalkEnds.html [Chen et al, CoNEXT’09]
IXP data
  PCH + Peeringdb + Euro-IX (~200 IXPs)
  3468 participant ASes
![Page 9: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/9.jpg)
9/27
Failure Models
Tier-1 depeering
  Real example: Cogent and Level3 depeering
Tier-1 provider-customer link teardown
  Reported in NANOG forum
Mixed types of link breakdown
  9.11 event, Taiwan earthquakes, 2003 Northeast blackout
![Page 10: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/10.jpg)
10/27
Evaluation Metrics
Recovery Ratio
  # of recovered <src-dst> AS pairs versus total # of affected <src-dst> AS pairs
Path Diversity
  # of increased link-disjoint AS paths between affected <src-dst> AS pairs
Shifted Path
  # of link-disjoint AS paths shifted onto a normal link after we use IXP or PR resources
For more results, please see our report "Evaluating Potential Routing Diversity for Internet Failure Recovery" at http://s-router.cs.tsinghua.edu.cn/pub/IER_report.pdf
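The recovery ratio metric can be illustrated with a small Python sketch. It is a simplification under stated assumptions: the AS graph is undirected and routing policy is ignored, so "recovered" here only means "connected again after the extra IXP or PR links are added". The function names and the toy topology are hypothetical and not taken from the report.

```python
from collections import deque
from typing import Dict, Iterable, Set, Tuple

Graph = Dict[str, Set[str]]


def add_link(g: Graph, a: str, b: str) -> None:
    """Add an undirected AS-level link."""
    g.setdefault(a, set()).add(b)
    g.setdefault(b, set()).add(a)


def reachable(g: Graph, src: str, dst: str) -> bool:
    """Breadth-first search; policy-free reachability (an assumption)."""
    seen = {src}
    queue = deque([src])
    while queue:
        node = queue.popleft()
        if node == dst:
            return True
        for nxt in g.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return False


def recovery_ratio(
    failed_graph: Graph,
    affected_pairs: Iterable[Tuple[str, str]],
    extra_links: Iterable[Tuple[str, str]],
) -> float:
    """Recovered <src-dst> pairs divided by all affected <src-dst> pairs."""
    g = {node: set(nbrs) for node, nbrs in failed_graph.items()}  # copy
    for a, b in extra_links:
        add_link(g, a, b)
    pairs = list(affected_pairs)
    if not pairs:
        return 0.0
    recovered = sum(1 for s, d in pairs if reachable(g, s, d))
    return recovered / len(pairs)


# Toy graph after a failure: A-B and C-D survive, E is cut off entirely.
failed: Graph = {}
add_link(failed, "A", "B")
add_link(failed, "C", "D")
affected = [("A", "C"), ("A", "D"), ("A", "E")]
# One hypothetical new link (e.g. B and C co-located at an IXP).
print(recovery_ratio(failed, affected, [("B", "C")]))
```

In this toy case two of the three affected pairs become connected again, so the ratio is 2/3.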
![Page 11: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/11.jpg)
11/27
Results: Tier-1 Depeering
36 experiments for 9 Tier-1 ASes
Recovery ratio: most of the lost AS pairs can be recovered
![Page 12: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/12.jpg)
12/27
Results: Tier-1 Depeering
Path diversity: multiple AS paths between lost AS pairs
![Page 13: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/13.jpg)
13/27
Results: Tier-1 Depeering
Shifted path
  On average, 3.75 ~ 17.2 for all 36 experiments
  Moderate traffic load shifted onto the unaffected links
![Page 14: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/14.jpg)
14/27
Roadmap
Background
Where are the potential resources?
How much help could they provide?
How to use the potential resources?
![Page 15: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/15.jpg)
15/27
Economic model
  B pays A for recovery
Business model
  Risk alliance (like airlines): price is determined beforehand
  Pay by bandwidth & duration, or by bits (95th percentile)
[Figure: A and B connected as peers, over provider-customer (P-C) links, or through an IXP]
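The "95 percentile" option refers to burstable billing. The sketch below shows one common convention (sort the periodic usage samples, discard the top 5%, bill the highest remaining sample). Whether the authors intend exactly this variant is an assumption, and `price_per_mbps` is a hypothetical parameter.

```python
from typing import Sequence


def percentile_95(samples_mbps: Sequence[float]) -> float:
    """Return the 95th-percentile sample (nearest-rank method)."""
    if not samples_mbps:
        raise ValueError("need at least one usage sample")
    ordered = sorted(samples_mbps)
    n = len(ordered)
    # Integer form of ceil(0.95 * n), to avoid floating-point rounding.
    rank = (95 * n + 99) // 100
    return ordered[rank - 1]


def recovery_bill(samples_mbps: Sequence[float], price_per_mbps: float) -> float:
    """Charge for the billable rate at a price agreed beforehand."""
    return percentile_95(samples_mbps) * price_per_mbps


# Twenty samples: a single burst to 1000 Mbps falls in the discarded top 5%.
usage = [10.0] * 19 + [1000.0]
print(percentile_95(usage))
print(recovery_bill(usage, price_per_mbps=2.0))
```

The design point is that short bursts during recovery do not dominate the price B pays A.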
![Page 16: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/16.jpg)
16/27
Communication channel
Search for peers
  Have direct connections to peers
Search for co-located ASes in the same IXP
  ASes are connected by switches in modern IXPs
  Messages are broadcast via switches
  Message confidentiality through public key crypto
![Page 17: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/17.jpg)
17/27
Automatic communications: query phase
[Figure: the victim AS sends a query to potential helpers A, B, C, and D: "Who can connect to specific destination ASes?"]
![Page 18: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/18.jpg)
18/27
Automatic communications: Check availability
[Figure: victim AS and potential helpers A, B, C, and D]
1. Check connectivity (traceroute)
2. Check available bandwidth (IGI/Yaz/pathload/spruce)
![Page 19: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/19.jpg)
19/27
Automatic communications: reply phase
[Figure: potential helpers reply to the victim AS; one reply reads "I can provide X bandwidth to it", another "I can provide Y bandwidth to it"]
Message confidentiality with public key crypto
![Page 20: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/20.jpg)
20/27
Automatic communications: ACK phase
[Figure: the victim AS sends an ACK to a potential helper: "I would like to buy Z (<= Y)"]
![Page 21: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/21.jpg)
21/27
Automatic communications: new BGP session
[Figure: victim AS and potential helpers A, B, C, and D]
Set up BGP session
Can be withdrawn later
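The query, reply, and ACK phases above can be sketched as a small in-memory simulation. This is an illustration only, not the protocol's wire format: the `Helper` class, the `recover` function, and the rule of accepting the largest offer are assumptions. The availability checks (traceroute, bandwidth probing), the public key encryption, and the actual BGP session setup are stubbed out.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Set, Tuple


@dataclass
class Helper:
    name: str
    reachable: Set[str]   # destination ASes this helper can still reach
    spare_mbps: float     # bandwidth it is willing to sell (already measured)

    def on_query(self, destinations: Set[str]) -> Optional[Tuple[str, Set[str], float]]:
        """Reply phase: answer only if a requested destination is reachable."""
        hits = self.reachable & destinations
        if not hits or self.spare_mbps <= 0:
            return None
        return (self.name, hits, self.spare_mbps)


def recover(
    destinations: Set[str], demand_mbps: float, helpers: List[Helper]
) -> Optional[Dict[str, object]]:
    """Run one query/reply/ACK round from the victim's point of view."""
    # Query phase: ask every potential helper (peers or IXP co-located ASes).
    replies = []
    for helper in helpers:
        reply = helper.on_query(destinations)
        if reply is not None:
            replies.append(reply)
    if not replies:
        return None
    # ACK phase: pick the largest offer Y (an assumed rule) and buy Z <= Y.
    name, hits, offered = max(replies, key=lambda r: r[2])
    bought = min(demand_mbps, offered)
    # A real deployment would now set up a BGP session with the helper,
    # and withdraw it once the failure is repaired.
    return {"helper": name, "destinations": hits, "bought_mbps": bought}


helpers = [
    Helper("A", {"D1"}, 50.0),
    Helper("B", {"D2"}, 80.0),
    Helper("C", {"D1", "D2"}, 30.0),
]
print(recover({"D1"}, demand_mbps=40.0, helpers=helpers))
```

Here helper B never replies because it cannot reach D1, and the victim buys 40 Mbps out of the 50 Mbps offered by A.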
![Page 22: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/22.jpg)
22/27
Optimal selection of helper ISPs
From a single victim ISP perspective
  Buy transit from a minimal number of ASes
  Recover all the (prioritized) traffic
  Least cost
![Page 23: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/23.jpg)
23/27
Selection heuristic
Lost connectivity to {Di}, with bandwidth demand {Bi}
$x_{ij}$ is how much bandwidth AS j could provide to Di
![Page 24: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/24.jpg)
24/27
Selection heuristic
Lost connectivity to {Di}, with bandwidth demand {Bi}
Score each (helper) AS j with $\sum_i \min(x_{ij}/B_i, 1)$
Select the AS with the largest score (select the one with the lowest price if scores are equal)
[Figure: example scores; extracted values: 3, 2.3 and 5, 2.1]
![Page 25: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/25.jpg)
25/27
Selection heuristic: update
Lost connectivity to {Di}, with bandwidth demand {Bi}
[Figure: updated values]
![Page 26: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/26.jpg)
26/27
Selection heuristic: rescore and select
Lost connectivity to {Di}, with bandwidth demand {Bi}
[Figure: example rescoring; extracted values: 1, 0.3, 0.10]
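The score, select, update, and rescore steps can be sketched as a greedy loop in Python. The scoring formula and the lowest-price tie-break follow the slides. The update rule (subtract the chosen helper's offer from each remaining demand, floored at zero) and the stopping condition are assumptions, since the slides only say "Update". All names and numbers are hypothetical.

```python
from typing import Dict, List, Tuple

Offers = Dict[str, Dict[str, float]]  # offers[j][i]: bandwidth AS j offers to Di


def score(offer: Dict[str, float], demand: Dict[str, float]) -> float:
    """Sum over destinations of min(x_ij / B_i, 1), skipping satisfied ones."""
    return sum(
        min(offer.get(dest, 0.0) / need, 1.0)
        for dest, need in demand.items()
        if need > 0
    )


def select_helpers(
    demand: Dict[str, float], offers: Offers, price: Dict[str, float]
) -> Tuple[List[str], Dict[str, float]]:
    """Greedily pick helper ASes until demand is met or no helper is useful."""
    remaining = dict(demand)
    candidates = set(offers)
    chosen: List[str] = []
    while candidates and any(need > 0 for need in remaining.values()):
        scores = {j: score(offers[j], remaining) for j in candidates}
        # Largest score first; lowest price breaks ties.
        best = max(candidates, key=lambda j: (scores[j], -price[j]))
        if scores[best] <= 0:
            break  # nobody left can help with the remaining destinations
        chosen.append(best)
        candidates.remove(best)
        # Assumed update step: reduce each demand by what the helper covers.
        for dest in remaining:
            covered = offers[best].get(dest, 0.0)
            remaining[dest] = max(remaining[dest] - covered, 0.0)
    return chosen, remaining


demand = {"D1": 10.0, "D2": 20.0}
offers = {
    "H1": {"D1": 10.0, "D2": 5.0},
    "H2": {"D1": 5.0, "D2": 20.0},
    "H3": {"D2": 15.0},
}
price = {"H1": 3.0, "H2": 5.0, "H3": 1.0}
print(select_helpers(demand, offers, price))
```

In this toy run H2 scores 1.5 in the first round and is selected; after the update only 5 units toward D1 remain, so H1 scores 1.0 in the rescoring round and completes the recovery.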
![Page 27: Evaluating Potential Routing Diversity for Internet Failure Recovery](https://reader036.fdocuments.us/reader036/viewer/2022062521/56816733550346895ddbe194/html5/thumbnails/27.jpg)
27/27
Summary
Point out a new avenue for Internet failure recovery.
Evaluate the potential routing diversity via IXP and PR with the most complete AS topology graph.
  40%-80% of affected <Src, Dst> AS pairs can be recovered via IXP and PR with multiple paths and moderate shifted paths.
Possible and practical mechanisms to utilize potential routing diversity.
Look forward to feedback and collaborations from IXP/ISPs!