Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

23
Toward Better Toward Better Geolocation: Geolocation: Improving Internet Distance Improving Internet Distance Estimates Using Route Estimates Using Route Traces Traces Chandrika Jayant Chandrika Jayant Ethan Katz-Bassett Ethan Katz-Bassett

description

Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces. Chandrika Jayant Ethan Katz-Bassett. Outline. Motivations for geolocation Constraint-Based Geolocation Problems with CBG Our Approach PlanetLab Experiments Conclusion/ Future Work. Geolocation?. - PowerPoint PPT Presentation

Transcript of Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Page 1: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Toward Better Toward Better Geolocation:Geolocation:

Improving Internet Distance Improving Internet Distance Estimates Using Route TracesEstimates Using Route Traces

Chandrika JayantChandrika Jayant

Ethan Katz-BassettEthan Katz-Bassett

Page 2: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

OutlineOutline

Motivations for geolocationMotivations for geolocation Constraint-Based GeolocationConstraint-Based Geolocation Problems with CBGProblems with CBG Our ApproachOur Approach PlanetLab ExperimentsPlanetLab Experiments Conclusion/ Future WorkConclusion/ Future Work

Page 3: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Geolocation?Geolocation?

Infer the geographic location of an Infer the geographic location of an Internet hostInternet host

Many applications would benefit from Many applications would benefit from this informationthis information

Advertising, EBS, location sensitive Advertising, EBS, location sensitive infoinfo

Different levels of granularityDifferent levels of granularity

Page 4: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Constraint-Based Constraint-Based GeolocationGeolocation

Landmarks: Set of hosts with known Landmarks: Set of hosts with known locationslocations

Each landmark estimates distance to Each landmark estimates distance to targettarget

Set performs multilateration using Set performs multilateration using these distancesthese distances

-Gueye, Ziviani, Crovella, Fdida (2004)-Gueye, Ziviani, Crovella, Fdida (2004)

Page 5: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

CBG MultilaterationCBG Multilateration

Page 6: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

CBG MultilaterationCBG Multilateration

Page 7: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

CBG MultilaterationCBG Multilateration

Page 8: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

CBG MultilaterationCBG Multilateration

Page 9: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

CBG Bestline Distance CBG Bestline Distance EstimatesEstimates

Page 10: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

CBG Breakdowns (CBGB’s)CBG Breakdowns (CBGB’s)

Estimates are not tight and vary Estimates are not tight and vary widely widely large confidence regions, large confidence regions, need many probes to get a few tight need many probes to get a few tight onesones

No better at estimating training set vs. No better at estimating training set vs. other hosts (in general)other hosts (in general)

More data trained on, worse accuracy More data trained on, worse accuracy (in general)(in general)

Still underestimate some distancesStill underestimate some distances

Page 11: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Our ApproachOur Approach

Intuition: Targets that have similar Intuition: Targets that have similar routes have similar delay routes have similar delay distance distance conversions conversions

Use route info to achieve more accurate Use route info to achieve more accurate estimatesestimates

Want to fit into CBG frameworkWant to fit into CBG framework 2 main techniques, still using bestline fit:2 main techniques, still using bestline fit:

Path-BasedPath-Based Router-BasedRouter-Based

Page 12: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Path-Based EstimationPath-Based Estimation

• Landmark learns routes to its training setLandmark learns routes to its training set• Traceroute target up to TTL = xTraceroute target up to TTL = x• Find longest partial path shared with a subset of Find longest partial path shared with a subset of training hoststraining hosts• Calculate bestline using only this subsetCalculate bestline using only this subset

Page 13: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Router-Based EstimationRouter-Based Estimation

Landmark learns routes to its training Landmark learns routes to its training setset

Send packet to target with TTL = xSend packet to target with TTL = x Find subset of training hosts with paths Find subset of training hosts with paths

through this router through this router Calculate bestline using only this Calculate bestline using only this

subsetsubset In practice, use xIn practice, use x11,, xx22,…,,…, xxn n

Page 14: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

PlanetLab ExperimentsPlanetLab Experiments

110 PlanetLab hosts in North America110 PlanetLab hosts in North America Lat/long available for eachLat/long available for each Used Scriptroute to gather delay and Used Scriptroute to gather delay and

routes between hostsroutes between hosts 26 landmarks (after munging)26 landmarks (after munging) Path-Based used TTL up to 12Path-Based used TTL up to 12 Router-Based used TTLs (12,9,6)Router-Based used TTLs (12,9,6)

Page 15: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Map of Landmark Hosts (26)Map of Landmark Hosts (26)

Page 16: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Path Length vs. AccuracyPath Length vs. Accuracy

Page 17: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Router TTL vs. Accuracy Router TTL vs. Accuracy

Page 18: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Overall Accuracy of Overall Accuracy of EstimationsEstimations

Page 19: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Estimations to Collocated Estimations to Collocated TargetsTargets

Page 20: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Effects of Training Set SizeEffects of Training Set Size

Page 21: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

ConclusionsConclusions

Geolocation has powerful potentialGeolocation has powerful potential CBG is interesting, but needs CBG is interesting, but needs

improvementimprovement Route information improves accuracy Route information improves accuracy

of distance estimatesof distance estimates With more accurate estimations, likely With more accurate estimations, likely

need fewer landmarks to locate need fewer landmarks to locate targetstargets

Page 22: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Future WorkFuture Work

Modify CBG to better handle Modify CBG to better handle underestimates/ use other line fitsunderestimates/ use other line fits

Test on larger data setsTest on larger data sets Numerical analysisNumerical analysis Extend to hosts w/ slower connectionsExtend to hosts w/ slower connections Use delay distribution to “normalize” Use delay distribution to “normalize”

measurementsmeasurements

Page 23: Toward Better Geolocation: Improving Internet Distance Estimates Using Route Traces

Questions?Questions?