The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang,...

24
The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003

description

Aug 12, 2003CoDeeN Overview - IRIS/PlanetLab3 What Does It Do? An Academic Content Distribution Network Redirects/caches HTTP requests Based on our OSDI 2002 paper on CDN performance An Open Proxy Network Probably the largest in existence

Transcript of The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang,...

Page 1: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

The CoDeeN Content Distribution Network

Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry PetersonPrinceton UniversityAugust 12, 2003

Page 2: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 2

Content Distribution Networks

Replicates Web content broadlyRedirects clients to “best” copy

Load, locality, proximityOffloads work from origin serversMultiplexes load spikes

Reduces overprovisioningEx: Akamai, Mirror Image, Speedera

Page 3: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 3

What Does It Do?An Academic Content Distribution Network

Redirects/caches HTTP requestsBased on our OSDI 2002 paper on CDN performance

An Open Proxy NetworkProbably the largest in existence

Page 4: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 4

Who Is The Target Audience?Now

Users wanting better performancePeople seeking “anonymity”

NextContent providers seeking load sharing

LaterGeneral support for absorbing flash crowdsAvoid the “Slashdot Effect”

Page 5: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 5

How Does It Work?Server surrogates (proxies) on most North American sites

Originally everywhere, but we cut backClients specify proxy to use

Cache hits served locallyCache misses forwarded to CoDeeN nodes• Maybe forwarded to origin servers

Page 6: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 6

Request Forwarding

Page 7: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 7

When Will It Be Ready?January – development started

Reliability & stability major concernsMarch – stable enough for daily useApril – security problems begin

Shut down for one monthJune – Restarted “beta”Expecting “production” soon

Page 8: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 8

Decisions – Good & BadUse commercial proxy with API [USITS 2003]

Good – mostly layer 7 concernsBad – limits deployment size (donated licenses)

Deployment on PlanetLabGood – otherwise impossible“Bad” – vulnerable to other experiments

Allow open accessGood – generates real trafficBad – some traffic just plain mean

Page 9: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 9

Lots of Malicious TrafficSpammers

SMTP tunnels, POST forms, IRC channelsBandwidth hogs

Google crawls, steganographers, X-PacificHackers & Spreaders

Yahoo dictionary attacks, IIS vuln testsContent thieves

E-journals/databases, local content

Restrict ports & HTTP methods

Multi-scale req & bw accounting

Signature database & Robot test

Determine location & privilege

Page 10: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 10

Protecting Privilege

Page 11: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 11

Attempted SMTP Tunnels/Day

Page 12: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 12

By The Numbers…Restarted in late May

In continuous operationStats from first 8 weeks

Over 59,000 unique IPs as clientsOver 24 million requests servicedValid rates up to 15K reqs/hourRoughly 1 million reqs/day aggregate

Page 13: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 13

More Production InfoAbout 2000 lines of code

About ¼ is actual decision logicUptimes limited by upgrades

Generally 1-2 times/weekDowntimes of 20 seconds/node

Currently on ~40 nodes

Page 14: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 14

Daily Requests (Serviced)

Page 15: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 15

Welcome

Page 16: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 16

Avoiding

sorted by # avoiding

Page 17: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 17

Load

sorted by # load average

Page 18: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 18

Total

sorted by # total req rate

Page 19: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 19

Users

sorted by # users

Page 20: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 20

The Troubles We’ve CausedRoutinely trigger open proxy alerts

Educating sysadmins, othersResource checks generate noise

Got onto planetlab-supportReally good honeypots

6000 SMTP flows/minute at CMUSpammers do ~1M HTTP ops/day

Page 21: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 21

What We’ve LearnedParallel ssh is a must

General commands/queriesBasis for parallel scpUsed to detect out-of-date files

Monitoring is a mustToo hard to see anomalies in 40+ nodesAlmost looks like a demo

Be careful accepting outside requests

Page 22: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 22

What We Still NeedBetter layer 4 tools

Hard to tell why things dieBuilding complete heartbeats isn’t fun

Better isolation on most resourcesCPU/OS: Java, VServers, ???Others: FD exhaustion, disk space

Page 23: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

Aug 12, 2003 CoDeeN Overview - IRIS/PlanetLab 23

What We Wouldn’t Mind…Customizable DNS mapping

Map project.planet-lab.org to some nodeProjects could provide feedback• Node availability, utility, etc

Most IP geolocation seems locked up

Page 24: The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.

24CoDeeN Overview - IRIS/PlanetLabAug 12, 2003

More Infohttp://codeen.cs.princeton.edu