InstaGENI and GENICloud: An Architecture for a Scalable Testbednv/2nd-rick.pdf · 2012-09-24 ·...
Transcript of InstaGENI and GENICloud: An Architecture for a Scalable Testbednv/2nd-rick.pdf · 2012-09-24 ·...
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
InstaGENI and GENICloud: An Architecture for a Scalable TestbedRick McGeer
HP Labs
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.3
3
The “Grand Challenge” Phase of Research
Transition from individual experimenter to institution or multi-institution team
Typically necessitated because problems go beyond the scale of an individual research group
Investigation of new phenomena required dramatic resources
Ex: particle physics 1928-1932
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.4
The era of institutional systems research
Computer Systems Research, 1980-2010Dominated by desktop-scale systems
1980-~1995: The desktop was the experimental systemEx: Original URL of Yahoo! was akebono.cs.stanford.edu/yahoo.html
Akebono was Jerry Yang’s Sun workstation!• Named for a prominent American Sumo wrestler ‒ Jerry had spent a term in Kyoto in 1992
Sometimes “servers” used to offload desktops
But rarely: “Server” ca. 1990 was a VAX 11, less powerful than a SUN or DEC workstation
~1995-~2005: Used servers primarily because desktop OS unsuitable for serious work
~2005-: Need clusters (and more) for any reasonable experiment
The Era of Institutional Systems Research has begun
4
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.5
Why?
Activity in 21st Century Systems Research focused on massively parallel, loosely-coupled, distributed computingContent Distribution NetworksKey-Value StoresCloud Resource Allocation and ManagementWide-Area Redundant StoresFault Recovery and Robust ProtocolsEnd-system multicastMulticast messaging
Key Problem: Emergent Behavior at ScaleCan’t anticipate phenomena at scale from small-scale behavior
Hence: Moderate-to-large scale testbeds:G-Lab, PlanetLab, OneLab,…
5
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.6
6
Experimental Physics Before 1928
Dominated by tabletop apparatus
• Ex: Rutherford’s discovery of the nucleus, 1910
• Done with tabletop apparatus, shown here
• Major complication: had to observe in darkened room
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.7
7
Example: Chadwick and the Neutron
Chadwick used high-energy particles from polonium to bombard nucleusNeutron only method to account for high-energy radiation from bombardmentKey apparatus “leftover plumbing” ‒ pipe used to focus radiation beamDate: February, 1932
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.8
8
Entry of Institutional Physics
Nuclear Fission, Cockcroft and Walton, April, 1932
Key: needed high voltages (est250,000+ volts) to split nucleus
Room(!) to hold apparatus major constraint
Needed major industrial help (Metropolitan-Vickers)
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.9
9
What a difference two months makes..
Chadwick, 2/32 Cockcroft/Walton, 4/32
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.10
1
Since Then…
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.11
1
Key Differences
Apparatus now takes many years to construct, costs billionsRequires multi-national consortia
Discoveries made by large teams of scientistsHundreds on the Top Quark team¥Thousands on the Higgs Team
Experiments last for 30+ yearsEx: ALICE at LHC, Babar at SLAC
Experimental devices measured by energies of collisions producedDriven by cost and complexity of apparatusCockcroft and Walton heralded era of institutional Grand Challenge physics
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.12
1
Why Computer Science is undergoing a phase change
Key: need to understand planetary-scale systems
Systems and services that run all over the planet
Critical, pervasive, robust
Emergent behavior at scale
Can only understand by experimenting near scale
Require millions of simultaneous users (or at least simulations of that scale)
Ex: Twitter crashed at 1m users (and needed to rebuild infrastructure)
Requires planetary-scale testbed and deployment platform
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.13
Problem: We can’t Build it
13
Industrial scale is rapidly outstripping academic/research scale
Ex:Yahoo! “Clique” is 20,000 servers
20 VMs/server
400,000 VMs
Far beyond any existing testbed
PlanetLab + OneLab: 1000+ nodes
Emulab: ~500 nodes
Glab…
Single Yahoo Clique 20x our best testbeds
So what do we do? Federation…
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.14
Why Federate?
14
Because we can each afford a piece…
Federate a large number of small clouds and testbeds
Agree on common APIs and form of authorization
Ad-hoc federation
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.15
InstaGENI and GENICloud/TransCloudTwo complementary elements of Federation Architecture
Inspiration: the Web
Can we do for Clouds what the web did for computation?
Make it easy, safe, cheap for people to build small Clouds
Make it easy, safe, cheap for people to run Cloud jobs at many different sites
GENICloud/TransCloud
Common API across Cloud Systems
Access Control without identity
Equivalent of http
InstaGENI
“Just works” out of the box small cloud
“Apple II”/reference webserver of Clouds
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.16
Key Assumption
Each facility implements Slice-Based Facility Interface
Standard, unified means of allocatingVirtual machines at each layer of the stack (“slivers”)
Networks/sets of virtual machines (“slices”)
Already supported by PlanetLab, ORCA, ProtoGENI
Now supported by Eucalyptus and OpenStack (our contribution)
16
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.17
What we need, what we don’t
What we needMethod of creating slices on clouds and distributed infrastructuresMethod of communicating between clouds and distributed infrastructuresMethod of interslice communication between clouds
What we don’tSingle sign-on!Single AUPSingle resource allocation policy or procedureUnified security policy
Principle of Minimal AgreementWhat is the minimum set of standards we can agree on to make this happen?
17
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.18
What do we need from the clouds
Building BlocksEucalyptus: Open-source clone of EC-2OpenStack: Open-source Widespread developer mindshare (easy to use, familiar)
What we want: Slice-Based Federation ArchitectureMeans of creating/allocating slicesAuthorization by Attribute-Based Access Control (ABAC)Delegation primitiveExplicit costs/resource allocation primitives• Need to be able to control costs for the developer
18
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.19
Why GENICloud?
Minimal set of facilities to permit seamless interconnection without trust
Motivation: the Web
Web sites mutually untrusting
Key facilities: DNS, HTTP. HTML
What are the equivalents for Clouds?
Our cut: Slices, ABAC, DNS conventions
<instancename>.<sitename>.<slicename>.<authorityname>.transcloud.net
19
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.20
Introduction – TransCloud
TransCloud = A Cloud Where Services Migrate, Anytime, Anywhere In a World Where Distance Is EliminatedJoint Project Between GENICloud, iGENI, G-LabGENICloud Provides Seamless Interoperation of Cloud Resources Across N-Sites, N-Administrative DomainsiGENI Optimizes Private Networks of Intelligent DevicesG-Lab contributes networking and advanced cloud resources
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.21
Seamless Computation Services Available Anytime, Anywhere“The Cloud” offers the prospect of ubiquitous information and services…BUT…Performance of Cloud services Highly Dependent On Location• Of End-User, Applications, Middle Processes, Network Topology• Of Cloud Data, Compute Processes, Storage, etc
Why?Performance of Legacy Protocols Highly Dependent on Latency Therefore:Want to compute anywhere convenientWant to be able to compute everywhere
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.22
What do we need to make this work?
Advanced Networking and CachingFirm guarantees on bandwidth and latency on a per-application basisApplication support at Layer 3 and Layer 2Means: Private Network where possible
Access to platforms wherever data livesBut data lives everywhere!No organization has Points of Presence (PoP)s everywhereNeed for an individual to be able to make arrangements with an cloud service provider, anywhere, efficiently, minimal overheadCommon form of identityCommon identity not requiredCommon AUP not required
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.23
What do we need to make this work?
Ability to instantiate and run a program anywhere
Common API at each level of the stack
IaaS/NaaS (VM/VN Creation)
PaaS (guaranteed OS/Progamming environment)
OaaS (Standard Query/Data Management API)
Easy, Standard Naming Scheme
I need to know the name of my VM’s, logins, store etc without asking
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.24
Solution – TransCloudIntroducing TransCloud PrototypeAn Early Instantiation of the ArchitectureA Distributed Environment That Enables Component and Interoperability EvaluationA Testbed On Which Early Experimental Research Can Be ConductedAn Environment That Can Be Used To Explain/Showcase New Innovative Architecture/Concepts Through Demonstrations
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.25
DEMO
What is the World’s Greenest City?
Answering this question through analysis of landsat data
Perfect job for distributed cloud
Currently running on HP Labs GENICloud
But we can distribute it anywhere…
2
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.26
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.27
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.28
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.29
TransCloud Today
Approx 40 nodes at 4 sites, 10 Gb/s connectivity
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.30©2010 HP Created on xx/xx/xxxxof 222
The Instageni rack
• Designed for GENI Meso-scale deploymentEight 2012 deployments, 24 2013 deployments
• ProtoGENI and FOAM as native Aggregate Managers and Control Frameworks
Boots to ProtoGENI instance with OpenFlow switch
• Designed for wide-area PlanetLab federationPlanetLab image provided with boot
InstaGENI PlanetLab Central stood up
• Designed for expandabilityApprox 30U free in rack
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.31
3©2010 HP Created on xx/xx/xxxxof 222
Understanding the instageni rack
•Two big things:IT’S JUST ProtoGENIIt’s this thing
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.32
3©2010 HP Created on xx/xx/xxxxof 222
It’s just protogeni•Key Design criterion behind the InstaGENI rack
Reliable, proven control framework
Familiar UI to GENI experimenters and administrators
Well-understood support and administrative model
•We’re not inventing new Control Frameworks, we’re deploying Control
Frameworks and Aggregate Managers you understand and know how to useNetwork of baby ProtoGENI’s, with SDN native to the racks
•Allocation of resources with familiar toolsFlack...
•Easy distribution and proven ability to run many images
•Support model well-understood
If something goes wrong, we know how to fix it...
•PlanetLab and OpenFlow integration out-of-the-box
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.33
3©2010 HP Created on xx/xx/xxxxof 222
The “Apple-II of Clouds”
•Key insight: the Apple II wasn’t the first mass market computer because it was
innovative, but because it was packaged
•Pre Apple-II, computers were all hobbyist kit
“Much Assembly, Configuration, Software Writing, Installation required”•But the Apple-II worked out of the box
Plug it in and turn it on
And that’s what made a revolution
•Same Idea
Plug in the InstaGENI Rack
Put in the wide-area network connection
Rob will install the software and bring it up over the net
You’re on the Mesoscale!
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.34
3©2010 HP Created on xx/xx/xxxxof 222
The InstaGENI rack
•Designed for easy deployability
Power: 220V L6-20 receptacle (or two 110V)
Network: 10/100/1000 Base-T
•Pre-wired from the factory
•On the Mesoscale
Network connections pre-allocated
VLANs and connectivity pre-wired before the rack arrives
•Designed for Remote Management
HP iLO on each node
•Designed for flexible networking
4 1G NICs/node, 20 1G NICs, v2 linecards OpenFlow switch
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.35
3©2010 HP Created on xx/xx/xxxxof 222
instageni rack hardware
•Control Node for ProtoGENI Boss, ProtoGENI users, FOAM Controller, Image storage…
HP ProLiant DL 360G7, quad-core, single-socket, dual NIC (1 Gb/sec), 12GB RAM, 4TB Disk (RAID), iLO
•Five Experiment Nodes
HP ProLiant DL 360G7, six-core, dual-socket, quad NIC (1 Gb/sec), 48GB RAM, 1TB Disk, iLO
•OpenFlow Switch
HP E 5406, 20 1 Gb/s, v2 linecards
Hybrid mode
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.36
3©2010 HP Created on xx/xx/xxxxof 222
Instageni planned deployment
•GENI funding8 sites in Year 1
24 sites in Year 2
All in USA
•Other RacksUS Public Sector except Federal Government: Special HP program
• Contact Michaela Mezo, HP SLED
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.37
3©2010 HP Created on xx/xx/xxxxof 222
Instageni year 1 sites
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.38
3©2010 HP Created on xx/xx/xxxxof 222
Instageni rack diagram
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.39
3©2010 HP Created on xx/xx/xxxxof 222
Instageni rack topology
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.40
4©2010 HP Created on xx/xx/xxxxof 222
instageni photo
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.41
4©2010 HP Created on xx/xx/xxxxof 222
Instageni software architecture
ProtoGENI (Hardware as a Service, Infrastructure as a Service)
FOAM (Networks as a Service)
ProtoGENI Image
ProtoGENI Image
PlanetLabImage
InstaGENI PLC
Layer 2 and 3 connectivity
GENI L2/L3 Slice
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.42
4©2010 HP Created on xx/xx/xxxxof 222
Control Infrastructure
Control / External switch
Data Plane Switch
Control Node: Xen Hypervisor
ProtoGENI “boss”
ProtoGENI “ops”
FOAM
FlowVisor
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.43
4©2010 HP Created on xx/xx/xxxxof 222
(rE)Provisioning Nodes
ProtoGENIShared
ProtoGENIExclusive
ProtoGENIExclusive
ProtoGENIExclusive
PlanetLabShared
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.44
4©2010 HP Created on xx/xx/xxxxof 222
GENI Integration
• Will ship with full support for GENI AM (likely v3)
Updates as GENI APIs evolve
• Support for Tom Lehman’s RSpec stitching extension
• Will have local FOAM and FlowVisor instances for OpenFlow integration
• Will start by affiliating with the ProtoGENI clearinghouse
• Switch affiliation to the GENI Clearinghouse once up
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.45
4©2010 HP Created on xx/xx/xxxxof 222
Software Management
• Frequent control software updates
Rarely affects running slivers
VM snapshots to roll back failed updates
Major software changes, rather than on a set schedule
• All updates done by InstaGENI personnel
(Sites can make local modifications, but this “voids the warranty”)Testing period on Utah rack first
• Updating disk images
New version of standard images distributed nightly
Voluntary updates for exclusive-use nodes and VM images
Scheduled updates for VM host images
Security updates will be handled differently on case-by-case basis
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.46
4©2010 HP Created on xx/xx/xxxxof 222
Operations and Management
• Providing GMOC with:
Visibility into current users and slices
Health and historical data
“Kill switch” credentials for emergency shutdown
• Local administrators get the same access
• Automatic verification of slices upon setup
Local admins get mail about hardware failures
• PlanetFlow-based mapping of address/packets to slices
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.47©2010 HP Created on xx/xx/xxxxof 222
InstaGENI Sites and Network: Y1University of Utah, Princeton University, GPO, Northwestern University, Clemson University, Georgia Tech , University of Kansas , New York UniversityUniversity of VictoriaGENInet
(GENI Backbone) SL/MREN
MAGPI
NOX
GPNUEN
SOX
NYSERNET
CANARIEBCNET
MAX
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.48©2010 HP Created on xx/xx/xxxxof 222
StarLight/MREN
University of IllinoisUrbana Champaign
I2 at StarLight
ESnet at StarLight
GENInet atStarLight/MREN Facility
MRENE1200lSwitch
OpticalSwitch
ICCN/I-WIRE
StarLightE1200Switch
NLRAt
StarLightOpticalSwitch
I2ION
ESnet
OpticalSwitch
Multiple EU, Asian, SouthAmerican Sites
NDDI
DYNES
InstaGENI RackWith OF SWiCAIR
GENI OF SWiCAIR
Multiple National Regional, State Net Connections
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.49
4©2010 HP Created on xx/xx/xxxxof 222
Selected Other Interconnections
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.50
Conclusions and Future Work
Described TransCloud, a set of proposed standards to permit computation anywhere
GENICloud, the first TransCloud federate
InstaGENI, a works-out-of-the-box miniature Cloud
For the Future:
GENICloud/TransCloud is an open set of standards, and open federation
Standards very much a work in progress
“Slice-Around-The-World” Demos throughout this year
Join us!
5