OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st...

28
OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August 26, 2003 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technologies Harry E. Gruber Professor, Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD

Transcript of OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st...

Page 1: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

OptIPuter and ENDfusion-Eliminating Bandwidth as an Obstacle in

Data Intensive Sciences

21st NORDUnet Networking Conference

Reykjavik, Iceland

August 26, 2003

Dr. Larry Smarr

Director, California Institute for Telecommunications and Information Technologies

Harry E. Gruber Professor,

Dept. of Computer Science and Engineering

Jacobs School of Engineering, UCSD

Page 2: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

AbstractThe OptIPuter is a radical distributed visualization, teleimmersion, data mining, and computing architecture. The National Science Foundation recently awarded a six-campus research consortium a five-year large Information Technology Research grant to construct working prototypes of the OptIPuter on campus, regional, national, and international scales. The OptIPuter project is driven by applications leadership from two scientific communities, the US National NSF's EarthScope and the National Institutes of Health's Biomedical Imaging Research Network (BIRN), both of which are beginning to produce a flood of large 3D data objects (e.g., 3D brain images or a SAR terrain datasets) which are stored in distributed federated data repositories. The project is led by the California Institute for Telecommunications and Information Technology and by the Electronic Visualization Laboratory at the University of Illinois at Chicago. Essentially, the OptIPuter is a "virtual metacomputer" in which the individual "processors" are widely distributed Linux PC clusters; the "backplane" is provided by Internet Protocol (IP) delivered over multiple dedicated 1-10 Gbps optical wavelengths; and, the "mass storage systems" are large distributed scientific data repositories, fed by scientific instruments as OptIPuter peripheral devices, operated in near real-time. Collaboration, visualization, and teleimmersion tools are provided on tiled mono or stereo super-high definition screens directly connected to the OptIPuter to enable distributed analysis and decision making. A new proposal called "ENDfusion: End-to-End Data Fusion in a National-Scale Urban Emergency Collaboratory" adapts and extends some of the OptIPuter concepts to support collaboratories for high resolution geographic information systems and earthquake response. 

Page 3: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Where is Telecommunications Research Performed?A Historic Shift

Source: Bob Lucky, Telcordia/SAIC

U.S. Industry

Non-U.S. Universities

U.S. Universities

Percent Of The Papers Published IEEE Transactions On Communications

70%

85%

Page 4: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Cal-(IT)2– Research on the Future of the Internet

www.calit2.net

220 UC San Diego & UC Irvine FacultyWorking in Multidisciplinary Teams

With Students, Industry, and the Community

The California Institute for Telecommunications and Information Technology

Page 5: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Application Barrier One:Shared Internet Limits Speed of File Transfers

• NASA Earth Observation System– Over 100,000 Users

– Two Million Data Products Delivered per Year

• Measured Throughput for Data Transfers– 10-40 Mbps (May 2003) Mainly Over Abilene

– Interactive Megabyte Possible

Page 6: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Application Barrier Two:Gigabyte Science Data Objects

• Hundred Million Pixel 2-D Images– Microscopy or Telescopes

– Remote Sensing

• GigaZone 3-D Objects– Supercomputer Simulations

– Seismic or Medical Imaging

• Interactive Analysis and Visualization of Such Data Objects is Impossible Over Shared Internet

Page 7: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Very Large Biological Montage Images

• 2-Photon Laser Confocal Microscope– High Speed -

Ultrawide Field – On-line Capability

• Image Sizes Exceed 16x Highest Resolution Monitors– ~150 Million Pixels!

Source: David Lee, NCMIR, UCSD

IBM 9M Pixels

Page 8: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

TeraFLOP Computing Enables High Resolution of 3D Flow Details

1024x1024x1024-A Billion Zone Computation of Compressible

Turbulence

This Simulation Run on Los Alamos ASCI

SGI Origin Array

U. Minn.SGI Visual Supercomputer Renders Images

VorticityLCSE, Univ of Minnesota

Page 9: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Removing User Networking Barriers:Global Intellectual Convergence

• SERENATE is a Strategic Study into the Evolution of European Research and Education Networking Over the Next 5-10 Years

• Some Findings– On A Multi-year Timescale, Move Towards Optical Switching

– Evolution Towards Heterogeneous NREN Networks (and GÉANT), with General Internet Use (Many-to-many) via Classical Packet Switching and:

– Specialised High-Speed Traffic (Few-to-Few) via Optical Paths? OptIPuter Project

– Even End-to-End Paths?? ENDfusion Project

Source: David Williams, CERN

Page 10: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

From SuperComputers to SuperNetworks--Changing the Grid Design Point

• The TeraGrid is Optimized for Computing– 1024 IA-64 Nodes Linux Cluster– Assume 1 GigE per Node = 1 Terabit/s I/O– Grid Optical Connection 4x10Gig Lambdas = 40 Gigabit/s– Optical Connections are Only 4% Bisection Bandwidth

• The OptIPuter is Optimized for Bandwidth– 32 IA-64 Node Linux Cluster– Assume 1 GigE per Processor = 32 gigabit/s I/O– Grid Optical Connection 4x10GigE = 40 Gigabit/s– Optical Connections are Over 100% Bisection Bandwidth

Page 11: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

OptIPuter LambdaGrid Global Laboratory

• NSF Large Information Technology Research Grant– $13.5 Million Over Five Years

• UCSD and UIC Lead Campuses—Larry Smarr PI– Co-PIs: Tom DeFanti, Jason Leigh, Phil Papadopoulos, Mark Ellisman– Project Manager, Maxine Brown

• Partnering Campuses– USC, UCI, SDSU, NU, Texas A&M, Univ. Amsterdam

• Industrial Partners: – IBM, Sun, Telcordia/SAIC, Chiaro Networks, Calient, Glimmerglass

• Driven by Large NSF and NIH Applications

www.optiputer.net

Lake Tahoe

BrainTissue

NSFEarthScope

NIHBiomedicalInformaticsResearchNetwork

Page 12: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Science Drivers for a Radical New Net-Centric Architecture—The OptIPuter

• Data Intensive Neuro & Earth Sciences– Each Data Object is 3D and Gigabytes– Data in Distributed Federated Repository– Want to Interactively Analyze and Visualize– Need End-to-End Deterministic Networks

• OptIPuter Science Requirements – Computing PC Clusters– Communications Dedicated Lambdas– Data Large Lambda Attached Storage – Visualization Viz Clusters– Global Collaboration Multi-Scale Latencies

Goal: Punch a Hole Through the Internet Between

Researcher’s Lab and Remote Data!

Page 13: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

What is the Best Application Usageof Routed vs. Switched Lambdas?

• OptIPuter Evaluating Both– Routers

– Chiaro– Juniper– Cisco– Force10

– Optical Switches– Calient– Glimmerglass

• UCSD Focusing on Routing Initially• UIC Focusing on Switching initially• Next Year Merge into Mixed Optical Fabric

Page 14: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

½ Mile

SIO

SDSC

CRCA

Phys. Sci -Keck

SOM

JSOE Preuss

6th College

SDSCAnnex

Node M

Earth Sciences

SDSC

Medicine

Engineering High School

To CENIC

Collocation

Source: Phil Papadopoulos, SDSC; Greg Hidley, Cal-(IT)2

Funded by NSF OptIPuter Grant

and UCSD

ChiaroEstara

JuniperT320

The UCSD OptIPuter DeploymentOptIPuter

Campus-Scale Experimental Network

Page 15: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

OptIPuterMetro-Scale Experimental Network

• Linked UCSD and SDSU– Dedication March 4, 2002

Linking Control Rooms

Cox, Panoram,SAIC, SGI, IBM,

TeraBurst NetworksSD Telecom Council

UCSD

SDSU44 Miles

of Cox Fiber

Page 16: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Proposed OptIPuterState-Scale Experimental Network

UCSD SDSU

USCUCI

NASAAmes?

Source: CENIC

Page 17: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Proposed OptIPuter Dedicated Optical Fiber National-Scale Experimental Network

Source: John Silvester, Dave Reese, Tom West-CENIC

“National Lambda Rail”

Chicago OptIPuter

StarlightNU, UIC

SoCalOptIPuter

USC, UCI UCSD, SDSU

Page 18: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

OptIPuter Uses TransLight Lambdas to Connect Current and Potential International-Scale Partners

Source: Tom DeFanti,

UIC

TheOptIPuter

WasBorn

Global!

StarlightNU, UIC

Univ. of AmsterdamNetherLight

CurrentOptIPuter

Page 19: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

OptIPuter Open Source LambdaGrid Software for Distributed Virtual Computers

Source: Andrew Chien, UCSDOptIPuter Software Architect

Page 20: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

OptIPuter Protocol Experimentson TeraGrid Lambdas

• SDSC To NCSA—2x10Gbps Lambdas– 30 Itanium Cluster Nodes at Each End– Streamed 2 Gigabytes of Data

– 100 Times, Each At A Rate Of 1 Gb• Quanta’s Reliable Blast UDP Protocol (RBUDP)

– Quanta Is An Extensive Toolkit For Data Sharing www.evl.uic.edu/cavern/quanta

– Throughput of 18.6Gbps / 20Gbs • Original User Transfer Rate

– TCP/IP 10 Mbps Over 10 Gb Lambda– Paul Woodward, Fluid Dynamics Simulation Data– 1000x Improvement

Source: Jason Leigh, UIC EVLwww.evl.uic.edu/cavern/rg/20030817_he

Page 21: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Invisible Nodes, Elements,

Hierarchical,Centrally Controlled,

Fairly Static

Traditional Provider Services:Invisible, Static Resources,

Centralized Management

OptIPuter: Distributed Device, Dynamic Services,

Visible & Accessible Resources, Integrated As Required By Apps

Limited Functionality,Flexibility

Unlimited Functionality,Flexibility

Source: Joe Mambretti, Oliver Yu, George Clapp

LambdaGrid Control Plane Paradigm Shift

Page 22: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Extending to IPv6Amsterdam to Japan Using Native IPv6 Network

JuniperM40

SDSC

6tap/StarLight

TransPACAPAN OC3

R

Osaka University

UHVEM(Osaka, Japan)

oc3

TokyoXP

ATMSW

ATMSW

SDSCV6 services

Gb Ether

ESnet

Native IPv6oc12 peer

SURFnetR

IGRID 2002(Amsterdam, Sept 2002)

SURFnet

JuniperT640

Supercomputing 2002Baltimore, Nov 2002

VBNSWIDE networkIPv6 via JGN

Gb Ether

oc192

Abilene

NCMIR(San Diego)

Source: UCSD’s Tom Hutton, SDSC & David Lee, NCMIR

Last WeekPartially On Lambdas!

Page 23: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

17x10G

b

17x10Gb

17x10Gb

17x1

0Gb

17x10Gb

17x10Gb17x10Gb

CalOffice of

Emergency Services

UCI

SDSU

San DiegoDowntown

USGeological

Survey

ACCESSDC

UIC UC/ANL

NCSAFacility

UCSDJacobs& SIO

StarLight@ NU

ENDfusion: End-to-End Networks for Data Fusion in a National-Scale Urban Emergency Collaboratory

Source: Maxine Brown, EVL, UIC

Width Of The Rainbows = Amount of Bandwidth Managed As Lambdas

Blue Lines Are Conventional Networks

Page 24: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Real-Time Earthquake AlertsVery Important in Iceland!

http://hraun.vedur.is

Page 25: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Planning for Optically Linking Crisis Management Control Rooms in California

California Office of Emergency Services, Sacramento, CA

Page 26: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

Crisis Management Will RequireUltra-High Resolution Remote Imaging

• US Geological Survey EROS Center Data:– 133 Urban Areas:

– One Foot Resolution – 100,000 x 100,000 Pixels for 20 sq.mile Urban Area– 10 Billion Pixels/Image!

• JuxtaView (UIC EVL) for PerspecTile LCD Wall– Digital Montage Viewer

– 6000x3000 Pixel Resolution

• Display Is Powered By – 16 PCs with Graphics Cards

– 2 Gigabit Networking per PC

Source: Jason Leigh, EVL, UIC; USGS EROS

Page 27: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

ENDfusion Virtual 3D High Resolution CampusWith High Resolution Stereo Imagery

SDSUCampus Center4 cm

ResolutionInfrared

Source: Laurie Cooper, SDSU Eric Frost, Dawn Wise, SDSU-OptIPuter

Each Square Meter Will Have a Unique IPv6 Internet Address

Four IPv6 Addresses

Page 28: OptIPuter and ENDfusion- Eliminating Bandwidth as an Obstacle in Data Intensive Sciences 21 st NORDUnet Networking Conference Reykjavik, Iceland August.

A High Definition Access Grid as Imagined In 2007 In A HiPerCollab

Source: Jason Leigh, EVL, UIC

Augmented Reality

SuperHD StreamingVideo

100-MegapixelTiled Display

ENDfusion Project