https://portal.futuregrid.org What FutureGrid Can Do for You? TeraGrid’11 BOF Session, Salt Lake City, Utah, July 20th 2011


Page 1:

What FutureGrid Can Do for You?

TeraGrid’11 BOF Session

Salt Lake City, Utah, July 20th 2011

https://portal.futuregrid.org

Page 2:

Agenda

• FutureGrid from User’s Perspective, Geoffrey Fox
• How to Access FutureGrid, Gregor von Laszewski
• HPC on FutureGrid, Warren Smith
• Cloud Computing on FutureGrid, Kate Keahey
• Training, Education and Outreach, Renato Figueiredo
• Experimental Framework Support, Warren Smith
• Open discussion

Page 3:

FutureGrid BOF Overview

TG 11, Salt Lake City, July 18 2011

Geoffrey Fox
[email protected]
http://www.infomall.org
https://portal.futuregrid.org

Director, Digital Science Center, Pervasive Technology Institute
Associate Dean for Research and Graduate Studies, School of Informatics and Computing
Indiana University Bloomington

Page 4:

FutureGrid Key Concepts I

• FutureGrid supports Computer Science and Computational Science research in cloud, grid and parallel computing (HPC)
• The FutureGrid testbed provides to its users:
  – An interactive development and testing platform for middleware and application users looking at interoperability, functionality, performance or evaluation, with or without virtualization
  – A rich education and teaching platform for advanced cyberinfrastructure (computer science) classes
• FutureGrid has a complementary focus to both the Open Science Grid and the other parts of XSEDE.
• Note the significant current use in Education, Computer Science Systems and Biology/Bioinformatics

Page 5:

FutureGrid Key Concepts II

• Rather than loading images onto VMs, FutureGrid supports Cloud, Grid and Parallel computing environments by dynamically provisioning software as needed onto “bare-metal” using Moab/xCAT
  – Image library for MPI, OpenMP, MapReduce (Hadoop, Dryad, Twister), gLite, Unicore, Xen, Genesis II, ScaleMP (distributed shared memory), Nimbus, Eucalyptus, OpenNebula, OpenStack, KVM, Windows, …
• Growth comes from users depositing novel images in the library
• FutureGrid has ~4300 (will grow to ~5000) distributed cores with a dedicated network and a Spirent XGEM network fault and delay generator

[Figure: image library (Image1, Image2, …, ImageN) with the steps Choose, Load, Run]

Page 6:

FutureGrid Partners

• Indiana University (Architecture, core software, Support)
• Purdue University (HTC Hardware)
• San Diego Supercomputer Center at University of California San Diego (INCA, Monitoring)
• University of Chicago/Argonne National Labs (Nimbus)
• University of Florida (ViNe, Education and Outreach)
• University of Southern California Information Sciences (Pegasus to manage experiments)
• University of Tennessee Knoxville (Benchmarking)
• University of Texas at Austin/Texas Advanced Computing Center (Portal)
• University of Virginia (OGF, Advisory Board and allocation)
• Center for Information Services and GWT-TUD from Technische Universität Dresden (VAMPIR)
• Red institutions have FutureGrid hardware

Page 7:

FutureGrid: a Grid/Cloud/HPC Testbed

[Figure: testbed map with private and public resources connected by the FG Network. NID: Network Impairment Device]

Page 8:

Compute Hardware

Name     System type                           # CPUs        # Cores  TFLOPS  Total RAM (GB)          Secondary Storage (TB)  Site  Status
india    IBM iDataPlex                         256           1024     11      3072                    339 + 16                IU    Operational
alamo    Dell PowerEdge                        192           768      8       1152                    30                      TACC  Operational
hotel    IBM iDataPlex                         168           672      7       2016                    120                     UC    Operational
sierra   IBM iDataPlex                         168           672      7       2688                    96                      SDSC  Operational
xray     Cray XT5m                             168           672      6       1344                    339                     IU    Operational
foxtrot  IBM iDataPlex                         64            256      2       768                     24                      UF    Operational
Bravo*   Large disk & memory                   32            128      1.5     3072 (192 GB per node)  144 (12 TB per server)  IU    Early user; Aug. 1 general
Delta*   Large disk & memory with Tesla GPUs   16 (16 GPUs)  96 ?     3       1536 (192 GB per node)  96 (12 TB per server)   IU    ~Sept 15
Total                                          1064          4288     45      16 TB

* Teasers for next machine
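
The Total row of the Compute Hardware table can be cross-checked with a few lines of arithmetic. The following is a minimal sketch in Python, not part of the original slides: the figures are copied from the table, the tuple layout and names are illustrative, and Delta's core count is the tentative "96 ?" value from the slide.

```python
# Per-machine figures copied from the Compute Hardware slide:
# (name, cpus, cores, tflops, ram_gb).
# Delta's core count is marked tentative ("96 ?") on the slide.
MACHINES = [
    ("india", 256, 1024, 11.0, 3072),
    ("alamo", 192, 768, 8.0, 1152),
    ("hotel", 168, 672, 7.0, 2016),
    ("sierra", 168, 672, 7.0, 2688),
    ("xray", 168, 672, 6.0, 1344),
    ("foxtrot", 64, 256, 2.0, 768),
    ("bravo", 32, 128, 1.5, 3072),
    ("delta", 16, 96, 3.0, 1536),
]


def totals(machines):
    """Sum CPUs, cores, TFLOPS and RAM (GB) over all listed machines."""
    cpus = sum(m[1] for m in machines)
    cores = sum(m[2] for m in machines)
    tflops = sum(m[3] for m in machines)
    ram_gb = sum(m[4] for m in machines)
    return cpus, cores, tflops, ram_gb


if __name__ == "__main__":
    cpus, cores, tflops, ram_gb = totals(MACHINES)
    print(f"CPUs={cpus} cores={cores} TFLOPS={tflops} RAM={ram_gb} GB")
```

The CPU and core sums match the slide's totals (1064 and 4288); the TFLOPS and RAM sums come to 45.5 and 15648 GB, which the slide appears to round to 45 and 16 TB.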

Page 9:

Storage Hardware

System Type                Capacity (TB)                         File System  Site  Status
DDN 9550 (Data Capacitor)  339 shared with IU + 16 dedicated     Lustre       IU    Existing System
DDN 6620                   120                                   GPFS         UC    New System
SunFire x4170              96                                    ZFS          SDSC  New System
Dell MD3000                30                                    NFS          TACC  New System
IBM                        24                                    NFS          UF    New System

Page 10:

Network Impairment Device

• Spirent XGEM Network Impairments Simulator for jitter, errors, delay, etc.
• Full bidirectional 10G w/64 byte packets
• Up to 15 seconds introduced delay (in 16 ns increments)
• 0-100% introduced packet loss in 0.0001% increments
• Packet manipulation in first 2000 bytes
• Up to 16k frame size
• TCL for scripting, HTML for manual configuration
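
To make the delay and loss granularity concrete, here is a small sketch in Python that snaps a requested setting to the step sizes quoted on the slide (16 ns delay steps up to 15 s, 0.0001% loss steps). It is only arithmetic on those published numbers; the function names are hypothetical and this is not the Spirent scripting interface, which the slide says is TCL-based.

```python
from decimal import Decimal

DELAY_STEP_NS = 16                 # delay granularity from the slide
MAX_DELAY_NS = 15 * 10**9          # up to 15 seconds of introduced delay
LOSS_STEP_PCT = Decimal("0.0001")  # packet-loss granularity from the slide


def quantize_delay_ns(requested_ns: int) -> int:
    """Round a requested delay down to the nearest 16 ns step."""
    if not 0 <= requested_ns <= MAX_DELAY_NS:
        raise ValueError("delay must be between 0 and 15 seconds")
    return (requested_ns // DELAY_STEP_NS) * DELAY_STEP_NS


def loss_steps(percent: str) -> int:
    """Express a loss percentage as a whole number of 0.0001% steps."""
    value = Decimal(percent)
    if not Decimal(0) <= value <= Decimal(100):
        raise ValueError("loss must be between 0 and 100 percent")
    steps = value / LOSS_STEP_PCT
    if steps != steps.to_integral_value():
        raise ValueError("loss is finer than the 0.0001% granularity")
    return int(steps)


if __name__ == "__main__":
    print(quantize_delay_ns(100))   # a 100 ns request snaps down to 96 ns
    print(loss_steps("0.5"))        # 0.5% loss is 5000 steps
```

Passing the loss as a string keeps the decimal arithmetic exact; a binary float such as 0.0001 would not divide cleanly.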

Page 11:

FutureGrid: Inca Monitoring

Page 12:

5 Use Types for FutureGrid

• 122 approved projects as of July 17 2011
  – https://portal.futuregrid.org/projects
• Training, Education and Outreach (13)
  – Semester and short events; promising for small universities
• Interoperability test-beds (4)
  – Grids and Clouds; Standards; from Open Grid Forum (OGF)
• Domain Science applications (42)
  – Life science highlighted (21)
• Computer science (50)
  – Largest current category
• Computer Systems Evaluation (35)
  – TeraGrid (TIS, TAS, XSEDE), OSG, EGI
• Clouds are meant to need less support than other models; FutureGrid needs more user support …

Page 13:

Create a Portal Account and apply for a Project

Page 14:

https://portal.futuregrid.org/projects

Page 15:

Selected Current Education Projects

• System Programming and Cloud Computing, Fresno State: Teaches system programming and cloud computing in different computing environments
• REU: Cloud Computing, Arkansas: Offers hands-on experience with FutureGrid tools and technologies
• Workshop: A Cloud View on Computing, Indiana School of Informatics and Computing (SOIC): Boot camp on MapReduce for faculty and graduate students from underserved ADMI institutions
• Topics on Systems: Distributed Systems, Indiana SOIC: Covers core computer science distributed system curricula (for 60 students)

Page 16:

Selected Current Interoperability Projects

• SAGA, Louisiana State: Explores use of FutureGrid components for extensive portability and interoperability testing of the Simple API for Grid Applications, and for scale-up and scale-out experiments
• Unicore, Genesis, gLite, Virginia: OGF standard end points

Page 17:

Selected Current Bio Application Projects

• Metagenomics Clustering, North Texas: Analyzes metagenomic data from samples collected from patients
• Genome Assembly, Indiana SOIC: De novo assembly of genomes and metagenomes from next generation sequencing data

Page 18:

Selected Current Non-Bio Application Projects

• Physics: Higgs boson, Virginia: Matrix Element calculations representing production and decay mechanisms for Higgs and background processes
• Business Intelligence on MapReduce, Cal State - L.A.: Market basket and customer analysis designed to execute MapReduce on the Hadoop platform

Page 19:

Selected Current Computer Science Projects

• Data Transfer Throughput, Buffalo: End-to-end optimization of data transfer throughput over wide-area, high-speed networks
• Elastic Computing, Colorado: Tools and technologies to create elastic computing environments using IaaS clouds that adjust to changes in demand automatically and transparently
• The VIEW Project, Wayne State: Investigates Nimbus and Eucalyptus as cloud platforms for elastic workflow scheduling and resource provisioning

Page 20:

Selected Current Technology Projects

• ScaleMP for Gene Assembly, Indiana Pervasive Technology Institute (PTI) and Biology: Investigates distributed shared memory over 16 nodes for SOAPdenovo assembly of Daphnia genomes
• XSEDE, Virginia: Uses FutureGrid resources as a testbed for XSEDE software development
• Globus Online, Indiana PTI, Chicago: Investigates the feasibility of providing DemoGrid and its Globus services on FutureGrid IaaS clouds

Page 21:

Typical FutureGrid Performance Study

Linux, Linux on VM, Windows, Azure, Amazon; Bioinformatics

Page 22:

ADMI Cloudy View on Computing Workshop June 2011

• Jerome took two courses from IU in this area, Fall 2010 and Spring 2011, on FutureGrid
• ADMI: Association of Computer and Information Science/Engineering Departments at Minority Institutions
• Offered on FutureGrid
• 10 Faculty and Graduate Students from ADMI Universities
• The workshop provided information from cloud programming models to case studies of scientific applications on FutureGrid.
• At the conclusion of the workshop, the participants indicated that they would incorporate cloud computing into their courses and/or research.

Concept and Delivery by Jerome Mitchell: Undergraduate ECSU, Masters Kansas, PhD Indiana

Page 23:

ADMI Cloudy View on Computing Workshop Participants

Page 24:

FutureGrid Viral Growth Model

• Users apply for a project
• Users improve/develop some software in project
• This project leads to new images which are placed in the FutureGrid repository
• Project report and other web pages document use of new images
• Images are used by other users
• And so on ad infinitum …
• Please bring your nifty software up on FutureGrid!!

Page 25:

Questions?

Page 26:

Elementary FG Access Services

Gregor von Laszewski
Indiana University

Page 27:

Getting on FG is simple

Page 28:

FG Portal

• Coordination of Projects and users
  – Project management
    • Membership
    • Results
  – User Management
    • Contact Information
    • Keys, OpenID
• Coordination of Information
  – Manuals, tutorials, FAQ, Help
  – Status
    • Resources, outages, usage, …
• Coordination of the Community
  – Information exchange: Forum, comments, community pages
  – Feedback: rating, polls
• Technology has been established
• Transition technical development to TACC as much as possible so we can focus on other areas at IU
• Focus on support of additional FG processes through the Portal

Page 29:

Apply for a Portal Account

Page 30:

Apply for a Portal Account

Page 31:

Check your Account Status

• Go to:
  – Accounts - My Portal Account
• Check if the account status bar is green
  – Errors will indicate an issue or a task that requires waiting
• Since you are already here:
  – Upload a portrait
  – Check if you have other things that need updating
  – Add ssh keys if needed

Page 32:

Get access

Project Lead
1. Create a portal account
2. Create a project
3. Add project members

Project Member
1. Create a portal account
2. Ask your project lead to add you to the project

Once the project you participate in is approved
1. Apply for an HPC & Nimbus account
   • You will need an ssh key
2. Apply for a Eucalyptus Account

Page 33:

Which Services can you find?

Page 34:

Selected List of Services Offered

(will be added in future)

Page 35:

Services Offered

1. ViNe can be installed on the other resources via Nimbus

2. Access to the resource is requested through the portal

3. Pegasus available via Nimbus and Eucalyptus images

Page 36:

Questions?

Page 37:

HPC on FutureGrid

Warren Smith
Texas Advanced Computing Center (TACC)

Page 38:

HPC on FutureGrid

• HPC-style usage is supported
• Many of the clusters have an HPC partition
• Clusters well suited to HPC
  – Infiniband networks
  – Attached parallel file systems

Page 39:

Compute Hardware

Name     System type                           # CPUs        # Cores  TFLOPS  Total RAM (GB)          Secondary Storage (TB)  Site  Status
india    IBM iDataPlex                         256           1024     11      3072                    339 + 16                IU    Operational
alamo    Dell PowerEdge                        192           768      8       1152                    30                      TACC  Operational
hotel    IBM iDataPlex                         168           672      7       2016                    120                     UC    Operational
sierra   IBM iDataPlex                         168           672      7       2688                    96                      SDSC  Operational
xray     Cray XT5m                             168           672      6       1344                    339                     IU    Operational
foxtrot  IBM iDataPlex                         64            256      2       768                     24                      UF    Operational
Bravo*   Large disk & memory                   32            128      1.5     3072 (192 GB per node)  144 (12 TB per server)  IU    Early user; Aug. 1 general
Delta*   Large disk & memory with Tesla GPUs   16 (16 GPUs)  96 ?     3       1536 (192 GB per node)  96 (12 TB per server)   IU    ~Sept 15
Total                                          1064          4288     45      16 TB

* Teasers for next machine

Page 40:

HPC Access

• ssh to login nodes
  – alamo.futuregrid.org, hotel.futuregrid.org, …
  – Uses the public key you’ve uploaded to the portal
• Modules to manage your environment
• Intel and GNU compilers (others wanted?)
• MPI, OpenMP
• Torque and Moab to schedule access to compute nodes
  – Reservations?
• Scientific libraries?
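
As an illustration of the workflow on the HPC Access slide (log in, load modules, submit through Torque), the sketch below uses Python to generate a Torque batch script. The `#PBS` directives are standard Torque options; the job name, module names (`intel`, `openmpi`), node counts and the `./hello_mpi` program are assumptions made for the example, not values taken from the slides, and the modules actually installed on each cluster may differ.

```python
def make_torque_script(job_name, nodes, ppn, walltime, modules, command):
    """Build the text of a Torque (PBS) batch script.

    All argument values are caller-supplied; nothing here is specific
    to FutureGrid beyond its use of Torque and environment modules.
    """
    lines = [
        "#!/bin/bash",
        f"#PBS -N {job_name}",               # job name
        f"#PBS -l nodes={nodes}:ppn={ppn}",  # nodes and processes per node
        f"#PBS -l walltime={walltime}",      # wall-clock limit, HH:MM:SS
        "#PBS -j oe",                        # merge stdout and stderr
        "cd $PBS_O_WORKDIR",                 # start in the submission directory
    ]
    lines += [f"module load {name}" for name in modules]
    lines.append(command)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical job: 2 nodes x 8 cores running an MPI program.
    script = make_torque_script(
        job_name="hello_mpi",
        nodes=2,
        ppn=8,
        walltime="00:10:00",
        modules=["intel", "openmpi"],
        command="mpirun -np 16 ./hello_mpi",
    )
    print(script)
```

The generated text would typically be saved to a file on a login node (for example after `ssh alamo.futuregrid.org`) and submitted with `qsub`.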

Page 41:

Performance Tools

• Provide a number of tools to analyze performance
• Full support of partner tools
• Best effort support of external tools

Page 42:

Questions?

Page 43:

Cloud Computing on FutureGrid with Nimbus

Kate Keahey
[email protected]
Argonne National Laboratory, University of Chicago

Page 44:

What is Nimbus?

• Nimbus Infrastructure: enable providers to build IaaS clouds
  – Workspace Service, Cumulus
• Nimbus Platform: enable users to use IaaS clouds
  – Context Broker, Cloudinit.d, Gateway, Elastic Scaling Tools
• High-quality, extensible, customizable, open source implementation
• Enable developers to extend, experiment and customize

Page 45:

Using Nimbus Infrastructure

[Figure: a Nimbus service in front of a pool of 12 pool nodes]

Page 46:

Using Nimbus Infrastructure

[Figure: a Nimbus service in front of a pool of 12 pool nodes]

• Nimbus publishes information about each VM
• Users can find out information about their VM (e.g. what IP the VM was bound to)
• Users can interact directly with their VM in the same way they would with a physical machine.

Page 47:

Nimbus on FutureGrid

• Hotel (University of Chicago) -- Xen: 41 nodes, 328 cores
• Foxtrot (University of Florida) -- Xen: 26 nodes, 208 cores
• Sierra (SDSC) -- Xen: 18 nodes, 144 cores
• Alamo (TACC) -- KVM: 15 nodes, 120 cores
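
The per-site Nimbus figures can be summed to see the overall capacity and the cores-per-node ratio they imply. This is a minimal Python sketch using only the node and core counts listed on the slide; the dictionary layout and function name are illustrative.

```python
# (nodes, cores) per Nimbus cloud, copied from the slide.
NIMBUS_CLOUDS = {
    "hotel": (41, 328),
    "foxtrot": (26, 208),
    "sierra": (18, 144),
    "alamo": (15, 120),
}


def summarize(clouds):
    """Return total nodes, total cores, and cores per node for each site."""
    total_nodes = sum(nodes for nodes, _ in clouds.values())
    total_cores = sum(cores for _, cores in clouds.values())
    per_node = {name: cores // nodes for name, (nodes, cores) in clouds.items()}
    return total_nodes, total_cores, per_node


if __name__ == "__main__":
    nodes, cores, per_node = summarize(NIMBUS_CLOUDS)
    print(nodes, cores, per_node)
```

With the slide's numbers this gives 100 nodes and 800 cores in total, at 8 cores per node on every site.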

Page 48:

Sky Computing

• Sky Computing = a Federation of Clouds
• Approach:
  – Combine resources obtained in multiple Nimbus clouds in FutureGrid and Grid’5000
  – Combine Context Broker, ViNe, fast image deployment
  – Deployed a virtual cluster of over 1000 cores on Grid’5000 and FutureGrid, the largest ever of this type
• Grid’5000 Large Scale Deployment Challenge award
• Demonstrated at OGF 29, 06/10
• TeraGrid ’10 poster
• More at: www.isgtw.org/?pid=1002832

Work by Pierre Riteau et al, University of Rennes 1

“Sky Computing”, IEEE Internet Computing, September 2009

Page 49:

Backfill: Lower the Cost of Your Cloud

• Challenge: utilization, catch-22 of on-demand computing
• Solution: new instances
  – Backfill
• Bottom line: up to 100% utilization
• Who decides what backfill VMs run?
• Spot pricing
• Research by Paul Marshall, University of Colorado
• Open Source community contributions via Google Summer of Code (GSoC), Paolo Gomez
• Nimbus release 2.7
• Paper @ CCGrid 2011

[Figure: utilization chart with axis ticks from 16% to 94%, covering 1 March 2010 through 28 February 2011]
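
The utilization argument behind backfill can be illustrated with a toy calculation. The Python sketch below is not the Nimbus implementation: it assumes a made-up hourly on-demand load on a 16-node cloud and models backfill VMs as preemptible instances that occupy idle nodes up to a configurable limit. All names and numbers are hypothetical.

```python
def utilization(busy_per_hour, capacity):
    """Fraction of node-hours that were busy over the whole period."""
    return sum(busy_per_hour) / (capacity * len(busy_per_hour))


def with_backfill(on_demand_per_hour, capacity, max_backfill):
    """Busy nodes per hour when idle nodes are filled with backfill VMs.

    On-demand requests always win; backfill VMs only use what is left,
    up to max_backfill instances.
    """
    return [
        min(capacity, on_demand + max_backfill)
        for on_demand in on_demand_per_hour
    ]


if __name__ == "__main__":
    capacity = 16
    on_demand = [4, 10, 16, 2]  # hypothetical on-demand load, nodes per hour

    print(utilization(on_demand, capacity))                               # on-demand only
    print(utilization(with_backfill(on_demand, capacity, 8), capacity))   # limited backfill
    print(utilization(with_backfill(on_demand, capacity, 16), capacity))  # unlimited backfill
```

In this toy model, on-demand load alone leaves half the node-hours idle, while enough backfill instances push utilization to 100%, which is the "bottom line" stated on the slide.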

Page 50:

UVIC Efforts

• BaBar Experiment at SLAC in Stanford, CA
• Using clouds to simulate electron-positron collisions in their detector
• Exploring virtualization as a vehicle for data preservation
• Approach:
  – Appliance preparation and management
  – Distributed Nimbus clouds
  – Cloud Scheduler
• Running production BaBar workloads

Work by the UVIC team

Page 51:

Cloud Computing on FutureGrid

• Several Infrastructure-as-a-Service clouds
  – Nimbus, Eucalyptus, OpenStack (experimental)
• Supported patterns
  – Experimenting with middleware on top of infrastructure clouds
  – Modifying and experimenting with infrastructure clouds
  – Paradigm testing
• What would you like to work on?

Page 52:

Questions?

Page 53:

FutureGrid Training, Education and Outreach

Presented by Renato Figueiredo
[email protected]
Associate Professor, University of Florida

Page 54:

Overview

• Traditional ways of delivering hands-on training and education in parallel/distributed computing have non-trivial dependences on the environment

• Difficult to replicate same environment on different resources (e.g. HPC clusters, desktops)

• Difficult to cope with changes in the environment (e.g. software upgrades)

• Virtualization technologies remove key software dependences through a layer of indirection

Page 55:

TEO Infrastructure - guiding principles

• Fidelity: TEO activities should use full-fledged, executable software: education/training modules
  – Learn using the proper tools
• Reproducibility: Creators of content should be able to install, configure, and test their modules once, and be assured of the same functional behavior regardless of where the module is deployed
  – Incentive to invest effort in developing, testing and documenting new modules

Page 56:

TEO Infrastructure - guiding principles

• Deployability: Students and users should be able to deploy modules in a simple manner, and on a variety of resources
  – Reduce barriers to entry; avoid dependences upon a particular infrastructure
• Community-oriented: Modules should be simple to share, discover, reuse, and expand
  – Create conditions for growth

Page 57:

Towards this vision in FutureGrid

• Executable modules: virtual appliances
  – Deployable on FutureGrid resources
  – Deployable on other cloud platforms, as well as on virtualized desktops
• Community sharing: Web 2.0 portal, appliance image repositories
  – An aggregation hub for executable modules and documentation

Page 58:

Virtual appliances

• Leverage existing virtual networking software and virtual appliance images used in other projects
• Focus: integration with FutureGrid resources
  – Leverage network virtualization software
    • FutureGrid includes ViNe and GroupVPN
  – Image deployment, testing, documentation, tutorials
    • KVM/Xen, Nimbus/Eucalyptus
  – FutureGrid portal, ability for users to contribute content

Page 59:

Virtual appliance clusters

• Same image, different VPNs

[Figure: a virtual machine image (Hadoop + virtual network) is copied and instantiated with GroupVPN credentials. One instance becomes a Hadoop worker with virtual IP 10.10.1.1 (DHCP), another Hadoop worker gets virtual IP 10.10.1.2 (DHCP), and the process repeats.]
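
The figure shows each new appliance instance receiving the next virtual IP (10.10.1.1, 10.10.1.2, …) inside its VPN. The sketch below mimics that sequential assignment in Python with the standard `ipaddress` module. It is an illustration only, not how GroupVPN's DHCP works internally; the `10.10.1.0/24` range is inferred from the two addresses in the figure and the worker names are hypothetical.

```python
import ipaddress


def assign_virtual_ips(workers, network="10.10.1.0/24"):
    """Give each worker the next free host address in the virtual network.

    The default range is an assumption inferred from the addresses shown
    in the figure (10.10.1.1 and 10.10.1.2).
    """
    hosts = list(ipaddress.ip_network(network).hosts())
    if len(workers) > len(hosts):
        raise ValueError("not enough addresses in the virtual network")
    return {worker: str(addr) for worker, addr in zip(workers, hosts)}


if __name__ == "__main__":
    # Two clusters booted from the same image but joined to different
    # VPNs can reuse the same private address range without conflict.
    cluster_a = assign_virtual_ips(["hadoop-worker-1", "hadoop-worker-2"])
    cluster_b = assign_virtual_ips(["hadoop-worker-1"])
    print(cluster_a)
    print(cluster_b)
```

Because each VPN is its own address space, "same image, different VPNs" lets many independent clusters start from one appliance image.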

Page 60:

Activities: Big Data for Science

• July 26-30, 2010 NCSA Summer School Workshop, http://salsahpc.indiana.edu/tutorial
• 300+ Students (200 on sites from 10 institutes; 100 online)
• IU MapReduce and UF Virtual Appliance technologies are supported by FutureGrid.

[Figure: map of sites: University of Arkansas, Indiana University, University of California at Los Angeles, Penn State, Iowa State, Univ. Illinois at Chicago, University of Minnesota, Michigan State, Notre Dame, University of Texas at El Paso, IBM Almaden Research Center, Washington University, San Diego Supercomputer Center, University of Florida, Johns Hopkins]

(Slide courtesy of Judy Qiu)

Page 61:

Activities: Courses

• Graduate-level “Cloud computing for Data-Intensive Sciences” (Judy Qiu, Fall 2010)
  – Virtualization technologies and tools
  – Infrastructure as a service
  – Parallel programming (MPI, Hadoop)
• FutureGrid supported activities in a new semester-long class offered Fall 2010 at LSU (Gabrielle Allen, Shantenu Jha)
  – A practical and comprehensive graduate course preparing students for research involving scientific computing

Page 62:

Activities: Cloud computing class

Page 63:

Activities: ADMI Workshop

• Cloudy View on Computing workshop
  – 10 faculty members and graduate students from HBCUs interested in cloud computing.
  – Cloud programming models, case studies of scientific applications on FutureGrid.

Page 64:

Questions?

Page 65:

Experiment Management on FutureGrid

Warren Smith
Texas Advanced Computing Center (TACC)

Page 66:

Experiment Management: Goals

• Support rigorous experimentation
  – Define experiments in detail
  – Record experimental results
    • User-specified measurements (placement and granularity)
  – Share experiment information
    • Experiments can be repeated and verified
    • Variations on experiments can be performed
• Convenient execution of experiments
  – FutureGrid has distributed resources and services
  – There isn’t one true way to run an experiment

Page 67:

Experiment Management: Approach

• Provide tools to execute distributed experiments
  – Access (potentially many) resources
  – Interact with a number of services
  – Support execution of experiment plans
• Support several usage models
  – Workflow (often large, automatic, batched, unattended)
  – Interactive (attended)
  – Hybrid
• Store experiment information for later use
  – Plans (workflows or recordings) and results
  – Searchable and shareable
  – Re-run experiments or run modified versions

Page 68:

Experiment Management: Available Components

• Pegasus
  – Workflow-based experiment management
  – Builds on existing Pegasus software
    • Kickstart to record job execution and its environment
    • Details of Pegasus presented elsewhere
• TakTuk
  – Basic interactive experiment management
  – Reuse tool deployed on Grid’5000
• Host List Manager
  – Organize provisioned systems into groups, generate host lists for TakTuk
  – Set of simple command line programs
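
The Host List Manager idea (group provisioned systems, then emit a host list for a tool such as TakTuk) can be sketched in a few lines of Python. This is a hypothetical illustration, not the FutureGrid tool: the class name, method names, host names and the one-hostname-per-line output format are all assumptions.

```python
from collections import defaultdict


class HostGroups:
    """Keep provisioned hosts organized into named groups."""

    def __init__(self):
        self._groups = defaultdict(list)

    def add(self, group, host):
        """Add a host to a group, ignoring duplicates within the group."""
        if host not in self._groups[group]:
            self._groups[group].append(host)

    def host_list(self, *groups):
        """Return one hostname per line for the requested groups.

        Hosts that belong to several groups appear only once, in the
        order in which they are first encountered.
        """
        seen = []
        for group in groups:
            for host in self._groups.get(group, []):
                if host not in seen:
                    seen.append(host)
        return "\n".join(seen) + ("\n" if seen else "")


if __name__ == "__main__":
    groups = HostGroups()
    groups.add("hadoop", "vm-1.example.org")
    groups.add("hadoop", "vm-2.example.org")
    groups.add("monitoring", "vm-2.example.org")
    # The resulting text could be written to a file and handed to an
    # experiment tool that reads a list of hosts.
    print(groups.host_list("hadoop", "monitoring"), end="")
```

Keeping grouping separate from execution matches the slide's description of a set of simple command line programs that feed host lists to TakTuk.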

Page 69:

Experiment Management: Planned Components

• Messaging-based Execution and Monitoring System (MEMS)
  – More sophisticated interactive experiment management
  – Integrated message streams for commands, results, and monitoring
• Pegasus provisioning workflows
  – Include resource provisioning into workflow
• Experiment Repository
  – Store and retrieve information about experiments
  – Uses the FG Image Repository as a component.
• User Portal integration
• Convert experiment plans
  – Help users migrate from one tool to another
  – TakTuk commands <-> MEMS messages <-> Pegasus Workflows <->

Page 70:

Questions?

Page 71:

Open Discussion

Page 72:

Research on FutureGrid

• Were there ever experiments you could not run and if so what were the obstacles?
• What do you need to obtain results for your next paper?
  – Resources, repositories, middleware?
• What kind of experiment management tools do you use today and how could they be improved?
• How do you collaborate with colleagues on developing complex experiments?
• What would make you come to FutureGrid rather than use resources at your institution?

Page 73:

Education on FutureGrid

• What types of resources would help you teach a class?
  – Access to hardware? Integrated set of readymade course materials? Ease of use?
• What would help you teach your next tutorial?
• How would you like to share teaching materials with others?

Page 74:

Usage Modalities and Outreach

• What is your ideal scenario of usage?
• What would prevent you from using infrastructure such as FG?
• Where and how do you typically find information about services that enhance your mode of work?
• What concerns do you have about using FG?