Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology...

24
Ian Foster Argonne National Lab University of Chicago Globus Project www.mcs.anl.gov/~foster The Grid and Meteorology Meteorology and HPN Workshop, APAN 2003, Busan, August 26, 2003 Image Credit: Electronic Visualization Lab, UIC

Transcript of Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology...

Page 1: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

Ian Foster

Argonne National Lab

University of Chicago

Globus Project

www.mcs.anl.gov/~foster

The Grid and Meteorology

Meteorology and HPN Workshop, APAN 2003, Busan, August 26, 2003

Image Credit: Electronic Visualization Lab, UIC

Page 2: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Overview

The Grid: why and what– Global knowledge communities

– Resource sharing technologies

– Open standards and software The Grid and meteorology

– Opportunities

– Espresso interface

– Earth System Grid project

Page 3: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

It’s Easy to ForgetHow Different 2003 is From 1993

Enormous quantities of data: Petabytes– For an increasing number of communities,

gating step is not collection but analysis Ubiquitous Internet: 100+ million hosts

– Collaboration & resource sharing the norm Ultra-high-speed networks: 10+ Gb/s

– Global optical networks Huge quantities of computing: 100+ Top/s

– Moore’s law gives us all supercomputers

Page 4: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Consequence: The Emergence ofGlobal Knowledge Communities

Teams organized around common goals– Communities: “Virtual organizations”

With diverse membership & capabilities– Heterogeneity is a strength not a weakness

And geographic and political distribution– No location/organization possesses all required

skills and resources Must adapt as a function of the situation

– Adjust membership, reallocate responsibilities, renegotiate resources

Page 5: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

For Example: High Energy Physics

Page 6: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Grid TechnologiesAddress Key Requirements

Infrastructure (“middleware”) for establishing, managing, and evolving multi-organizational federations– Dynamic, autonomous, domain independent

– On-demand, ubiquitous access to computing, data, and services

Mechanisms for creating and managing workflow within such federations– New capabilities constructed dynamically and

transparently from distributed services

– Service-oriented, virtualization

Page 7: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

The Grid World: Current Status Substantial number of Grid success stories

– Major projects in science

– Emerging infrastructure deployments

– Growing number of commercial deployments Open source Globus Toolkit® a de facto standard

for major protocols & services– Simple protocols & APIs for authentication,

discovery, access, etc.: infrastructure

– Large user and developer base

– Multiple commercial support providers Global Grid Forum: community & standards Emerging Open Grid Services Architecture

Page 8: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

What We Can Do Today A core set of Grid capabilities are available and

distributed in good quality form, e.g.– Globus Toolkit: security, discovery, access, data movement,

etc.

– Condor: scheduling, workflow management

– Virtual Data Toolkit, NMI, EDG, etc. Deployed at moderate scales

– WorldGrid, TeraGrid, NEESgrid, DOE SG, EDG, … Usable with some hand holding, e.g.

– US-CMS event prod.: O(6) sites, 2 months

– NEESgrid: earthquake engineering experiment

Page 9: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Page 10: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

NEESgrid Earthquake Engineering Collaboratory

2

Network for Earthquake Engineering Simulation

Field Equipment

Laboratory Equipment

Remote Users

Remote Users: (K-12 Faculty and Students)

High-Performance Network(s)

Instrumented Structures and Sites

Leading Edge Computation

Curated Data Repository

Laboratory Equipment (Faculty and Students)

Global Connections(fully developed

FY 2005 –FY 2014)

(Faculty, Students, Practitioners)

U.Nevada Reno

www.neesgrid.org

Page 11: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

CMS Event Simulation Production Production Run on the Integration Testbed

– Simulate 1.5 million full CMS events for physics studies: ~500 sec per event on 850 MHz processor

– 2 months continuous running across 5 testbed sites

– Managed by a single person at the US-CMS Tier 1

Page 12: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Key Areas of Concern

Integration with site operational procedures– Many challenging issues

Scalability in multiple dimensions– Number of sites, resources, users, tasks

Higher-level services in multiple areas– Virtual data, policy, collaboration

Integration with end-user science tools– Science desktops

Coordination of international contributions Integration with commercial technologies

Page 13: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Overview

The Grid: why and what– Global knowledge communities

– Resource sharing technologies

– Open standards and software The Grid and meteorology

– Opportunities

– Espresso interface

– Earth System Grid project

Page 14: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

The Grid and Meteorology:Opportunities

Inter-personal collaboration– E.g., Access Grid, CHEF

On-demand access to simulation models– E.g., Espresso

Access to, and integration of, data sources– E.g., Earth System Grid

Dynamic, virtual computing resources– “Metacomputing”

Integration of all of the above– Collaborative, computationally intensive analysis of

large quantities of online data

Page 15: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Expresso Modeling Interface(Michael Dvorak, John Taylor)

“Meteorology on demand”

Page 16: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Earth System Grid (ESG)

Goal: address technical obstacles to the sharing & analysis of high-volume data from advanced earth system models

Page 17: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Page 18: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

ESG: Strategies

Move data a minimal amount, keep it close to point of origin when possible– Data access protocols, distributed analysis

When we must move data, do it fast and with minimum human intervention– Storage Resource Management, fast networks

Keep track of what we have, particularly what’s on deep storage– Metadata and Replica Catalogs

Harness a federation of sites, web portals– GT -> Earth System Grid -> UltraDataGrid

Page 19: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

OPeNDAP-g-Transparency-Performance-Security-Authorization-(Processing)Typical Application

Data(local)

netCDF lib

Application

Data(remote)

OPeNDAP Client

Application

OPeNDAPViahttp

Big Data(remote)

ESG client

Application

ESG+

DODS

OpenDAP Server ESG Server

Distributed Application

dataOPeNDAP

ViaGrid

Distributed Data AccessProtocols

Page 20: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

ESG: Metadata Services

METADATAEXTRACTION

METADATAEXTRACTION

METADATADISPLAY

METADATADISPLAY

METADATABROWSING

METADATABROWSING

METADATAQUERY

METADATAQUERY

ESG CLIENTS API & USER INTERFACES

Data &MetadataCatalog

Dublin CoreDatabase

COARDSDatabase

mirrorDublin CoreXML Files

COMMENTSXML Files

METADATA HOLDINGS

METADATAANNOTATION

METADATAANNOTATION

METADATAVALIDATION

METADATAVALIDATION

METADATA ACCESS(update, insert, delete, query)

METADATA ACCESS(update, insert, delete, query)

SERVICE TRANSLATIONLIBRARY

SERVICE TRANSLATIONLIBRARY

CORE METADATA SERVICES

METADATAAGGREGATION

METADATAAGGREGATION

METADATADISCOVERY

METADATADISCOVERY

METADATA & DATA REGISTRATION

METADATA & DATA REGISTRATION

PUBLISHINGPUBLISHING

HIGH LEVEL METADATA SERVICES

SEARCH & DISCOVERYSEARCH & DISCOVERYADMINISTRATIONADMINISTRATION BROWSING & DISPLAYBROWSING & DISPLAY

ANALYSIS & VISUALIZATIONANALYSIS & VISUALIZATION

Page 21: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

XML encoding of metadata (and data) of any generic netCDF file Objects: netCDF, dimension, variable, attribute Beta version reference implementation as Java Library

(www.scd.ucar.edu/vets/luca/netcdf/extract_metadata.htm)

ESG: NcML Core Schema

netCDFnetCDF

nc:netCDFType

nc:dimension

nc:variable

nc: attribute

nc:attribute

nc:values

nc:VariableType

Page 22: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Page 23: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

Collaborations & Relationships

CCSM Data Management Group OPeNDAP/DODS (multi-agency) NSF National Science Digital Libraries

Program (UCAR & Unidata THREDDS Project) U.K. e-Science and British Atmospheric Data

Center NOAA NOMADS and CEOS-grid Earth Science Portal group (multi-agency,

international)

Page 24: Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology Meteorology and HPN Workshop, APAN.

[email protected] ARGONNE CHICAGO

For More Information

The Globus Project®– www.globus.org

Earth System Grid– www.earthsystemgrid.org

Global Grid Forum– www.ggf.org

Background information– www.mcs.anl.gov/~foster

GlobusWORLD 2004– www.globusworld.org

– Jan 20–23, San Francisco2nd Edition: November 2003