Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology...
-
Upload
valerie-karin-wiggins -
Category
Documents
-
view
214 -
download
0
Transcript of Ian Foster Argonne National Lab University of Chicago Globus Project foster The Grid and Meteorology...
Ian Foster
Argonne National Lab
University of Chicago
Globus Project
www.mcs.anl.gov/~foster
The Grid and Meteorology
Meteorology and HPN Workshop, APAN 2003, Busan, August 26, 2003
Image Credit: Electronic Visualization Lab, UIC
[email protected] ARGONNE CHICAGO
Overview
The Grid: why and what– Global knowledge communities
– Resource sharing technologies
– Open standards and software The Grid and meteorology
– Opportunities
– Espresso interface
– Earth System Grid project
[email protected] ARGONNE CHICAGO
It’s Easy to ForgetHow Different 2003 is From 1993
Enormous quantities of data: Petabytes– For an increasing number of communities,
gating step is not collection but analysis Ubiquitous Internet: 100+ million hosts
– Collaboration & resource sharing the norm Ultra-high-speed networks: 10+ Gb/s
– Global optical networks Huge quantities of computing: 100+ Top/s
– Moore’s law gives us all supercomputers
[email protected] ARGONNE CHICAGO
Consequence: The Emergence ofGlobal Knowledge Communities
Teams organized around common goals– Communities: “Virtual organizations”
With diverse membership & capabilities– Heterogeneity is a strength not a weakness
And geographic and political distribution– No location/organization possesses all required
skills and resources Must adapt as a function of the situation
– Adjust membership, reallocate responsibilities, renegotiate resources
[email protected] ARGONNE CHICAGO
For Example: High Energy Physics
[email protected] ARGONNE CHICAGO
Grid TechnologiesAddress Key Requirements
Infrastructure (“middleware”) for establishing, managing, and evolving multi-organizational federations– Dynamic, autonomous, domain independent
– On-demand, ubiquitous access to computing, data, and services
Mechanisms for creating and managing workflow within such federations– New capabilities constructed dynamically and
transparently from distributed services
– Service-oriented, virtualization
[email protected] ARGONNE CHICAGO
The Grid World: Current Status Substantial number of Grid success stories
– Major projects in science
– Emerging infrastructure deployments
– Growing number of commercial deployments Open source Globus Toolkit® a de facto standard
for major protocols & services– Simple protocols & APIs for authentication,
discovery, access, etc.: infrastructure
– Large user and developer base
– Multiple commercial support providers Global Grid Forum: community & standards Emerging Open Grid Services Architecture
[email protected] ARGONNE CHICAGO
What We Can Do Today A core set of Grid capabilities are available and
distributed in good quality form, e.g.– Globus Toolkit: security, discovery, access, data movement,
etc.
– Condor: scheduling, workflow management
– Virtual Data Toolkit, NMI, EDG, etc. Deployed at moderate scales
– WorldGrid, TeraGrid, NEESgrid, DOE SG, EDG, … Usable with some hand holding, e.g.
– US-CMS event prod.: O(6) sites, 2 months
– NEESgrid: earthquake engineering experiment
[email protected] ARGONNE CHICAGO
[email protected] ARGONNE CHICAGO
NEESgrid Earthquake Engineering Collaboratory
2
Network for Earthquake Engineering Simulation
Field Equipment
Laboratory Equipment
Remote Users
Remote Users: (K-12 Faculty and Students)
High-Performance Network(s)
Instrumented Structures and Sites
Leading Edge Computation
Curated Data Repository
Laboratory Equipment (Faculty and Students)
Global Connections(fully developed
FY 2005 –FY 2014)
(Faculty, Students, Practitioners)
U.Nevada Reno
www.neesgrid.org
[email protected] ARGONNE CHICAGO
CMS Event Simulation Production Production Run on the Integration Testbed
– Simulate 1.5 million full CMS events for physics studies: ~500 sec per event on 850 MHz processor
– 2 months continuous running across 5 testbed sites
– Managed by a single person at the US-CMS Tier 1
[email protected] ARGONNE CHICAGO
Key Areas of Concern
Integration with site operational procedures– Many challenging issues
Scalability in multiple dimensions– Number of sites, resources, users, tasks
Higher-level services in multiple areas– Virtual data, policy, collaboration
Integration with end-user science tools– Science desktops
Coordination of international contributions Integration with commercial technologies
[email protected] ARGONNE CHICAGO
Overview
The Grid: why and what– Global knowledge communities
– Resource sharing technologies
– Open standards and software The Grid and meteorology
– Opportunities
– Espresso interface
– Earth System Grid project
[email protected] ARGONNE CHICAGO
The Grid and Meteorology:Opportunities
Inter-personal collaboration– E.g., Access Grid, CHEF
On-demand access to simulation models– E.g., Espresso
Access to, and integration of, data sources– E.g., Earth System Grid
Dynamic, virtual computing resources– “Metacomputing”
Integration of all of the above– Collaborative, computationally intensive analysis of
large quantities of online data
[email protected] ARGONNE CHICAGO
Expresso Modeling Interface(Michael Dvorak, John Taylor)
“Meteorology on demand”
[email protected] ARGONNE CHICAGO
Earth System Grid (ESG)
Goal: address technical obstacles to the sharing & analysis of high-volume data from advanced earth system models
[email protected] ARGONNE CHICAGO
[email protected] ARGONNE CHICAGO
ESG: Strategies
Move data a minimal amount, keep it close to point of origin when possible– Data access protocols, distributed analysis
When we must move data, do it fast and with minimum human intervention– Storage Resource Management, fast networks
Keep track of what we have, particularly what’s on deep storage– Metadata and Replica Catalogs
Harness a federation of sites, web portals– GT -> Earth System Grid -> UltraDataGrid
[email protected] ARGONNE CHICAGO
OPeNDAP-g-Transparency-Performance-Security-Authorization-(Processing)Typical Application
Data(local)
netCDF lib
Application
Data(remote)
OPeNDAP Client
Application
OPeNDAPViahttp
Big Data(remote)
ESG client
Application
ESG+
DODS
OpenDAP Server ESG Server
Distributed Application
dataOPeNDAP
ViaGrid
Distributed Data AccessProtocols
[email protected] ARGONNE CHICAGO
ESG: Metadata Services
METADATAEXTRACTION
METADATAEXTRACTION
METADATADISPLAY
METADATADISPLAY
METADATABROWSING
METADATABROWSING
METADATAQUERY
METADATAQUERY
ESG CLIENTS API & USER INTERFACES
Data &MetadataCatalog
Dublin CoreDatabase
COARDSDatabase
mirrorDublin CoreXML Files
COMMENTSXML Files
METADATA HOLDINGS
METADATAANNOTATION
METADATAANNOTATION
METADATAVALIDATION
METADATAVALIDATION
METADATA ACCESS(update, insert, delete, query)
METADATA ACCESS(update, insert, delete, query)
SERVICE TRANSLATIONLIBRARY
SERVICE TRANSLATIONLIBRARY
CORE METADATA SERVICES
METADATAAGGREGATION
METADATAAGGREGATION
METADATADISCOVERY
METADATADISCOVERY
METADATA & DATA REGISTRATION
METADATA & DATA REGISTRATION
PUBLISHINGPUBLISHING
HIGH LEVEL METADATA SERVICES
SEARCH & DISCOVERYSEARCH & DISCOVERYADMINISTRATIONADMINISTRATION BROWSING & DISPLAYBROWSING & DISPLAY
ANALYSIS & VISUALIZATIONANALYSIS & VISUALIZATION
[email protected] ARGONNE CHICAGO
XML encoding of metadata (and data) of any generic netCDF file Objects: netCDF, dimension, variable, attribute Beta version reference implementation as Java Library
(www.scd.ucar.edu/vets/luca/netcdf/extract_metadata.htm)
ESG: NcML Core Schema
netCDFnetCDF
nc:netCDFType
nc:dimension
nc:variable
nc: attribute
nc:attribute
nc:values
nc:VariableType
[email protected] ARGONNE CHICAGO
[email protected] ARGONNE CHICAGO
Collaborations & Relationships
CCSM Data Management Group OPeNDAP/DODS (multi-agency) NSF National Science Digital Libraries
Program (UCAR & Unidata THREDDS Project) U.K. e-Science and British Atmospheric Data
Center NOAA NOMADS and CEOS-grid Earth Science Portal group (multi-agency,
international)
[email protected] ARGONNE CHICAGO
For More Information
The Globus Project®– www.globus.org
Earth System Grid– www.earthsystemgrid.org
Global Grid Forum– www.ggf.org
Background information– www.mcs.anl.gov/~foster
GlobusWORLD 2004– www.globusworld.org
– Jan 20–23, San Francisco2nd Edition: November 2003