ESE Grid Prototypes at the Goddard Space Flight Center

Ken McDonald, NASA/GSFC, April 7, 2004



EOSDIS Context

[Diagram: EOSDIS end-to-end data flow]
• Stages: Data Acquisition; Flight Operations, Data Capture, Initial Processing, Backup Archive; Data Transport to DAACs/SIPSs; Science Data Processing, Info Mgmt, Data Archive, & Distribution; Distribution, Access, Interoperability, Reuse
• Space and ground elements: EOS Spacecraft; Tracking & Data Relay Satellite (TDRS); White Sands Complex (WSC); EOS Polar Ground Stations; Data Processing & Mission Control
• Processing and archive elements: Distributed Active Archive Centers; Instrument Teams and SIPSs
• Network: Internet
• Users and partners: Research Users; Education Users; Value-Added Providers; Interagency Data Centers; Int’l Partners & Data Centers; Media; Public; ESIP2/3’s; RESACs; RACs
• Flow labels: (Search, order, distribution); (Distribution)


DAAC Alliance Data Centers

• SEDAC: Human Interactions in Global Change
• GSFC: Upper Atmosphere, Atmospheric Dynamics, Global Biosphere
• LaRC (ASDC): Radiation Budget, Clouds, Aerosols, Tropospheric Chemistry
• ORNL: Biogeochemical Dynamics, EOS Land Validation
• ASF: SAR Products, Sea Ice, Polar Processes
• NSIDC: Cryosphere, Polar Processes
• EDC (LP DAAC): Land Processes & Features
• JPL (PODAAC): Ocean Circulation, Air-Sea Interactions
• GHRC: Global Hydrology


Science Investigator-led Processing Systems (SIPSs)

• GSFC: GLAS, MODIS, OMI
• LaRC: CERES, SAGE III
• NCAR, U of Col.: HIRDLS, MOPITT, SORCE
• JPL: MLS, TES
• GHRC: AMSR-E
• San Diego: ACRIM


ESE Data Center Locations

• A total of 68 widely distributed data centers (some of which are at the same location).
• ESE has recently updated its peer-reviewed data and information producing centers through the Research, Education and Applications Solutions Network Cooperative Agreement Notice (REASoN CAN) for development of next-generation architectures.

[Map: ESE data center locations. Legend: Distributed Active Archive Centers (DAACs); REASoN Projects. Several map sites carry counts of (2) or (3).]


Grid: Selected Activities Within NASA/ESE

Grid Prototypes
• Several projects of the Earth Science Enterprise (ESE) are sponsoring or managing prototypes that use Grid technologies
• Examples:
  • AIST - Integration of OGC and Grid Technologies for Earth Science Modeling and Applications - George Mason U
  • LDCM/GSFC - Advanced Data Grid - GSFC Science Data Systems Branch
  • ESDIS - Remote Data Storage - EDS/Raytheon

Grid Collaboration and Information Exchange
• NASA is leading a Grid Task Team of the CEOS Working Group on Information Systems and Services (WGISS)


GMU Grid Prototype

OGC and Grid Technology Integration for Earth Science Modeling and Applications
Liping Di, George Mason University

Description and Objectives
• Integrate Grid and OGC technologies to make Grid-managed data accessible through NWGISS OGC servers and allow users to focus on science rather than issues with data receipt, format, and data manipulation
• Leverages the OGC-compliant NASA HDF-EOS Web GIS Software Suite (NWGISS), CEOS Grid testbed, Globus, NASA Information Power Grid (IPG), and DOE’s Earth System Grid (ESG)

Schedule and Deliverables
• Testbed running (8/03)
• Demonstrate Grid-secured NWGISS WCS (4/04)
• Demonstrate WCS access to data pool grid (11/04)
• Demonstrate Grid-enabled WRS, WMS, and WCS for accessing non-virtual data (4/05)
• Demonstrate Grid-enabled NWGISS access to data pools & ESG (11/05)
• Demonstrate NWGISS WCS virtual data access (4/06)

Approach
• Phase 1: Testbed and initial integration (set up development environment, preliminary integration design, implementation of WCS access to Grid-managed data)
• Phase 2: Data naming and location transparency (investigate use of Data Grid & Replica Services)
• Phase 3: Virtual dataset research & development

Co-I’s/Partners
• William Johnston, ARC/Lawrence Berkeley Lab
• Dean Williams, Lawrence Livermore National Lab

Application/Mission
• Facilitate access to EOSDIS data by Earth science modeling and applications communities

Science Themes
• Atmospheric Composition
• Carbon Cycle
• Climate
• Weather
• Water & Energy Cycle
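As a rough illustration of the WCS access that the GMU milestones refer to, the sketch below assembles a key-value-pair GetCoverage request using the parameter names defined in the OGC WCS 1.0.0 specification. The endpoint URL, the coverage name, and the choice of WCS version are assumptions made for the example; the slides do not say which WCS version or server address NWGISS used, and the Grid security layer is not shown.

```python
from urllib.parse import urlencode


def build_getcoverage_url(endpoint, coverage, bbox, width, height,
                          crs="EPSG:4326", fmt="GeoTIFF"):
    """Build a WCS 1.0.0 GetCoverage URL (KVP encoding).

    endpoint and coverage are hypothetical placeholders supplied by the caller.
    bbox is (min_x, min_y, max_x, max_y) in the units of crs.
    """
    params = {
        "SERVICE": "WCS",
        "VERSION": "1.0.0",
        "REQUEST": "GetCoverage",
        "COVERAGE": coverage,
        "CRS": crs,
        "BBOX": ",".join(str(v) for v in bbox),
        "WIDTH": width,    # output grid size in pixels
        "HEIGHT": height,
        "FORMAT": fmt,
    }
    return f"{endpoint}?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical server and coverage name, for illustration only.
    url = build_getcoverage_url(
        "http://example.org/wcs",
        "sample_coverage",
        (-80.0, 35.0, -75.0, 40.0),
        512,
        512,
    )
    print(url)
```

The client only names a coverage, a region, and an output format, which is the sense in which such an interface lets users avoid dealing with file location and native format themselves.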


The Advanced Data Grid (LDCM) Prototype
Sponsored by the Landsat Data Continuity Mission and the ISD (580)

Description and Objectives
• The objective of the LDCM ADG prototype is to assess the applicability and effectiveness of a data grid to serve as the infrastructure for research scientists to generate virtual Landsat-like data products.
• Grid technology serves as a key enabler in the creation of scientific Virtual Organizations, promotes a flexible and scalable infrastructure, facilitates the exchange of data, and maximizes the use of available resources.

Schedule and Deliverables
• Prototype start (12/03)
• Demo of Phase 1 grid infrastructure (6/04)
• Demo of Phase 1 capability (12/04)
• Demo of Phase 2 grid infrastructure (3/05)
• Demo of Phase 2 capability (6/05)

Approach
• Phase 1: Provide and demonstrate a basic grid infrastructure that enables a simple data fusion algorithm to access remote heterogeneous instrument data at multiple GSFC labs and EDC.
• Phase 2: Enable the data fusion algorithm to obtain datasets, execute, and store the results on any resource within the Virtual Organization (GSFC labs, EDC, ARC IPG).

Co-I’s/Partners
• EDC, NASA ARC/IPG, GSFC 920 Scientists

Application/Mission
• Allow scientists at resource-poor sites access to remote resource-rich sites, enabling greater scientific research.
• Serve as a key enabler in the creation of scientific Virtual Organizations and, by extension, facilities.
• Maximize utility of existing resources, limiting the expense of building new facilities.

Science Themes
• Virtual scientific data products
• Remote instrument data access
• Collaborative computing for the science community
• Resource sharing and data discovery

[Diagram: LDCM Virtual Organization. Labels: GSFC B23 Lab (Storage, CPU); GSFC B32 (Science User_1); GSFC B32 (Science User_2); EDC (Data Storage); ARC/IPG (HPC/CPU); Network; Milestone C1; Milestone C2]

POC: Jeff Lubelczyk, 586; Gail McConaughy, 586; Beth Weinstein, 586


Remote Data Storage (RDS) Prototype

Description and Objectives
• Demonstrate high volume remote data backup and recovery capability over an IP WAN for portions of the Goddard Earth Sciences DAAC (GDAAC) data holdings
• Provide heterogeneous storage management functionality capable of managing a storage hierarchy including transient and persistent on-line storage resources via a uniform interface
• Implement a preliminary data grid infrastructure making the data holdings accessible to external “data grid” users
• Assess enabling technologies in the context of NASA Earth Science mission needs

Schedule and Deliverables
• Release 1 prototype (SGI server & RAID) installed at NASA IV&V Facility in Fairmont, WV in 12/02, receiving MODIS direct broadcast data daily
• Release 2 prototype (integrated SGI server, RAID, CXFS software, EMC Centera storage, Nirvana SRB software) successfully demonstrated at EMC laboratory in Columbia, MD in June 2003
• High speed OC-12 link between NASA IV&V Facility and GDAAC operational in October 2003
• Delivery of Release 3 prototype to NASA IV&V Facility and GDAAC in January 2004

Approach
• Develop a testbed to prototype the use of advanced technology to provide a backup and restore capability for non-reproducible EOSDIS data.
• RDS prototypes 1 through 3 will culminate in an operational prototype system to be deployed at the NASA IV&V Facility and the GDAAC in early 2004

Co-I’s/Partners
• Goddard DAAC
• NASA IV&V Facility, Fairmont, WV

[Diagram: RDS architecture linking GDAAC and RDS. Labels: GDAAC; RDS; Science Users; Backup/Recovery Service; IP WAN (OC-12), 50% capacity; SRB Client; Archival/Retrieval Service; Anon. FTP I/F; Data Pool I/F; ECS I/F; SRB Server; Persistent On-line Storage (Centera); Transient On-line Storage (CXFS); MCAT; ECS; Data Pool DB; Data Pool Disk; SRB I/F; DAAC Operator; Recovery Request; Operator]

POC: Chris Bock, ESDIS
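To give a feel for what the "OC-12, 50% capacity" label on the RDS diagram could mean for backup volume, here is a back-of-the-envelope sketch. It assumes that "50% capacity" means half of the OC-12 line rate (622.08 Mbit/s) is available to RDS traffic, and it ignores SONET and TCP/IP overhead, so the result is an upper bound rather than a measured figure. The 100 TB holding in the usage section is a made-up number used only to show the calculation.

```python
# Rough transfer-capacity estimate for an OC-12 link at partial utilization.
# Assumption: "50% capacity" means half of the line rate is usable by RDS.

OC12_LINE_RATE_BPS = 622.08e6   # SONET OC-12 line rate, bits per second
SECONDS_PER_DAY = 86_400


def daily_volume_bytes(line_rate_bps: float, utilization: float) -> float:
    """Upper bound on bytes moved in one day at the given share of line rate."""
    if not 0.0 <= utilization <= 1.0:
        raise ValueError("utilization must be between 0 and 1")
    return line_rate_bps * utilization / 8 * SECONDS_PER_DAY


def days_to_transfer(total_bytes: float, line_rate_bps: float,
                     utilization: float) -> float:
    """Days needed to move total_bytes at the given share of line rate."""
    return total_bytes / daily_volume_bytes(line_rate_bps, utilization)


if __name__ == "__main__":
    per_day = daily_volume_bytes(OC12_LINE_RATE_BPS, 0.5)
    print(f"{per_day / 1e12:.2f} TB/day")
    # Hypothetical 100 TB holding, chosen only to illustrate the calculation.
    print(f"{days_to_transfer(100e12, OC12_LINE_RATE_BPS, 0.5):.1f} days")
```

Under these assumptions the link could carry at most about 3.36 TB per day; real throughput would be lower once protocol overhead and storage speeds are accounted for.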


CEOS WGISS

Committee on Earth Observation Satellites (CEOS)
• Members are international agencies that operate Earth observing satellites and affiliated science organizations.
• Purpose is to promote cooperation in the acquisition, exchange and utilization of Earth observation data.

CEOS Working Group on Information Systems and Services (WGISS)
• One of several working groups established to focus on particular CEOS areas of interest.
• Provides an effective forum for CEOS partners to cooperate in applying advanced data system technology to meet CEOS goals and objectives.


WGISS - Working Group Structure

[Organization chart: WGISS, with a Technology and Services Subgroup and a Projects and Applications Subgroup]

Current Tasks:
• Developing Countries CD-ROM
• CEOS Information Infrastructure
• WGISS Test Environment

Current Tasks:
• International Directory Network
• CEOS Interoperable Catalog System
• Data Services
• Networks
• Archive
• EOGEO Workshop
• GRID

Current Tasks:
• Global Datasets
• Global Mapping Book
• WTF CEOP
• WTF Core Sites (WGCV)


CEOS Grid Task Team

Background
• Initiated about two years ago.
• Grid workshop held in conjunction with regular WGISS meeting.
• Invited Grid experts interacted with WGISS members.
• Multiple WGISS agencies initiated the development of Grid technology prototypes.
• WGISS also interested in potential for incorporating Grid technology into its initiatives.

Approach
• Form a task team of interested WGISS members.
• Share lessons among prototype teams.
• Provide engineering support to teams.
• Set up a CEOS Grid Testbed.
• Interact with broader Grid technology community.
• Report back to full WGISS membership.


CEOS Grid Task Team Coordination

[Coordination chart]
• Technology and Services SG: Wyn Cudlip
• Network Task Team (TT): Jeff Smith
• CEOS Grid TT: Yonsook Enloe
• Engineering: Allan Doyle
• Existing Grids: Information Power Grid; Earth Systems Grid; EU Data Grid and Data TAG
• Connector labels: negotiated relationships; network support

Projects and contacts
• USGS Data Delivery: Stu Doescher
• NOAA/NCDC NOMADS: Glenn Rutledge
• ESA Data Integration: Luigi Fusco
• GSFC Advanced Data Grid: Jeff Lubelczyk / Gail McConaughy
• GMU OGC/Grid Integration: Liping Di
• UAH Scientific Data Mining: Sara Graves
• Dutch Space GREASE Project: Ruud Grim
• China Remote Sensing Ground Station (RSGS): Dingsheng Liu


CEOS Grid Task Team

Accomplishments - Focus on implementing infrastructure
• Teams installed Globus Toolkit, CEOS Grid common baselined components
• Prototyped monitoring tool (to all computers on CEOS Grid)
• Prototyping resource directory (CEOS Grid computers, applications and processes, CPU processing, data collections)
• Published user documentation for participants
  • CEOS Grid Implementers Guide
  • Firewall Document
• Using CEOS version of IPG certificates for security
• Two way interaction with Open Source community
  • Globus Metadata Catalog Service implementation of spatial and temporal search.

Plans
• Facilitate the development of agency applications.
• Plan multi-agency demonstration(s).

Additional Information
• Public Task Team Web site http://lennier.gsfc.nasa.gov/grid
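The accomplishments list mentions spatial and temporal search in a metadata catalog. As a minimal sketch of what such a search involves, the example below filters catalog records by bounding-box overlap and time-interval overlap. It is not the Globus Metadata Catalog Service API; the `Granule` record, the `search` function, and the sample catalog are all hypothetical, and bounding boxes that cross the antimeridian are not handled.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Granule:
    """Hypothetical catalog record: name, lon/lat bounding box, time range."""
    name: str
    west: float
    south: float
    east: float
    north: float
    start: datetime
    end: datetime


def overlaps_bbox(g, west, south, east, north):
    """True if the granule's bounding box intersects the query box."""
    return (g.west <= east and g.east >= west
            and g.south <= north and g.north >= south)


def overlaps_time(g, start, end):
    """True if the granule's time range intersects the query interval."""
    return g.start <= end and g.end >= start


def search(catalog, bbox, start, end):
    """Return names of granules matching both the spatial and temporal query."""
    west, south, east, north = bbox
    return [g.name for g in catalog
            if overlaps_bbox(g, west, south, east, north)
            and overlaps_time(g, start, end)]


# Made-up records, for illustration only.
SAMPLE_CATALOG = [
    Granule("granule_a", -82, 33, -74, 41,
            datetime(2004, 4, 5, 0), datetime(2004, 4, 5, 15)),
    Granule("granule_b", 100, -10, 110, 0,
            datetime(2004, 4, 5, 0), datetime(2004, 4, 5, 15)),
    Granule("granule_c", -82, 33, -74, 41,
            datetime(2003, 1, 1), datetime(2003, 1, 2)),
]

if __name__ == "__main__":
    hits = search(SAMPLE_CATALOG, (-80, 35, -75, 40),
                  datetime(2004, 4, 1), datetime(2004, 4, 7))
    print(hits)
```

A production catalog would push these predicates into indexed database queries rather than scanning a list, but the overlap tests themselves are the same.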


NASA Goddard IT Pathfinder Working Group

Purpose: revitalize Goddard’s Information Technology infrastructure and prepare for new paradigms in computing and information systems
• Center-wide participation from IT professionals
  • Atmospheric Modeling & Geosciences
  • High Performance Computing / Mass Storage
  • Earth Science Data System WGs & EOSDIS

Activity to Date
• Kickoff June 03
• Goddard new Geoscience Network (GEON) collaborator
• Goddard IRAD: establish GSFC-Scripps Lambda Network (via NLR) & demonstrate (e.g. ESMF/atmospheric modeling application)
• OptIPuter workshop Jan 04

On-going Analysis Activities
• Goddard Semantic Web Interest Group (science applications)
• “Data web” for Earth Science data analysis
• NSF cyber-infrastructure paradigm
  • Grid computing (Goddard, Ames, JPL, and Langley)