Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of...

36
Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago

Transcript of Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of...

Page 1: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Grid TechnologiesResearch

and Development

Ian FosterArgonne National Laboratory

The University of Chicago

Page 2: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Credits

Globus project Co-PI: Carl Kesselman, USC Globus resarchers and developers at ANL,

USC/ISI, NCSA, and elsewhere Steve Tuecke, Randy Butler, Steve

Fitzgerald, Brian Toonen, Gregor von Laszewski, and many others

Research supported by DARPA, DOE, NSF, NASA; equipment from Cisco Systems

Page 3: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Grid Services Architecture:An Emerging Grid Computing Framework

Archives, networks, computers, display devices, etc.;associated local services

Protocols, authentication, policy, resource management, instrumentation, discovery, etc., etc.

GridFabric

GridServices

ApplnToolkits

Applns

...

… a rich variety of applications ...

Remoteviz

toolkit

Remotecomp.toolkit

Remotedata

toolkit

Remotesensorstoolkit

Async.collab.toolkit

Page 4: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Overview

Why Grid Services? Review of existing Grid services

Security Information/directory Resource management Data access

Our current research focus areas Grid Forum and -Grid project

Page 5: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Creating a Usable Grid :Grid Services (“Middleware”)

Standard grid services that Provide uniform, high-level access to a wide

range of resources (including networks) Address interdomain issues of security,

policy, etc. Permit application-level management and

monitoring of end-to-end performance Middleware-level and higher-level APIs and

tools targeted at application programmers Map between application and Grid

Page 6: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

The Challenge of Heterogeneity Group

Institutions, people; policies Resources

Hardware: computers, archives, networks, ... Interface

Software, mechanisms Distance

Local, campus, metropolitan, wide area Scale

Single CPU, cluster, supercomputer, ...

Page 7: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Grid Services Approach

Define and deploy standard Grid services that encapsulate heterogeneity Simple: Cost of joining Grid is low Noncoercive: Sites retain local control Uniform: Cost of using Grid is low

Use a Grid information service to represent structure and status of Grid elements Resource discovery Application configuration and optimization

Build Grid-enabled tools to enable applications

Page 8: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Grid Services

Security: authentication, authorization Information: publication, delivery Resource management: reservation,

allocation, monitoring, control Data: data access, replica management,

metadata access + fault detection, executable

management, accounting, others

Page 9: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Grid Services (1):Grid Security Infrastructure

Define uniform authentication and authorization mechanisms that allow cooperating sites to accept credentials while retaining local control

Benefit: Only one A/A infrastructure needs to be maintained at each site; enables inter-site resource sharing & interoperability

Requires Authentication/authorization standards Certification authority policies

Page 10: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Authentication

“Grid Security Infrastructure” Single sign-on via global credential, PKI

mechanisms, mapping to local credentials Delegation No plaintext passwords Retains local control over policy

Deployed across PACI and NASA sites GSS-API binding, used by ssh, SecureCRT, gsiftp,

Globus, Condor, others GAA (Generic Authorization & Access Control)

interface provides hooks for policy

Page 11: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Security Architecture

User ProxyCp

HostUser

Protocol 1: user proxy creation

Protocol 2: resource allocation

Protocol 3: Resource allocation from a process

Site 2

Resource Proxy

global-to-local mapping table

Local Policyand Mechanisms

Process

CrpProcessCp

Site 1

Resource Proxy

global-to-local mapping table

Local Policyand Mechanisms

Process

CrpProcessCp

Crp

Page 12: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Grid Services (2):Grid Information Service

Effective resource use predicated on knowledge of system components Publish structure and state info, dynamic

performance info, software info, etc., etc. Selection and scheduling of resources

Resource discovery: “find me an X with property Y available at time T”

Auto-configuration: “tell me what I need to know to use A efficiently/securely/...”

Gateways to other data sources required

Page 13: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Information ServicesTechnical Approaches

Infrastructure based on common protocols LDAP as unifying communication protocol Gateways to alternative information sources

and organization Research questions include

Unifying metadata representation How to support range of access modes Scalability of collection and publication

methods Index methods and discovery

Page 14: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

NCSANCSA

Distributed Information Services

ReferralServer

RootServers

Organization Servers

NPACI

NCSA U.Tenn

DOENASA

NCSA

ISI

mds.globus.org:389 Replicated servers

Index Server(s)

RemosSNMPNWS

Page 15: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Page 16: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Grid Services (3):Resource Management

Issues include: Locating and selecting resources Allocating resources Authentication, process creation Other activities required to prepare a resource

for use; monitoring, control End-to-end management/co-allocation Diverse resources: CPU, disk, network Reservation

Page 17: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Resource Management Services

Globus Resource Allocation Manager (GRAM) Uniform interface to resource management Integration with security, policy

Co-allocation services Coordinated allocation across multiple

resources Globus Arch. for Reservation and Allocation

Network and CPU quality of service Immediate and advance reservations

Resource brokers: e.g., Condor

Page 18: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Resource Management Architecture

“10 GFlops, EOS data,20 Mb/sec -- for 20 mins”

MetacomputingDirectoryService

GRAMGRAMGRAM

ResourceBroker

Info service:location + selection

Globus ResourceAllocation Managers

GRAM

ForkLSFEASYLLCondoretc.

“What computers?”“What speed?”“When available?”

“50 processors + storage from 10:20 to 10:40 pm”

“20 Mb/sec”

Page 19: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Globus SecurityInfrastructure

Job Manager

GRAM client API calls to request resource allocation

and process creation.

MDS client API callsto locate resources

Authentication

Query current statusof resource

Create

RSL Library

Parse

Update MDS withresource state information

RequestAllocate &

create processes

Process

Process

Process

Monitor &control

Site boundary

GRAM Client MDS

Gatekeeper

GramReporter

Local Resource Manager

Local Resource Management

Page 20: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Provide end-to-end Quality of Service to applications. This requires: Discovery and selection of resources Allocation of resources Advance reservation of resources

Supercomputer Workstation

Workstation

Supercomputer Workstation

Workstation

Advanced Resource Management

Page 21: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

GARA and Differentiated Services

Server

Diffserv

Resource

Manager

Client

GARA API Diffserv

Resource

Manager

Page 22: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Scheduling Bulk Transferand High-Priority Transfers

0

2000

4000

6000

8000

10000

12000

0 50 100 150 200 250

Time (seconds)

Kb

yte

/se

c

background

foreground

competitive

Page 23: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Integrated Policy Management

Required to control reservation and scheduling

Determine who can to what to whom Integral part of resource management

Resource application, applicationresource

Next step after authentication Need to integrate with and augment

existing approaches Access control lists, capabilities, usage

certificates

Page 24: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Policy: Technical Approaches

Single API to alternative mechanisms Similar to security infrastructure Integration with Globus security model and

Globus resource management components Basic policy mechanism in current system

Research questions Reusable policy structures for resource

specification/management Policy aware resource discovery/scheduling

Page 25: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Grid Services (4):Storage and I/O Services

Access to remote data (GASS) Uniform access to diverse storage management

systems Cache management High-speed, secure transport: gsiftp Integration with metadata & storage systems

Communication (Nexus, GlobusIO) Application-level interfaces to comm services Multiple methods: reliable/unreliable, IP/other,

unicast/multicast Quality of service interfaces

Page 26: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Page 27: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Current Technology Focus Areas Advanced resource management techniques

GARA: Globus Arch. for Resv. & Allocation High-end data-intensive applications

“Data Grid” Interfaces to commodity technologies

CoG Kit: Commodity Grid Toolkits Distance visualization

NOVA: Network Optimized Visualization Arch.

With supporting work on info/instr., policy, accounting, authentication/authorization, etc.

Page 28: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

The Grid Forumhttp://www.gridforum.org

IETF-like community forum for discussion & definition of Grid infrastructure

First two meetings (June 16-18, Oct 18-20) attracted 150 people

9 working groups established in security, information infrastructure, resource management, accounting, etc.

Next mtg: San Diego March 22-24 2000 See also European Grid Forum

www.egrid.org

Page 29: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

-Grid(“Broadband Experimental Terascale Access”)

A proposal to NSF to plan (& build) a national infrastructure for computer systems research dedicated to research of a scale that permits realistic experimentation of a scale that encourages participation by

adventurous applications groups a place for computer and application scientists to

tackle problems together Initial plan is for O(20) Linux clusters, each with

O(30) nodes, O(2 TB) disk, Gb/s networkhttp://dsl.cs.uchicago.edu/beta

Page 30: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Summary: Where We Are

Solid technology base for security, resource management, information services

Globus v1.1 completed, with all core services complete, robust, and documented

Many tool projects are leveraging this considerable investment in infrastructure

Substantial deployment activities and application experiments

New R&D in commodity grids, resource management, distance viz, data grids

http://www.globus.org

Page 31: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

tomographic reconstruction

real-timecollection

wide-areadissemination

desktop & VR clients with shared controls

Advanced Photon Source

Case Study 1:Online Instrumentation

archival storage

DOE X-ray source grand challenge: ANL, USC/ISI, NIST, U.Chicago

Page 32: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Case Study 2:Distributed Supercomputing

SF-Express Distributed Interactive Simulation: Caltech, USC/ISI

Starting point: SF-Express parallel simulation code

Globus mechanisms for Resource allocation Distributed startup I/O and configuration Fault detection

100K vehicles (2002 goal) using 13 computers, 1386 nodes, 9 sites

NCSAOrigin

CaltechExemplar

CEWESSP

MauiSP

Page 33: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

OVERFLOW with latency-tolerant algorithms

MPICH-G “Grid-enabled” message passing

Globus services

Security Directory Scheduling

Process mgmt Communication

ARC SGI O2000(California)

Argonne SGI O2000(Illinois)

OVERFLOW simulation: NASA Ames

Page 34: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Case Study 3:Collaborative Engineering

CAVERNsoft: UIC Electronic Visualization Laboratory

Manipulate shared virtual space, with Simulation components Multiple flows: Control, Text,

Video, Audio, Database, Simulation, Tracking, Haptics, Rendering

Uses Globus comms: (un)reliable uni/multicast

Future: Security, QoS, allocation, reservation

Page 35: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Case Study 4:High-Throughput Computing

Nimrod-G: Monash University

CostDeadline

AvailableMachines

Schedule many independent tasks (e.g., parameter study)

Uses Globus security, discovery, data access, scheduling

Future: Reservation, accounting, code management, etc.

Page 36: Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of Chicago.

Ian Foster ARGONNE CHICAGO

Problem solving environment for comp. chemistry

Globus services used for authentication, remote job submission, monitoring, and control

Future: distributed data archive, resource discovery, charging

Case Study 5:Problem Solving Environment

ECCE’: Pacific Northwest National Laboratory