Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of...
-
Upload
samuel-osborne-mckinney -
Category
Documents
-
view
215 -
download
2
Transcript of Grid Technologies Research and Development Ian Foster Argonne National Laboratory The University of...
Grid TechnologiesResearch
and Development
Ian FosterArgonne National Laboratory
The University of Chicago
Ian Foster ARGONNE CHICAGO
Credits
Globus project Co-PI: Carl Kesselman, USC Globus resarchers and developers at ANL,
USC/ISI, NCSA, and elsewhere Steve Tuecke, Randy Butler, Steve
Fitzgerald, Brian Toonen, Gregor von Laszewski, and many others
Research supported by DARPA, DOE, NSF, NASA; equipment from Cisco Systems
Ian Foster ARGONNE CHICAGO
Grid Services Architecture:An Emerging Grid Computing Framework
Archives, networks, computers, display devices, etc.;associated local services
Protocols, authentication, policy, resource management, instrumentation, discovery, etc., etc.
GridFabric
GridServices
ApplnToolkits
Applns
...
… a rich variety of applications ...
Remoteviz
toolkit
Remotecomp.toolkit
Remotedata
toolkit
Remotesensorstoolkit
Async.collab.toolkit
Ian Foster ARGONNE CHICAGO
Overview
Why Grid Services? Review of existing Grid services
Security Information/directory Resource management Data access
Our current research focus areas Grid Forum and -Grid project
Ian Foster ARGONNE CHICAGO
Creating a Usable Grid :Grid Services (“Middleware”)
Standard grid services that Provide uniform, high-level access to a wide
range of resources (including networks) Address interdomain issues of security,
policy, etc. Permit application-level management and
monitoring of end-to-end performance Middleware-level and higher-level APIs and
tools targeted at application programmers Map between application and Grid
Ian Foster ARGONNE CHICAGO
The Challenge of Heterogeneity Group
Institutions, people; policies Resources
Hardware: computers, archives, networks, ... Interface
Software, mechanisms Distance
Local, campus, metropolitan, wide area Scale
Single CPU, cluster, supercomputer, ...
Ian Foster ARGONNE CHICAGO
Grid Services Approach
Define and deploy standard Grid services that encapsulate heterogeneity Simple: Cost of joining Grid is low Noncoercive: Sites retain local control Uniform: Cost of using Grid is low
Use a Grid information service to represent structure and status of Grid elements Resource discovery Application configuration and optimization
Build Grid-enabled tools to enable applications
Ian Foster ARGONNE CHICAGO
Grid Services
Security: authentication, authorization Information: publication, delivery Resource management: reservation,
allocation, monitoring, control Data: data access, replica management,
metadata access + fault detection, executable
management, accounting, others
Ian Foster ARGONNE CHICAGO
Grid Services (1):Grid Security Infrastructure
Define uniform authentication and authorization mechanisms that allow cooperating sites to accept credentials while retaining local control
Benefit: Only one A/A infrastructure needs to be maintained at each site; enables inter-site resource sharing & interoperability
Requires Authentication/authorization standards Certification authority policies
Ian Foster ARGONNE CHICAGO
Authentication
“Grid Security Infrastructure” Single sign-on via global credential, PKI
mechanisms, mapping to local credentials Delegation No plaintext passwords Retains local control over policy
Deployed across PACI and NASA sites GSS-API binding, used by ssh, SecureCRT, gsiftp,
Globus, Condor, others GAA (Generic Authorization & Access Control)
interface provides hooks for policy
Ian Foster ARGONNE CHICAGO
Security Architecture
User ProxyCp
HostUser
Protocol 1: user proxy creation
Protocol 2: resource allocation
Protocol 3: Resource allocation from a process
Site 2
Resource Proxy
global-to-local mapping table
Local Policyand Mechanisms
Process
CrpProcessCp
Site 1
Resource Proxy
global-to-local mapping table
Local Policyand Mechanisms
Process
CrpProcessCp
Crp
Ian Foster ARGONNE CHICAGO
Grid Services (2):Grid Information Service
Effective resource use predicated on knowledge of system components Publish structure and state info, dynamic
performance info, software info, etc., etc. Selection and scheduling of resources
Resource discovery: “find me an X with property Y available at time T”
Auto-configuration: “tell me what I need to know to use A efficiently/securely/...”
Gateways to other data sources required
Ian Foster ARGONNE CHICAGO
Information ServicesTechnical Approaches
Infrastructure based on common protocols LDAP as unifying communication protocol Gateways to alternative information sources
and organization Research questions include
Unifying metadata representation How to support range of access modes Scalability of collection and publication
methods Index methods and discovery
Ian Foster ARGONNE CHICAGO
NCSANCSA
Distributed Information Services
ReferralServer
RootServers
Organization Servers
NPACI
NCSA U.Tenn
DOENASA
NCSA
ISI
mds.globus.org:389 Replicated servers
Index Server(s)
RemosSNMPNWS
Ian Foster ARGONNE CHICAGO
Ian Foster ARGONNE CHICAGO
Grid Services (3):Resource Management
Issues include: Locating and selecting resources Allocating resources Authentication, process creation Other activities required to prepare a resource
for use; monitoring, control End-to-end management/co-allocation Diverse resources: CPU, disk, network Reservation
Ian Foster ARGONNE CHICAGO
Resource Management Services
Globus Resource Allocation Manager (GRAM) Uniform interface to resource management Integration with security, policy
Co-allocation services Coordinated allocation across multiple
resources Globus Arch. for Reservation and Allocation
Network and CPU quality of service Immediate and advance reservations
Resource brokers: e.g., Condor
Ian Foster ARGONNE CHICAGO
Resource Management Architecture
“10 GFlops, EOS data,20 Mb/sec -- for 20 mins”
MetacomputingDirectoryService
GRAMGRAMGRAM
ResourceBroker
Info service:location + selection
Globus ResourceAllocation Managers
GRAM
ForkLSFEASYLLCondoretc.
“What computers?”“What speed?”“When available?”
“50 processors + storage from 10:20 to 10:40 pm”
“20 Mb/sec”
Ian Foster ARGONNE CHICAGO
Globus SecurityInfrastructure
Job Manager
GRAM client API calls to request resource allocation
and process creation.
MDS client API callsto locate resources
Authentication
Query current statusof resource
Create
RSL Library
Parse
Update MDS withresource state information
RequestAllocate &
create processes
Process
Process
Process
Monitor &control
Site boundary
GRAM Client MDS
Gatekeeper
GramReporter
Local Resource Manager
Local Resource Management
Ian Foster ARGONNE CHICAGO
Provide end-to-end Quality of Service to applications. This requires: Discovery and selection of resources Allocation of resources Advance reservation of resources
Supercomputer Workstation
Workstation
Supercomputer Workstation
Workstation
Advanced Resource Management
Ian Foster ARGONNE CHICAGO
GARA and Differentiated Services
Server
Diffserv
Resource
Manager
Client
GARA API Diffserv
Resource
Manager
Ian Foster ARGONNE CHICAGO
Scheduling Bulk Transferand High-Priority Transfers
0
2000
4000
6000
8000
10000
12000
0 50 100 150 200 250
Time (seconds)
Kb
yte
/se
c
background
foreground
competitive
Ian Foster ARGONNE CHICAGO
Integrated Policy Management
Required to control reservation and scheduling
Determine who can to what to whom Integral part of resource management
Resource application, applicationresource
Next step after authentication Need to integrate with and augment
existing approaches Access control lists, capabilities, usage
certificates
Ian Foster ARGONNE CHICAGO
Policy: Technical Approaches
Single API to alternative mechanisms Similar to security infrastructure Integration with Globus security model and
Globus resource management components Basic policy mechanism in current system
Research questions Reusable policy structures for resource
specification/management Policy aware resource discovery/scheduling
Ian Foster ARGONNE CHICAGO
Grid Services (4):Storage and I/O Services
Access to remote data (GASS) Uniform access to diverse storage management
systems Cache management High-speed, secure transport: gsiftp Integration with metadata & storage systems
Communication (Nexus, GlobusIO) Application-level interfaces to comm services Multiple methods: reliable/unreliable, IP/other,
unicast/multicast Quality of service interfaces
Ian Foster ARGONNE CHICAGO
Ian Foster ARGONNE CHICAGO
Current Technology Focus Areas Advanced resource management techniques
GARA: Globus Arch. for Resv. & Allocation High-end data-intensive applications
“Data Grid” Interfaces to commodity technologies
CoG Kit: Commodity Grid Toolkits Distance visualization
NOVA: Network Optimized Visualization Arch.
With supporting work on info/instr., policy, accounting, authentication/authorization, etc.
Ian Foster ARGONNE CHICAGO
The Grid Forumhttp://www.gridforum.org
IETF-like community forum for discussion & definition of Grid infrastructure
First two meetings (June 16-18, Oct 18-20) attracted 150 people
9 working groups established in security, information infrastructure, resource management, accounting, etc.
Next mtg: San Diego March 22-24 2000 See also European Grid Forum
www.egrid.org
Ian Foster ARGONNE CHICAGO
-Grid(“Broadband Experimental Terascale Access”)
A proposal to NSF to plan (& build) a national infrastructure for computer systems research dedicated to research of a scale that permits realistic experimentation of a scale that encourages participation by
adventurous applications groups a place for computer and application scientists to
tackle problems together Initial plan is for O(20) Linux clusters, each with
O(30) nodes, O(2 TB) disk, Gb/s networkhttp://dsl.cs.uchicago.edu/beta
Ian Foster ARGONNE CHICAGO
Summary: Where We Are
Solid technology base for security, resource management, information services
Globus v1.1 completed, with all core services complete, robust, and documented
Many tool projects are leveraging this considerable investment in infrastructure
Substantial deployment activities and application experiments
New R&D in commodity grids, resource management, distance viz, data grids
http://www.globus.org
Ian Foster ARGONNE CHICAGO
tomographic reconstruction
real-timecollection
wide-areadissemination
desktop & VR clients with shared controls
Advanced Photon Source
Case Study 1:Online Instrumentation
archival storage
DOE X-ray source grand challenge: ANL, USC/ISI, NIST, U.Chicago
Ian Foster ARGONNE CHICAGO
Case Study 2:Distributed Supercomputing
SF-Express Distributed Interactive Simulation: Caltech, USC/ISI
Starting point: SF-Express parallel simulation code
Globus mechanisms for Resource allocation Distributed startup I/O and configuration Fault detection
100K vehicles (2002 goal) using 13 computers, 1386 nodes, 9 sites
NCSAOrigin
CaltechExemplar
CEWESSP
MauiSP
Ian Foster ARGONNE CHICAGO
OVERFLOW with latency-tolerant algorithms
MPICH-G “Grid-enabled” message passing
Globus services
Security Directory Scheduling
Process mgmt Communication
ARC SGI O2000(California)
Argonne SGI O2000(Illinois)
OVERFLOW simulation: NASA Ames
Ian Foster ARGONNE CHICAGO
Case Study 3:Collaborative Engineering
CAVERNsoft: UIC Electronic Visualization Laboratory
Manipulate shared virtual space, with Simulation components Multiple flows: Control, Text,
Video, Audio, Database, Simulation, Tracking, Haptics, Rendering
Uses Globus comms: (un)reliable uni/multicast
Future: Security, QoS, allocation, reservation
Ian Foster ARGONNE CHICAGO
Case Study 4:High-Throughput Computing
Nimrod-G: Monash University
CostDeadline
AvailableMachines
Schedule many independent tasks (e.g., parameter study)
Uses Globus security, discovery, data access, scheduling
Future: Reservation, accounting, code management, etc.
Ian Foster ARGONNE CHICAGO
Problem solving environment for comp. chemistry
Globus services used for authentication, remote job submission, monitoring, and control
Future: distributed data archive, resource discovery, charging
Case Study 5:Problem Solving Environment
ECCE’: Pacific Northwest National Laboratory