09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion:...

17
September 9, 2002 Grid Technology Panel Patrick Dreher 09/02 ID099-1 Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid Patrick Dreher Research Scientist & Associate Director MIT Laboratory for Nuclear Science Common Solutions Group Meeting Seattle, Washington September 18, 2002

Transcript of 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion:...

Page 1: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-1

Technical Panel Discussion:Progress in Developing a Web Services Data Analysis Grid

Patrick DreherResearch Scientist & Associate Director

MIT Laboratory for Nuclear Science

Common Solutions Group MeetingSeattle, WashingtonSeptember 18, 2002

Page 2: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-2

General Goals for Production Grid

• End-to-end integration and deployment using existing and prototype grid services

• Seemless deployment and extensions of grid middleware and extensions between application

• 24x7 service and access to grid• High availability of grid in production mode under

stressful loads

Page 3: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-3

Web Services Data Analysis Grid • Web services data analysis grid is being written,

developed and tested jointly by Thomas Jefferson National Lab (Chip Watson, Walt Akers, Jie Chen, Ying Chen) and MIT (myself)

• Web Services approach chosen because there is wide spread distribution of browsers and powerful tools and applications from multiple vendors

• Traditional 3 tier architecture– 1st tier is the browser– 2nd tier is the Web services– 3rd tier applications, databases, servers, etc.

Page 4: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-4

A Three Tier Web Services Architecture

Web Browser

XML to HTML servlet

Web Service

Application

Web Service

Web Service

Grid Service

Local Backend Services

(batch, file, etc.)

Web Server (Portal)

Authenticated connections

Remote Web Server

Web Service

Storage systemGrid resources,

e.g. Condor

Batch system

Page 5: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-5

Distributed Analysis Environment

• Describe and publish data set (GFN, tag)

• Track and move data (sets) on requests

• Authenticate and authorize

• Submit, control, monitor access to resources

Page 6: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-6

Certificates

Page 7: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-7

Page 8: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-8

Current certificate procedure

• Build strong authentication by combining1. Personal certificate used to create a proxy certificate

2. Permanent certificate w/o the private key

3. CA authority certificate (DOE Grids or JLab)

• Present this strong authentication triplicate to the Web server

• JLab server issues temporary 24 hour certificate• Web server then grants SSL connection• User registration for multi-site access is next step

Page 9: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-9

Authorizations

Page 10: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-10

Authorization Issues• Local vs. grid-wide account management

– Closely tied to local site acceptable use policies – Can either have local or group accounts

• Not every site allows group or pooled accounts

– Determine whether sites be willing (able) to delegate account management to a trusted CA?

– Grid-mapfiles • Needs to be generated and distributed

• Local site needs to be able to access and download a grid-mapfile

Page 11: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-11

Replication and Transfers

• Prototype Web Services Data Analysis Grid is operational at the present time

• Current design allows for automated monitoring of repository directories and disk caches (currently not activated)

Page 12: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-12

Operation of the Replica Catalog

Page 13: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-13

DEMO

Page 14: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-14

Lattice Portal

• Technologies Employed– Apache web server– Tomcat servlet engine (periodically queries

cluster batch systems and produces an XML data structure)

– 2nd servlet applies a style sheet to this XML document

• DEMO (cluster stats and job submission for authorized users)

Page 15: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-15

Theoretical Physics Computational Clusters and

Grid Interconnects• Fermilab and JLAB/MIT designing and operating

clusters for lattice QCD since 1999• Currently 160 Pentium III processors (80 duals) at

Fermilab, 88 ALPHA processors (singles, duals, quads) at JLAB and MIT; connected with Myrinet

• Newly installed 128 node system at JLAB and purchasing 256 node system at FNAL, using dual Pentium IV and Myrinet 2000 interconnect

• Preliminary schedule:

• ~1/4 TFlops sustained in 2002• ~10 TFlops sustained in 2005

Page 16: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-16

Development Milestones

Deploy prototype grid environment in 2002

Improve the robustness, reliability and maintainability of grid software through early use in production application environments.

Integration, coordination, and inter-networking with functional working systems

Integrate with international groups

Page 17: 09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.

September 9, 2002 Grid Technology PanelPatrick Dreher 09/02 ID099-17