© 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry &...

22
© 2006 Open Grid Forum GGF18, 13th September 200 6 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey

Transcript of © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry &...

Page 1: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

© 2006 Open Grid Forum

GGF18, 13th September 2006

OGSA Data Architecture ScenariosDave Berry & Stephen Davey

Page 2: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

2© 2006 Open Grid Forum

Contents

• Overview• Five sample scenarios

• Data Pipelining• Data Storage• Data Replication• Data Staging

(Joint OGSA Data + EMS data staging scenario)

• Personal Data Service

Page 3: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

3© 2006 Open Grid Forum

Two Informational Documents

• OGSA Data Architecture• 70+ pages• Describes the services and their interfaces• Some work remaining to describe interfaceshttps://forge.gridforum.org/sf/docman/do/downloadDocument/projects.ogsa-d-wg/docman.root.working_drafts/doc12659

• OGSA Data Scenarios• 50+ pages• Describes how the services can be combined to

address particular scenarios• Some work remaining to identify interfaceshttps://forge.gridforum.org/sf/docman/do/downloadDocument/projects.ogsa-d-wg/docman.root.working_drafts/doc13605

Page 4: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

4© 2006 Open Grid Forum

Scenarios document

• Example scenarios of a generic nature to accompany the OGSA Data Architecture document.

• Illustrates how the components and interfaces described in the OGSA Data Architecture document can be put together in a selection of typical data scenarios.

• Not a use case document generating requirements.

Page 5: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

5© 2006 Open Grid Forum

Current Scope

• Files and databases (& storage)• Not streams, sessions, …• Services and interfaces

• Storage, Access, Transfer• Replication, Caching, Federation, Metadata

catalogues• Cross-cutting themes

• Security, Policies, …• Part of the bigger OGSA picture

• E.g. Naming, Workflow, Transactions, Scheduling, Provisioning, …

Page 6: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

6© 2006 Open Grid Forum

Progress since GGF16

• More scenarios• E,g, Provenance, Grid File System

• More integration• Particularly between scenarios and

architecture document• Also raising some issues from individual

chapters to cross-cutting concerns

Page 7: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

7© 2006 Open Grid Forum

Scenarios done so far … • Data Storage – store file data in a Grid Data Service and retrieve

it later.

• Data Replication – maintain a replica of data at a different location (for availability or performance).

• Data Staging – the movement of data in preparation for the performing of operations on or with this data.

• Data Pipelining – connect the output from one service to the input of another.

Also in the scenarios document:

• Data Integration – bringing the data that you require together from disparate sources.

• Personal Data Service – the organising of an individual’s data to allow them access to it from many different locations.

• Data Discovery – discover data; register data/metadata.

• Data Provenance – the provenance of a piece of data is the process that led to that piece of data; the history of ownership of an object.

• Grid File System – provide a virtual file system in a Grid environment.

Page 8: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

8© 2006 Open Grid Forum

Data pipelining

Completed Animations

Visualisation Service

Customer2

1. Submit job. 2. Store results.

3. Transfer results.

4. Return results.

Customer1

Data TransferService

3. Transfer results.

Rendering Service

Data Access Service

Page 9: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

9© 2006 Open Grid Forum

Data Pipelining

Completed Animations

Visualisation Service

Customer2

1. Submit job. 2. Store results.

3. Transfer results.

4. Return results.

Customer1 Data Transfer

Service

3. Transfer results.

Rendering Service

Data Service

Page 10: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

10© 2006 Open Grid Forum

Bringing data online

Storage Devices

CustomerData

StorageService

TransferService

1. Make files online.

2. Read files.

2. Read files.

Nearline Storage

Online Storage

1. Make online.

1. Make online.

3. Retire to nearline.

3. Retire to nearline.

Page 11: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

11© 2006 Open Grid Forum

Data Storage – Writing a file

Storage Devices

CustomerData

StorageService

AccessService

TransferService

1. Request file space.

4a. Write file.

FileSpace

4a. Write file.

4b. Access file.

4c. Transfer file.

4b. Access file.

4c. Transfer file.

2. Get file name (SURL).

3. Get Transfer URL (TURL) or Access URL.

5. Notify of completion.

Page 12: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

12© 2006 Open Grid Forum

Data Storage – Bringing data online

Storage Devices

CustomerData

StorageService

TransferService

1. Make files online.

2. Transfer files.

2. Transfer files.

Nearline Storage

Online Storage

1. Make online.

1. Make online.

3. Retire to nearline.

3. Retire to nearline.

Page 13: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

13© 2006 Open Grid Forum

Replication

Customer1

Data TransferService

ReplicationService

Data Storage1

Data Storage2

Data Service 2

Data Service 1

1b. Publish

2. Transfer copies

6. Update

4. Access data 5. Notify

2. Transfer copies

2. Transfer copies

Registry Service

3. Find data

1a. Register data

Customer2

Page 14: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

14© 2006 Open Grid Forum

Data Replication – 1

Customer1

Data TransferService

ReplicationService

Data Storage1

Data Storage2

Data Service 2

Data Service 1

1b. Publish

2. Transfer copies

6. Update

4. Access data

5. Notify

2. Transfer copies

2. Transfer copies

Registry Service

3. Find data

1a. Register data

Customer2

Page 15: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

15© 2006 Open Grid Forum

Data Replication – 2

1. Register

Customer1 Data

TransferService

Data Storage 1

Data Storage 2

Data Service 2

Data Service 1

2. Transfer copies

6. Update

3. Find data

4. Access data

5. Notify

2. Transfer copies

2. Transfer copies

Repli-cation

Service

DataService

Replica Catalogue

Service

Customer2

Page 16: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

16© 2006 Open Grid Forum

Joint OGSA Data + EMS Scenario

• The steps of this simple scenario are as follows:1. Submit job to BES container. (JSDL contains

execution & data staging info).2. Use data transfer service to do the required data

staging.3. Run the executable on the BES container with the

input data.4. Stage result output data back to Data Service 1.5. Delete staged input data at BES container.6. Delete staged output data BES container.

Page 17: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

17© 2006 Open Grid Forum

Data Staging

Data Transfer Service

Input Data

BES Container

Input Data (copy)

OutputData

1. Submit JSDL script.

2a. Stage input data.

Data Service 1

Data Service 2

2b. Transfer input data.

4a. Stage output data.

2a.4a.

2a.4a.

Client4b. Transfer output data.

BES Container:

3. Run executable & save resulting output data.

5. Delete input data.

6. Delete output data.

OutputData (copy)

Page 18: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

18© 2006 Open Grid Forum

Data Staging

Data Transfer Service

BES Container

Input Data(copy)

OutputData

1. Submit JSDL script.

2a. Stage input data.

DataService 1

DataService 2

2b. Transfer input data.

4a. Stage output data.

2a.4a.

2a.4a.Client

4b. Transfer output data.

BES Container:

3. Run executable & save resulting output data.

5. Delete input data (copy).

6. Delete output data.

InputData

OutputData (copy)

Page 19: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

19© 2006 Open Grid Forum

Data Staging

Data Transfer Service

BES Container

Input Data(copy)

OutputData

1. Submit JSDL script.

2a. Stage input data.

DataService 1

DataService 2

2b. Transfer input data.

4a. Stage output data.

2a.4a.

2a.4a.Client

4b. Transfer output data.

BES Container:

3. Run executable & save resulting output data.

InputData

OutputData (copy)

5. Delete input data (copy).

6. Delete output data.

Page 20: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

20© 2006 Open Grid Forum

Personal Data Service

Customer 1 (site 1)

RegistryService

Data Service 1

Data Service 2

Data Service 3

Local Cache

Service 2

Local Cache

Service 1

Index

Index

Index

2. Create named space.

3. Name collection.

1. Locate data.

2. Create.4. Use named space.

Customer 1 (site 2)

6. Use named space.

7. Update.

5. Update.

Personal Data Service

Global Name Resolver Service

Page 21: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

21© 2006 Open Grid Forum

Personal Data Service

Customer 1 (site 1)

RegistryService

Data Service 1

Data Service 2

Data Service 3

Local Cache

Service 2

Local Cache

Service 1

Index

Index

Index

2. Create named space.

3. Name collection.

1. Locate data.

2. Create.4. Use named space.

Customer 1 (site 2)

6. Use named space.

7. Update.

5. Update.

Personal Data Service

Global Name Resolver Service

Page 22: © 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.

© 2006 Open Grid Forum

Questions?