The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics...

19
The ALICE short-term use The ALICE short-term use case case DataGrid WP6 Meeting DataGrid WP6 Meeting Milano, 11 Dec 2000 Piergiorgio Cerello 1 Physics Performance Report (PPR) Physics Performance Report (PPR) production starting in Feb2001 production starting in Feb2001 Opportunity for a "zero-order" Opportunity for a "zero-order" test with GRID tools test with GRID tools Get many people in touch with Get many people in touch with GLOBUS GLOBUS Be ready to make use of newly available Be ready to make use of newly available GRID tools GRID tools Better define our requirements Better define our requirements "experimentally" "experimentally" Improve & simplify the Improve & simplify the production control? production control?

Transcript of The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics...

Page 1: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

The ALICE short-term use caseThe ALICE short-term use case

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 1

Physics Performance Report (PPR) Physics Performance Report (PPR) production starting in Feb2001 production starting in Feb2001

Opportunity for a "zero-order" test with Opportunity for a "zero-order" test with GRID tools GRID tools

Get many people in touch with GLOBUSGet many people in touch with GLOBUSBe ready to make use of newly available GRID toolsBe ready to make use of newly available GRID tools

Better define our requirements "experimentally"Better define our requirements "experimentally"

Improve & simplify the production control? Improve & simplify the production control?

Page 2: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

The ALICE PPRThe ALICE PPR

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 2

Evaluate the ALICE performance with respect to the Evaluate the ALICE performance with respect to the

detection of a set of interesting physics signals detection of a set of interesting physics signals

Pb-Pb event generation Pb-Pb event generation (HIJING, driven by AliRoot) (HIJING, driven by AliRoot)Event tracking (GEANT3, driven by AliRoot)Event tracking (GEANT3, driven by AliRoot)Full detector simulation (AliRoot)Full detector simulation (AliRoot)

List of "Digits" (Raw Data)List of "Digits" (Raw Data)

Step 1: background simulationStep 1: background simulation

Page 3: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

The ALICE PPRThe ALICE PPR

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 3

Physics signal generation & digitizationPhysics signal generation & digitizationBackground superpositionBackground superpositionReconstructionReconstruction

List of Points, Tracks, Vertices List of Points, Tracks, Vertices

Step 2: signal simulation + reconstructionStep 2: signal simulation + reconstruction

Evaluation of:Evaluation of:- signal detection acceptance & - signal detection acceptance &

efficiencyefficiency- mass resolution- mass resolution

Step 3: AnalysisStep 3: Analysis

Page 4: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

SimulationSimulation

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 4

Requirements on mass storage & CPU mainly Requirements on mass storage & CPU mainly come from the simulation of "background" events come from the simulation of "background" events

About About 101044 background events needed background events needed ( (8.4 x108.4 x1044 primary tracks each) primary tracks each)

Mass storage: about 1 GB/eventMass storage: about 1 GB/event> 10 TB> 10 TB storage needed storage needed

Step 1Step 1

CPU: about 24 hours/event CPU: about 24 hours/event on PIII 600 Mhz, 512 MB RAM on PIII 600 Mhz, 512 MB RAM

Memory: about 450 MB/jobMemory: about 450 MB/job

Page 5: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

SimulationSimulation

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 5

node:/path/run#/event#/galice.root (1 GB)node:/path/run#/event#/<det>_sumDigits.root (100 MB)galice.root ----> Header, Kine, Hits <det>_sumDigits.root ----> Digits for detector <det><det> = ITS, TPC, MUON, PMD, RICH, ZDC & so on

Data output structure: 1 event/directoryData output structure: 1 event/directory

InputInput

Config.CConfig.Cgrun.C grun.C

job_id.errjob_id.errjob_id.datjob_id.dat

OutputOutput

Page 6: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Signal + ReconstructioSignal + Reconstructionn

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 6

RemarksRemarks

Step 2Step 2 Requirements on Requirements on workload management & data access workload management & data accessOrganization will depend on the total input sizeOrganization will depend on the total input size

- Need to make input data (background - Need to make input data (background

digitized events) available to any user digitized events) available to any user- Need to superimpose different - Need to superimpose different

physicsphysics signals on the same background signals on the same background

eventevent

Page 7: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Signal + ReconstructioSignal + Reconstructionn

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 7

Step 2 - Option 1Step 2 - Option 1 - All the digitized data stored at a single site- All the digitized data stored at a single site- Access to any user + data shipping to the- Access to any user + data shipping to the user's site -> processing user's site -> processing - Reconstruction output stored at the - Reconstruction output stored at the

user's siteuser's site

Remarks:Remarks: - Need for a Workload manager- Need for a Workload manager- Network performance: 1 TB x # of - Network performance: 1 TB x # of

usersusers

Page 8: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Signal + ReconstructioSignal + Reconstructionn

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 8

Step 2 - Option 2Step 2 - Option 2 - Distributed storage of digitized data- Distributed storage of digitized data

- Scheduling of signal superposition & - Scheduling of signal superposition & reconstrution jobs reconstrution jobs

- Reconstruction output stored at the - Reconstruction output stored at the user's siteuser's site

Remarks:Remarks: - Workload managed locally- Workload managed locally- Network performance: 0.1 TB x # of - Network performance: 0.1 TB x # of

signalssignals

Page 9: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Signal + ReconstructioSignal + Reconstructionn

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 9

Step 2 - Option 3Step 2 - Option 3 - Distributed storage of digitized data- Distributed storage of digitized data

- Access to any remote user + data - Access to any remote user + data shipping toshipping to the user's site the user's site

- Reconstruction output shipped to a single - Reconstruction output shipped to a single sitesite (which might be different for different signals) (which might be different for different signals)

Remarks:Remarks: - Need for a Workload manager- Need for a Workload manager- Network performance: 1 TB x # of - Network performance: 1 TB x # of

usersusers

Page 10: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

AnalysisAnalysis

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 10

Step 3Step 3 - Distributed storage of reconstructed data - Distributed storage of reconstructed data

- Access to any remote user + data - Access to any remote user + data shipping toshipping to the user's site the user's site

- Analysis- Analysis

Remarks:Remarks: - Workload managed locally- Workload managed locally- Need for a data manager- Need for a data manager- Network performance: 0.1 TB x # of - Network performance: 0.1 TB x # of

usersusers

Page 11: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

The ALICE PPR: where?The ALICE PPR: where?

DataGrid WP6 MeetingDataGrid WP6 Meeting

Milano, 11 Dec 2000 Piergiorgio Cerello 11

Sites quoted for the PPR productionSites quoted for the PPR production

No detailed sharing decided yetNo detailed sharing decided yet

CERNCERNAmsterdam - NIKHEFAmsterdam - NIKHEFCagliari - INFNCagliari - INFNCatania - INFNCatania - INFNColumbus - Ohio State UniversityColumbus - Ohio State UniversityDarmstadt - GSIDarmstadt - GSIDubna - JINRDubna - JINRLyon - IN2P3Lyon - IN2P3Padova - INFNPadova - INFNTorino - INFNTorino - INFN

Page 12: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Status: ALICE SoftwareStatus: ALICE Software

Milano, 11 Dec 2000 Piergiorgio Cerello

GCC 2.95.2 as compiler, recompiled for i686ROOT v 2.25/03 patched & compiledAliRoot v3.04pre

AliRoot is rapidly evolving towards version3.04 which will be used for the Physics Performance Report production

AliRoot framework development for the PPR production at CERN

GRID (Globus) tests between sites quoted for the PPR production

installed and working at all sites - preliminary installation kit available

DataGrid WP6 MeetingDataGrid WP6 Meeting

12

Page 13: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Status: GRID SoftwareStatus: GRID SoftwareGlobus v.1.1.3 installed in Cagliari, Catania, Columbus, Lyon, Torino

Dubna, GSI, Saclay: expected by the end of the yearMerida: work just starting on the GRID item

Milano, 11 Dec 2000 Piergiorgio Cerello 13

DataGrid WP6 MeetingDataGrid WP6 Meeting

Lower-level job manager: Cagliari, Catania, GSI: LSF Lyon: BQS Torino: PBS

Columbus, Dubna, GSI, Merida, Saclay: to be defined

Page 14: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Test activities - up to nowTest activities - up to now overhead tests using a CPU-bound program;

benchmark tests using NAS (System Division at NASA

Ames Research Center) Software

functionality tests (elementary job submission using

globus-job-run and globusrun commands);

Run AliRoot (1 full event) with the whole Alice

detector on the farms [no Grid]

test submission and execution stages on the PBS

(LSF, ...) farm driven by a local host [Grid]Milano, 11 Dec 2000 14

DataGrid WP6 MeetingDataGrid WP6 Meeting

Piergiorgio Cerello

Page 15: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Test activities - up to nowTest activities - up to now

driven by GLOBUS

submitted from Torino, executed in Catania

CPU load monitored through the WEB

data output stored remotely

stdout & stderr shipped back to submitting machine

Execution time: 24 hMilano, 11 Dec 2000 15

DataGrid WP6 MeetingDataGrid WP6 Meeting

Piergiorgio Cerello

Remote generation of a full ALICE background event

Page 16: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Test activities - plansTest activities - plans

test submission and execution stages on local farms driven by remote sites [Grid]

repeat between different partners [Grid]

Milano, 11 Dec 2000 16

DataGrid WP6 MeetingDataGrid WP6 Meeting

Piergiorgio Cerello

Repeat with input/output file transferRepeat with input/output file transfer

Page 17: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Milano, 11 Dec 2000 17

DataGrid WP6 MeetingDataGrid WP6 Meeting

Piergiorgio Cerello

Test activities - Step2Test activities - Step2

Presently planned tests refer Presently planned tests refer to the simulation stage ("Step1")to the simulation stage ("Step1")

What about reconstruction & analysis ("Step2")?What about reconstruction & analysis ("Step2")?

More demanding from the point of More demanding from the point of view of GRID toolsview of GRID tools

Need access to & shipping of more than Need access to & shipping of more than 10 TB of distributed data10 TB of distributed data

Page 18: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Milano, 11 Dec 2000 18

DataGrid WP6 MeetingDataGrid WP6 Meeting

Piergiorgio Cerello

ConclusionsConclusions

Large number of sites participating to GRID Large number of sites participating to GRID tests & quoted for the PPR production (Step1)tests & quoted for the PPR production (Step1)

The use of GLOBUS as manager for the PPRThe use of GLOBUS as manager for the PPRproduction is becoming a reliable optionproduction is becoming a reliable option

Reconstruction & Analysis (Step2) for the ALICE Reconstruction & Analysis (Step2) for the ALICE Physics Performance Report will soon require Physics Performance Report will soon require more advanced GRID tools more advanced GRID tools

Page 19: The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.

Milano, 11 Dec 2000 19

DataGrid WP6 MeetingDataGrid WP6 Meeting

Piergiorgio Cerello