POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

13
POOL Status and Plans POOL Status and Plans POOL Status and Plans Dirk Düllmann Dirk Düllmann IT-DB & LCG-POOL IT-DB & LCG-POOL Application Area Meeting Application Area Meeting 10 10 th th March 2004 March 2004

Transcript of POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

Page 1: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans

POOL Status and PlansPOOL Status and Plans

Dirk DüllmannDirk DüllmannIT-DB & LCG-POOLIT-DB & LCG-POOL

Application Area MeetingApplication Area Meeting1010thth March 2004 March 2004

Page 2: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 2

POOL Release V1.6POOL Release V1.6

• Has been produced yesterday..Has been produced yesterday..• Distribution Kits have been produced by SPIDistribution Kits have been produced by SPI• Main new features in V1.6Main new features in V1.6

– Composite Catalogs– New documentation structure

• http://pool.cern.ch/relnotes.htmlhttp://pool.cern.ch/relnotes.html– Using SEAL_1_3_3, Root 3.10.02, edg-rls-client

2.2.1 – Now build binaries for Linux and Windows

• rh73_gcc32, rh73_gcc32_dbg, rh73_gcc32_gcov, win32_vc71, win32_vc71_dbg

• Updated documentation will appear on our Updated documentation will appear on our web shortlyweb shortly

Page 3: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 3

Next StepsNext Steps

• Will move to new IT-CVS serviceWill move to new IT-CVS service– Scheduled for tomorrow 10:00– Will take the opportunity to clean and rearrange

the POOL repository

Page 4: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 4

2004 Work Plan2004 Work Plan

• Draft document has been discussed in Draft document has been discussed in POOL and the Architect ForumPOOL and the Architect Forum– http://pool.cern.ch/POOL_Program_of_Work_20040307.doc

– Based on WP work plans and experiment priorities

– No big objections received so far

• Hope to finalize the plan in the next week Hope to finalize the plan in the next week and present it to PEB and CS2and present it to PEB and CS2– Thanks to all experiments for their concrete and

detailed input– Significant overlap between the different

experiment requests

Page 5: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 5

Main Focus in 2004Main Focus in 2004

• Stabilise POOL s/w productsStabilise POOL s/w products– Focus on performance improvements rather than large

functionality changes– In line with the experiments plans for the data challenges

• Help to simplify the integration into experiment Help to simplify the integration into experiment frameworksframeworks– Tight coupling between POOL and experiment development

and production teams– Automated schema loading, usability tools, documentation

improvements

• Establish the new ConditionsDB Establish the new ConditionsDB – After a initial consolidation round

• Achieve POOL independence of the RDBMS backend Achieve POOL independence of the RDBMS backend – And extend the set of supported RDBMS systems

Page 6: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 6

POOL usability toolsPOOL usability tools

• Received requests for tools which would simplify Received requests for tools which would simplify the daily development work of pool usersthe daily development work of pool users– Create or recreate a POOL catalog from a set of

interrelated POOL files (Q1)– Provide command line tools for consistent file

manipulation of POOL files (Q2)• Eg copy, move and rename in the local file system

together with the associated POOL catalog changes

• Provide eg in the POOL contrib area a central Provide eg in the POOL contrib area a central repository of scripts developed by an experimentrepository of scripts developed by an experiment– To share the experience in the deployment of POOL (Q1)– May later support and distribute some of these tools if

there is sufficient agreement

Page 7: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 7

Storage ManagerStorage Manager

• Optimisation required in several areasOptimisation required in several areas– Client side resource usage - memory, CPU, file handles (Q2)– Mass storage handling (minimise costly requests) (Q3)– “Transparent” double to float mapping (Q3)

• Automated schema loading (Q2)Automated schema loading (Q2)– Based on SEAL service– In cooperation with ROOT team to allow late integration of data

types for already open files• Bug fixes - more complex cases Bug fixes - more complex cases

• Eg std containers with user defined allocators which define local data – aka CLHEP Matrix (Q1)

• RDBMS backend based on the RDBMS Abstraction LayerRDBMS backend based on the RDBMS Abstraction Layer– Storage of simple data structures into a RDBMS via the same

interface as for objects stored on the streaming layer – Two step plan:

• First allow for objects which can trivially be mapped to SQL tables (Q2/Q3)

• Possibly later an extension to more complex C++ objects (Q1 ‘05?)

Page 8: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 8

File CatalogFile Catalog

• Significant development completed already (Q1)!Significant development completed already (Q1)!– Support for LCG-2 (V1.5)– Support for Composite Catalogs (V1.6)

• File Catalog as model for handling and exchanging data File Catalog as model for handling and exchanging data could be a prototype for other (very similar) meta data could be a prototype for other (very similar) meta data catalogs (Q2/Q3)catalogs (Q2/Q3)– Collection catalog and Collection entries– Condition Folder catalog and Condition Data

• Cataloguing, extraction based on meta data, publishing are Cataloguing, extraction based on meta data, publishing are all very similar all very similar – Even the component implementation could be factorised out and

shared– Performance of XML as exchange format for larger data amounts

needs to be evaluated• Would like to start an activity to propose a common Would like to start an activity to propose a common

approach at least for the persistency framework projectsapproach at least for the persistency framework projects– Closely coupled to a possible emerging LCG activity of

deployment of heterogeneous databases

Page 9: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 9

Collections & ARDACollections & ARDA

• Joined Work Package with ARDAJoined Work Package with ARDA– Still some uncertainties concerning the ARDA side of the work

plan– Will continue work to address the outstanding issues on the

POOL side– POOL has asked experiments for principal contacts in this area

• Collection cataloguing, extraction and publishing toolsCollection cataloguing, extraction and publishing tools– Can we achieve a single baseline model for distributed meta

data catalogs?• File Catalog, Collection Catalog, Conditions Folder Catalog• One basic mechanism of data exchange across RDBMS

vendor boundaries based on the POOL relation abstraction layer?

• Separation of logical and physical collection identification Separation of logical and physical collection identification – Introduce a Collection (Fileset?) catalog – First implementation could simply be based on the existing File

Catalog components, but as a separate service• Integrate POOL collections with ARDA provided services Integrate POOL collections with ARDA provided services

– Migrate to ARDA provided catalog and meta data(?) services

Page 10: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 10

POOL Infrastructure POOL Infrastructure

• SEAL Component Model (Q2/Q3?)SEAL Component Model (Q2/Q3?)– Once picked up by the experiments

• Parallel build and test machinery (Q2)Parallel build and test machinery (Q2)– Including effective build on windows

• Automated data format regression testing (Q1)Automated data format regression testing (Q1)– Incorporation of experiment defined test suites into the

POOL release test procedure (Q2)

• Evaluate Appworks and possibly migrate to it (Q1)Evaluate Appworks and possibly migrate to it (Q1)– Depending on the future of SCRAM support

• Complete move to QmTest (Q1)Complete move to QmTest (Q1)• Next PortsNext Ports

– ICC 8, ECC, MacOS(?) – Schedule to be review in one of the next AF

Page 11: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 11

More Internal Review More Internal Review ResponsesResponses

• Restructure the POOL documentation (Q1)Restructure the POOL documentation (Q1)– All documentation formats derived from a shared set of

DocBook text modules– Minimises the overlap and possible inconsistencies

between design and user documents.

• Provide ROOT plugins for Ref<T> and POOL Provide ROOT plugins for Ref<T> and POOL CollectionsCollections– Allows to use POOL functionality inside ROOT as an

interactive analysis environment (Q2)

• Schema Evolution (Q2/Q3)Schema Evolution (Q2/Q3)– Will start from the ROOT support and try to confirm that

POOL does not restrict the ROOT functionality significantly– Will need to address also schema evolution on the RDBMS

layer

Page 12: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 12

Conditions DatabaseConditions Database

Still preliminary and to be discussed with all Still preliminary and to be discussed with all Project ParticipantsProject Participants

• First IOV interface and implementation First IOV interface and implementation (Q2)(Q2)

• Connection to POOL Data (Q2)Connection to POOL Data (Q2)• Review of extensions to the basic interface Review of extensions to the basic interface

(Q3)(Q3)• Condition Folder Catalog review (Q3)Condition Folder Catalog review (Q3)

Page 13: POOL Status and Plans Dirk Düllmann IT-DB & LCG-POOL Application Area Meeting 10 th March 2004.

POOL Status and Plans D.Duellmann 13

SummarySummary

• POOL V1.6 has been released POOL V1.6 has been released – Thanks to experiments for the quick turnaround in

validating our internal releases

• POOL work plan is being finalised POOL work plan is being finalised – Effort estimation of tasks still underway – Minor shifts in the proposed delivery dates may still occur

until PEB/SC2 presentation

• Proposed Focus of the YearProposed Focus of the Year– Consolidation and Optimisation– RDBMS vendor independence– Common POOL model for distributed, heterogeneous

meta data catalogs (incl. data exchange across vendor boundaries)

– ConditionsDB production release and integration with POOL