The LHC Computing Grid A World-Wide Computer Centre
description
Transcript of The LHC Computing Grid A World-Wide Computer Centre
Frédéric Hemmer, CERN, IT Department
1
The LHC Computing Grid – April 2007
The LHC Computing Grid
A World-Wide Computer
Centre
Frédéric Hemmer
Deputy IT Department Head
April 13, 2007
Pour Stéphanie Benz
Journaliste – l’Expansion
Frédéric Hemmer, CERN, IT Department
2
The LHC Computing Grid – April 2007
The accelerator generates 40 million particle collisions (events) every second at the centre of each of the four experiments’ detectors
The LHC Accelerator
Frédéric Hemmer, CERN, IT Department
3
The LHC Computing Grid – April 2007
LHC DATA
This is reduced by online computers that filter out a few hundred “good” events per sec.
Which are recorded on disk and magnetic tapeat 100-1,000 MegaBytes/sec
~15 PetaBytes per year for all four experiments
Frédéric Hemmer, CERN, IT Department
4
The LHC Computing Grid – April 2007
simulation
reconstruction
analysis
interactivephysicsanalysis
batchphysicsanalysis
batchphysicsanalysis
detector
event summary data
rawdata
eventreprocessing
eventreprocessing
eventsimulation
eventsimulation
analysis objects(extracted by physics topic)
Data Handling and Computation for
Physics Analysisevent filter(selection &
reconstruction)
event filter(selection &
reconstruction)
processeddata
les.
rob
ert
son
@ce
rn.c
h
CERN
Frédéric Hemmer, CERN, IT Department
5
The LHC Computing Grid – April 2007
The LHC Data Challenge
• The accelerator will be completed in 2007 and run for 10-15 years
• Experiments will produce about15 Million Gigabytes of data each year (about 20 million CDs!)
• LHC data analysis requires a computing power equivalent to ~100,000 of today's fastest PC processors
• Requires many cooperating computer centres, as CERN can only provide ~20% of the capacity
Frédéric Hemmer, CERN, IT Department
6
The LHC Computing Grid – April 2007
Solution: the Grid
• Use the Grid to unite computing resources of particle physics institutes around the world
The World Wide Web provides seamless access to information that is stored in many millions of different geographical locations
The Grid is an infrastructure that provides seamless access to computing power and data storage capacity distributed over the globe
Frédéric Hemmer, CERN, IT Department
7
The LHC Computing Grid – April 2007
LCG Service Hierarchy
Tier-0 – the accelerator centre• Data acquisition & initial processing• Long-term data curation Data Distribution to Tier-1 centres
Canada – Triumf (Vancouver)France – IN2P3 (Lyon)Germany –KarlsruheItaly – CNAF (Bologna)Netherlands – NIKHEF/SARA (Amsterdam)Nordic countries – distributed Tier-1
Spain – PIC (Barcelona)Taiwan – Academia SInica (Taipei)UK – CLRC (Oxford)US – FermiLab (Illinois) – Brookhaven (NY)
Tier-1 – “online” to the data acquisition process high availability
• Managed Mass Storage – grid-enabled data service
All re-processing passes• Data-heavy analysis• National, regional support
Tier-2 – ~100 centres in ~40 countries• Simulation• End-user analysis – batch and interactive Services, including Data Archive and Delivery, from Tier-1s
Frédéric Hemmer, CERN, IT Department
8
The LHC Computing Grid – April 2007
CERN18%
All Tier-1s39%
All Tier-2s43%
CERN12%
All Tier-1s55%
All Tier-2s33%
CERN34%
All Tier-1s66%
CPU Disk Tape
Summary of Computing Resource RequirementsAll experiments - 2008From LCG TDR - June 2005
CERN All Tier-1s All Tier-2s TotalCPU (MSPECint2000s) 25 56 61 142Disk (PetaBytes) 7 31 19 57Tape (PetaBytes) 18 35 53
Distribution of Computing Services
Frédéric Hemmer, CERN, IT Department
9
The LHC Computing Grid – April 2007
LHC Computing Grid Project - a Collaboration
• The physicists and computing specialists from the LHC experiments
• The national and regional projects in Europe and the US that have been developing Grid middleware
• The regional and national computing centres that provide resources for LHC
• The research networks
Researchers
Computer Scientists &
Software Engineers
Service Providers
Building and operating the LHC Grid – a global collaboration between
Frédéric Hemmer, CERN, IT Department
10
The LHC Computing Grid – April 2007
The Grid used by CERN and its partners
• The EGEE and OSG projects are the basis of the Worldwide LHC Computing Grid Project
Frédéric Hemmer, CERN, IT Department
11
The LHC Computing Grid – April 2007
Frédéric Hemmer, CERN, IT Department
12
The LHC Computing Grid – April 2007
The new European Network Backbone
• LCG working group with Tier-1s and national/ regional research network organisations
• New GÉANT 2 – research network backbone
Strong correlation with major European LHC centres
• Swiss PoP at CERN
Frédéric Hemmer, CERN, IT Department
13
The LHC Computing Grid – April 2007
• LCG has been the driving force for the European multi-science Grid EGEE (Enabling Grids for E-sciencE)
• EGEE is now a global effort, and the largest Grid infrastructure worldwide
• Co-funded by the European Commission (~130 M€ over 4 years)
• EGEE already used for >20 applications, including…
Impact of the LHC Computing Gridin Europe
Medical ImagingEducation, TrainingBio-informatics
Frédéric Hemmer, CERN, IT Department
14
The LHC Computing Grid – April 2007
The EGEE Project• Infrastructure operation
• Currently includes >200 sites across 40 countries• Continuous monitoring of grid services & automated site
configuration/managementhttp://gridportal.hep.ph.ic.ac.uk/rtm/
• Middleware• Production quality middleware distributed under business
friendly open source licence
• User Support - Managed process from first contact through to production usage
• Training• Documentation • Expertise in grid-enabling applications• Online helpdesk• Networking events (User Forum, Conferences etc.)
• Interoperability• Expanding interoperability with related infrastructures
Frédéric Hemmer, CERN, IT Department
15
The LHC Computing Grid – April 2007
0
5000
10000
15000
20000
25000
30000
35000
40000
Apr-0
4
Jun-
04
Aug-0
4
Oct-
04
Dec-0
4
Feb-0
5
Apr-0
5
Jun-
05
Aug-0
5
Oct-
05
Dec-0
5
Feb-0
6
Apr-0
6
Jun-
06
Aug-0
6
Oct-
06
Dec-0
6
No
. CP
U
0
50
100
150
200
250
Apr-0
4
Jun-
04
Aug-0
4
Oct-
04
Dec-0
4
Feb-0
5
Apr-0
5
Jun-
05
Aug-0
5
Oct-
05
Dec-0
5
Feb-0
6
Apr-0
6
Jun-
06
Aug-0
6
Oct-
06
Dec-0
6
No
. Sit
es
Status
~17.5 million jobs run (6450 cpu-years) in 2006;
Workloads of the “other VOs” start to be significant – approaching 8-10K jobs per day; and 1000 cpu-months/month
• one year ago this was the overall scale of work for all VOs
~17.5 million jobs run (6450 cpu-years) in 2006;
Workloads of the “other VOs” start to be significant – approaching 8-10K jobs per day; and 1000 cpu-months/month
• one year ago this was the overall scale of work for all VOs
Frédéric Hemmer, CERN, IT Department
17
The LHC Computing Grid – April 2007
Use of the EGEE Infrastructure
>20k jobs running simultaneously
Frédéric Hemmer, CERN, IT Department
18
The LHC Computing Grid – April 2007
Use for massive data transfer
Large LHC experiments now transferring ~ 1PB/month each
Frédéric Hemmer, CERN, IT Department
19
The LHC Computing Grid – April 2007
Site availability – SAM
Frédéric Hemmer, CERN, IT Department
21
The LHC Computing Grid – April 2007
Applications on EGEE• More than 20 applications from 7 domains
• Astrophysics• MAGIC, Planck
• Computational Chemistry• Earth Sciences
• Earth Observation, Solid Earth Physics, Hydrology, Climate • Financial Simulation
• E-GRID• Fusion• Geophysics
• EGEODE• High Energy Physics
• 4 LHC experiments (ALICE, ATLAS, CMS, LHCb)• BaBar, CDF, DØ, ZEUS
• Life Sciences• Bioinformatics (Drug Discovery, GPS@, Xmipp_MLrefine, etc.)• Medical imaging (GATE, CDSS, gPTM3D, SiMRI 3D, etc.)
• Multimedia• Material Sciences • …
Frédéric Hemmer, CERN, IT Department
22
The LHC Computing Grid – April 2007
Example: EGEE Attacks Avian Flu
• EGEE used to analyse 300,000 possible potential drug compounds against bird flu virus, H5N1.
• 2000 computers at 60 computer centres in Europe, Russia, Taiwan, Israel ran during four weeks in April - the equivalent of 100 years on a single computer.
• Potential drug compounds now being identified and ranked
Neuraminidase, one of the two major surface proteins of influenza viruses, facilitating the release of virions from infected cells. Image Courtesy Ying-Ta Wu, AcademiaSinica.
Frédéric Hemmer, CERN, IT Department
23
The LHC Computing Grid – April 2007
Example: Geocluster industrial application
• The first industrial application successfully running on EGEE
• Developed by the Compagnie Générale de Géophysique (CGG) in France, doing geophysical simulations for oil, gas, mining and environmental industries.
• EGEE technology helps CGG to federate its computing resources around the globe.
Frédéric Hemmer, CERN, IT Department
25
The LHC Computing Grid – April 2007
Evolution
EGEE EGEE-IIEDG EGEE-III
European e-Infrastructure
Coordination
Testbeds Utility ServiceRoutine Usage
Frédéric Hemmer, CERN, IT Department
27
The LHC Computing Grid – April 2007