Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page...

26
DESY IT Challenges and future directions in supporting scientific exploration Dr Paul Millar DESY IT DESY, 2015-03-12

Transcript of Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page...

Page 1: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

DESY IT

Challenges and future directions in supporting scientific exploration

Dr Paul MillarDESY ITDESY, 2015-03-12

Page 2: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 2

“Traditional” in-house research

Facilities:

HERA, PETRA, FLASH, XFEL, ...

User communities:

"DESY people doing science"

Reliable Services:

Compute, Storage.Networking, Desktops, Printing, email, web, Log book, ...

DATAA

NA

LYS

IS

Page 3: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 3

Membership of international collaborations

Facilities:

HERA, PETRA, FLASH, XFEL, ...

User communities:

"DESY people doing science"

Reliable Services:

Compute, Storage.Networking, Desktops, Printing, email, web, Log book, ...

DATA

AN

ALY

SIS

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

Scientific Collaboration Scientific Collaboration

Page 4: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 4

DESY hosting “visitor” scientists

Facilities:

HERA, PETRA, FLASH, XFEL, ...

User communities:

"DESY people doing science"

Reliable Services:

Compute, Storage.Networking, Desktops, Printing, email, web, Log book, ...

DATA

AN

ALY

SIS

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

Scientific Collaboration Scientific Collaboration

External Research Centre:

Local users:

Visiting users:

Page 5: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 5

Researching improvements

Facilities:

HERA, PETRA, FLASH, XFEL, ...

User communities:

"DESY people doing science"

Reliable Services:

Compute, Storage.Networking, Desktops, Printing, email, web, Log book, ...

DATA

AN

ALY

SIS

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

Scientific Collaboration Scientific Collaboration

External Research Centre:

Local users:

Visiting users:

Research activity:

EXPERIENCE

IMPROVEMENTS

Federated AAI Storage Algorithms Optimisation Cloud Data Preservation

Page 6: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 6

Research connected

Masters projects German projectsEU Projects

Facilities:

HERA, PETRA, FLASH, XFEL, ...

User communities:

"DESY people doing science"

Reliable Services:

Compute, Storage.Networking, Desktops, Printing, email, web, Log book, ...

DATA

AN

ALY

SIS

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

Scientific Collaboration Scientific Collaboration

External Research Centre:

Local users:

Visiting users:

Research activity:

EXPERIENCE

IMPROVEMENTS

Federated AAI Storage Algorithms Optimisation Cloud Data Preservation

Industry

Standards Groups

User communities

Page 7: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 7

Complete overview

Masters projects German projectsEU Projects

Facilities:

HERA, PETRA, FLASH, XFEL, ...

User communities:

"DESY people doing science"

Reliable Services:

Compute, Storage.Networking, Desktops, Printing, email, web, Log book, ...

DATA

AN

ALY

SIS

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

External Research Centre:

External users:Services:

Scientific Collaboration Scientific Collaboration

External Research Centre:

Local users:

Visiting users:

Research activity:

EXPERIENCE

IMPROVEMENTS

Federated AAI Storage Algorithms Optimisation Cloud Data Preservation

Sites deploying DESY software:

Industry contributing to and using DESY software

Industry

Standards Groups

User communities

Page 8: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 8

The WLCG “computer” for LHC computation

> 170 computer centres, located at 40 countries spread over the world

> Networking originally hierarchically structured:

Single Tier-0 is CERN, countries (mostly) have a single Tier-1 and multiple Tier-2

Now less structured: network traffic crosses country boundaries.

> Compute facility provided as various independent batch systems:

Some 490,000 job slots (i.e., cores), ~3% by DESY.

> Storage capacity at sites is provided by various software

Some 254 PiB (~5% by DESY) of disk capacity and 200 PiB tape capacity

> Dedicated networking:

LHC-OPN: dedicated fibre-optic link from CERN to Tier-1 centres and between Tier-1 centres.

LHC-ONE: isolated WLCG traffic from normal Internet activity.

Page 9: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 9

Comparison of data sizes

Detektoren, Datenverarbeitung, Vernetzung

9

LHCraw/year

15 PBytes(2012)

ATLAS total managed data 2014(raw, reconstructed, simulated)

140 PBytes

Increase by factor 14 by 2020

Business mails/year3000 PBytes

Facebook uploads/year182 PBytes

Google index98 PBytes

YouTube15 PBytes

Nasdaq

So

urc

e: W

ired

201

3; A

TL

AS

Slide thanks to Dr. Patrick Fuhrmann

Page 10: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

dCache … an example of DESY research project

dCache

Page 11: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 11

dCache: managed storage for big data

Collaborations

InternationalCollaboration

Student mentor programme

dCache

dCache

1.5 FTEs

2 FTEs

5 FTEs

3 students

Page 12: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 12

dCache: evolution of big data E

raA

dd

itio

nal

C

om

mu

nit

ies

Ad

dit

ion

alA

uth

en-

tica

tio

n

Industry

Trusted host X.509,Kerberos

Username+PW SAML, OpenID,OAuth, Token, ...

Disk cache Grid StorageGeneric Storage Cloud Storage

Page 13: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 13

Storage Vision

NFS

WebDAV

HPC & Grid Clusters

Low latency access

DropBox-like storage

Devices synchronise with storage

Remote access

Rich access via web- browser

Bulk WAN transfer

Moving huge datasets

CDMI

HTTPFTP

Cloud storage

Standard back- end for clusters and portals

dCacheNFS

Fast data ingestStandard devices at high data rates

Page 14: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 14

dCache future directions

> Building on existing support for standards:

Strong support for NFS v4.1 pNFS.

> Taking advantage of others' work:

Technologies like CEPH partially overlap with dCache; can we build on it?

> Rethinking storage:

New protocols, like CDMI, define much richer semantics of storage; do these provide new opportunities?

Clients using dCache as an object store.

> INDIGO DataCloud:

€11.1M, 26 parters (11 countries), 30 months H2020 project.

Building software to support a European-wide federated cloud.

> Work towards the Storage Vision:

Many parts already there, we're adding the remaining bit.

Page 15: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 15

Learn more in Femto “Big Data” issue

http://www.desy.de/femto/http://www.desy.de/femto_eng/

Subscribe:

The next issue will focus on Big Data.

Page 16: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Backup slides

Page 17: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 17

Overview of the LHC

Page 18: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 18

HEP data analysis: reconstruction

Images curtsey of CERN and the CMS collaboration

Page 19: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 19

HEP analysis: reconstruction

Images curtsey of CERN and the ATLAS collaboration

Page 20: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 20

HEP data analysis workflow

Monte Carlo Production:

RAWRAW

RAW

Analysis:Reconstruction:

Geometry & Conditions:

RAW

RAW

Page 21: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21

Next generation photon-science detectors

2560 x 2160 100 Hz 5.6 Gb/s Win 7

1k x 1k x 2 2 kHz 30 Gb/s RHEL6

Pixels Frame Rate Data Rate OS

3x1536x512x2 2 kHz 60 Gb/s SL6

PCO Edge

LAMBDA

Eiger

Slide thanks to Dr. Steve Aplin

Page 22: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 22

DESY network topology

Data Center

XACC

LHCONE

MKS

MST

FLASH

Off ce

X-WiN

RT-WAN-01 (LAN)

RT-197-12 RT-IPMI-01

RT-197-1 RT-197-2

GR-HAMXR-DES

RT-197-14

DMZ

XFEL-Guest

Zeuthen Intern

RT-197-13

RT-197-11

RT-HH-01 RT-HH-02

GuestRT-197-17

CR-TUB

RT-WAN-01

RT-197-15

RT-DC-03 RT-DC-04

FW-197-22

40 Gbps

10 Gbps

1 Gbps

100 Mbps

CR-HAN

vPC / MLAG

4 x 10

RT-XACC-02RT-XACC-01

RT-DC-01 RT-DC-02

RT-WAN-02 (LAN)

RT-WAN-02

Zeuthen Extern

05.03.2015

Slide thanks to Kars Ohrenberg

Page 23: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 23

LHC ONE: global infrastructure for LHC connectivity

Slide thanks to Kars Ohrenberg

Page 24: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 24

Computing usage

Page 25: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 25

LSDMA: structure

Page 26: Dr Paul Millar DESY, 2015-03-12 · Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 21 Next generation photon-science detectors 2560 x 2160 100 Hz 5.6 Gb/s Win 7 1k

Paul Millar | DESY IT Challenges in Storage | 2015-03-12 | Page 26

LSDMA: data concept