OpenStack @ CERN, by Tim Bell

53

Transcript of OpenStack @ CERN, by Tim Bell

Page 1: OpenStack @ CERN, by Tim Bell
Page 2: OpenStack @ CERN, by Tim Bell

30/11/2016 OpenStack Switzerland Meetup 2

CERN Cloud Status

Tim Bell – [email protected] - @noggin143

Page 3: OpenStack @ CERN, by Tim Bell

ATLAS Experiment

30/11/2016 OpenStack Switzerland Meetup 3

Page 4: OpenStack @ CERN, by Tim Bell

COLLISIONS

30/11/2016 OpenStack Switzerland Meetup 4

Collisions Produce 1PB/s

Page 5: OpenStack @ CERN, by Tim Bell

Total LHC data:130 PB!

LHC Raw Data Recording (1bin/1month) !

Large Scale Storage for LHC data

30/11/2016 OpenStack Switzerland Meetup 5

Page 6: OpenStack @ CERN, by Tim Bell

30/11/2016

Compute Growth Outlook

0"

20"

40"

60"

80"

100"

120"

140"

160"

Run"1" Run"2" Run"3" Run"4"

GRID"

ATLAS"

CMS"

LHCb"

ALICE"

Compute: Growth > x50 Moore’s law only x16

What we can afford

… and 400PB/year by 2023 OpenStack Switzerland Meetup 6

Page 7: OpenStack @ CERN, by Tim Bell

30/11/2016 OpenStack Switzerland Meetup 7

Page 8: OpenStack @ CERN, by Tim Bell

30/11/2016 OpenStack Switzerland Meetup 8

Expanded Facilities in Hungary

Page 9: OpenStack @ CERN, by Tim Bell

O’Reilly Consideration

30/11/2016 OpenStack Switzerland Meetup 9

Page 10: OpenStack @ CERN, by Tim Bell

Job Trends Consideration

30/11/2016 OpenStack Switzerland Meetup 10

From Indeed.com

Page 11: OpenStack @ CERN, by Tim Bell

CERN Tool Chain

30/11/2016 OpenStack Switzerland Meetup 11

Page 12: OpenStack @ CERN, by Tim Bell

1st Swiss OpenStack User Group

30/11/2016 OpenStack Switzerland Meetup 12

•  15th November 2012 – Hamster cloud based on Essex on 200 hypervisors •  https://www.openstack.org/blog/2012/12/1st-swiss-openstack-user-group/

Page 13: OpenStack @ CERN, by Tim Bell

CERN OpenStack Project

(*) Pilot Trial

ESSEX

Nova (*) Swift

Glance (*) Horizon (*)

Keystone (*)

FOLSOM

Nova (*) Swift

Glance (*) Horizon (*)

Keystone (*) Quantum Cinder

GRIZZLY

Nova Swift

Glance Horizon

Keystone Quantum Cinder

Ceilometer (*)

HAVANA

Nova Swift

Glance Horizon

Keystone Neutron Cinder

Ceilometer (*) Heat

ICEHOUSE

Nova Swift

Glance Horizon

Keystone Neutron Cinder

Ceilometer Heat Ironic Trove

JUNO

Nova Swift

Glance Horizon

Keystone Neutron Cinder

Ceilometer Heat (*) Rally (*)

5 April 2012 27 September 2012 4 April 2013 17 October 2013 17 April 2014 16 October 2014

July 2013 CERN OpenStack

Production

February 2014 CERN OpenStack

Havana

October 2014 CERN OpenStack

Icehouse

30 April 2015

March2015 CERN OpenStack

Juno

LIBERTY

Nova Swift

Glance Horizon

Keystone Neutron (*)

Cinder Ceilometer

Heat Rally

Magnum (*) Barbican (*)

Ironic

15 October 2015

September 2015 CERN OpenStack

Kilo

KILO

Nova Swift

Glance Horizon

Keystone Neutron Cinder

Ceilometer Heat Rally

Manila

September 2016 CERN OpenStack

Liberty

30/11/2016 OpenStack Switzerland Meetup 13

MITAKA

Nova Swift

Glance Horizon

Keystone Neutron Cinder

Ceilometer Heat Rally

Magnum Barbican

Ironic Mistral Manila

Page 14: OpenStack @ CERN, by Tim Bell

Upstream OpenStack on its own does not give you a cloud service

e.g.

•  200 people arrive and leave CERN / month

•  User skill levels vary widely

•  Application range from server consolidation, dev/test to production compute

Cloud is a service!

Monitoring and alerting

Metering and

chargeback Autoscaling

Remediation

Resource Lifecycle

Capacity planning

Upgrades

SLA

Customer support User

experience

Incident resolution

Alerting

Cloud monitoring

Metrics

Log processing

High availability

Config management

Infra onboarding

CI

Builds Net/info sec

Network design

OpenStack APIs

30/11/2016 OpenStack Switzerland Meetup 14

Subbu Allamaraju @ eBay

Not Just The Software

Page 15: OpenStack @ CERN, by Tim Bell

In production: •  >190K cores •  >7000

hypervisors

~100,000 additional cores being installed in next 6 months 90% of CERN’s compute resources are now delivered on top of OpenStack

OpenStack@CERN Status

30/11/2016 OpenStack Switzerland Meetup 15

Page 16: OpenStack @ CERN, by Tim Bell

30/11/2016

Cultural Change Impact

OpenStack Switzerland Meetup 16

•  Move to Agile technologies brings great

benefits •  Reduced deployment time •  Continuous integration validation •  Flexibility

•  Don’t forget Hooke’s Law (adapted) •  Under load, an organisation can

extend proportional to external force •  Too much load leads to permanent

deformation

•  Ensure the tail is moving fast as well as the head

•  Application support •  Cultural challenges •  Process change

Page 17: OpenStack @ CERN, by Tim Bell

Top level cell •  Runs API service •  Top cell scheduler

Child cells run

•  Compute nodes •  Scheduler •  Conductor •  ~40 cells

Version 2 coming •  Default for all

Nova Cells

30/11/2016 OpenStack Switzerland Meetup 17

Page 18: OpenStack @ CERN, by Tim Bell

Onwards Federated Clouds

Public Cloud such as Rackspace or IBM

CERN Private Cloud 160K cores

ATLAS Trigger 28K cores

ALICE Trigger 9K cores

CMS Trigger 13K cores

INFN Italy

Brookhaven National Labs

NecTAR Australia

Many Others on Their Way

30/11/2016 OpenStack Switzerland Meetup 18

Available in standard OpenStack since Kilo

Page 19: OpenStack @ CERN, by Tim Bell

HPC

•  From ~200TB total to ~450 TB of RBD + 50 TB RGW •  From ~200TB total to ~450 TB of RBD + 50 TB RGW

OpenStack Glance + Cinder

Example: ~25 puppet masters reading node configurations at up to 40kHz

iops

CephFS with Manila is now in pilot phase for cluster filesystems

30/11/2016 OpenStack Switzerland Meetup 19

Page 20: OpenStack @ CERN, by Tim Bell

Containers on Clouds

For the user •  Interactive •  Dynamic •  Choice For IT •  Timely! •  Secure •  Managed •  Integrated

30/11/2016 OpenStack Switzerland Meetup 20

CERN now runs Magnum on the production cloud

Page 21: OpenStack @ CERN, by Tim Bell

External Clouds •  Terraform is used to create hybrid clouds •  Tests for 2-3 months on various European

public resources

30/11/2016 OpenStack Switzerland Meetup 21

Page 22: OpenStack @ CERN, by Tim Bell

Summary •  OpenStack at CERN has been in

production for 3 years •  Major cultural and technology changes

have been successfully addressed •  Contributing back upstream has led to

sustainable tools and effective technology transfer

This transformation would not have been possible without the OpenStack community

30/11/2016 OpenStack Switzerland Meetup 22

Page 23: OpenStack @ CERN, by Tim Bell

Technical details at http://openstack-in-production.blogspot.fr

30/11/2016 OpenStack Switzerland Meetup 23

Thanks to all of you for contributing!

Thanks to the CERN IT team and Francois Briard from CERN communications

Page 24: OpenStack @ CERN, by Tim Bell

24

CERN: founded in 1954: 12 European States “Science for Peace” Today: 21 Member States

Member States: Austria, Belgium, Bulgaria, Czech Republic, Denmark, Finland, France, Germany, Greece, Hungary, Israel, Italy, Netherlands, Norway, Poland, Portugal, Slovak Republic, Spain, Sweden, Switzerland and United Kingdom Associate Member States: Pakistan, Turkey States in accession to Membership: Romania, Serbia Applications for Membership or Associate Membership: Azerbaijan, Brazil, Croatia, Cyprus, India, Russia, Slovenia, Ukraine Observers to Council: India, Japan, Russia, United States of America; European Union, JINR and UNESCO

~ 2300 staff ~ 1400 other paid personnel ~ 12500 scientific users Budget (2016) ~1000 MCHF

30/11/2016 OpenStack Switzerland Meetup 24

Page 25: OpenStack @ CERN, by Tim Bell

CMS

ALICE

ATLAS LHCb

Largest Machine on Earth

27 Km circumference, 100m underground 30/11/2016 OpenStack Switzerland Meetup 25

Page 26: OpenStack @ CERN, by Tim Bell

Most Powerful Magnets

9,600 superconducting magnets direct the beam 30/11/2016 OpenStack Switzerland Meetup 26

Page 27: OpenStack @ CERN, by Tim Bell

Highest Vacuum

Pressure is 10 times less than on the moon 30/11/2016 OpenStack Switzerland Meetup 27

Page 28: OpenStack @ CERN, by Tim Bell

Coldest Temperature

120 tonnes of liquid helium for cooling to -271°C 30/11/2016 OpenStack Switzerland Meetup 28

Page 29: OpenStack @ CERN, by Tim Bell

30M USD/year Electricity Bill

Up to 200MW at peak utilisation 30/11/2016 OpenStack Switzerland Meetup 29

Page 30: OpenStack @ CERN, by Tim Bell

ALICE Experiment

30/11/2016 OpenStack Switzerland Meetup 30

Page 31: OpenStack @ CERN, by Tim Bell

CMS Experiment

30/11/2016 OpenStack Switzerland Meetup 31

Page 32: OpenStack @ CERN, by Tim Bell

LHCb Experiment

30/11/2016 OpenStack Switzerland Meetup 32

Page 33: OpenStack @ CERN, by Tim Bell

Some history of scale…

Date Collaboration sizes

Data volume, archive technology

Late 1950’s 2-3 Kilobits, notebooks

1960’s 10-15 kB, punchcards

1970’s ~35 MB, tape

1980’s ~100 GB, tape, disk

1990’s ~750 TB, tape, disk

2010’s ~3000 PB, tape, disk

For comparison: 1990’s: Total LEP data set ~few TB Would fit on 1 tape today Today: 1 year of LHC data ~27 PB

30/11/2016 OpenStack Switzerland Meetup 33

Page 34: OpenStack @ CERN, by Tim Bell

CERNArchive>100 PB

CERNNewdata

15 PB

23 PB

27 PB

30/11/2016 OpenStack Switzerland Meetup 34

Page 35: OpenStack @ CERN, by Tim Bell

35 30/11/2016 OpenStack Switzerland Meetup

Page 36: OpenStack @ CERN, by Tim Bell

30/11/2016 OpenStack Switzerland Meetup 36

Innovations

Page 37: OpenStack @ CERN, by Tim Bell

TIER-1: permanent storage,

re-processing, analysis

TIER-0 (CERN): data recording,

reconstruction and distribution

TIER-2: Simulation,

end-user analysis

> 2 million jobs/day

~350’000 cores

500 PB of storage

nearly 170 sites, 40 countries

10-100 Gb links

Worldwide LHC Computing Grid

30/11/2016 OpenStack Switzerland Meetup 37

Page 38: OpenStack @ CERN, by Tim Bell

THE CERN MEYRIN DATA CENTRE

http://goo.gl/maps/K5SoG

30/11/2016 OpenStack Switzerland Meetup 38

Page 39: OpenStack @ CERN, by Tim Bell

Public Procurement Cycle Step Time (Days) Elapsed (Days)

User expresses requirement 0 Market Survey prepared 15 15 Market Survey for possible vendors 30 45 Specifications prepared 15 60 Vendor responses 30 90 Test systems evaluated 30 120 Offers adjudicated 10 130 Finance committee 30 160 Hardware delivered 90 250 Burn in and acceptance 30 days typical with 380 worst case 280

Total 280+ Days

30/11/2016 OpenStack Switzerland Meetup 39

Page 40: OpenStack @ CERN, by Tim Bell

Good News, Bad News

•  Additional data centre in Budapest now online •  Increasing use of facilities as data rates increase But… •  Staff numbers are fixed, no more people •  Materials budget decreasing, no more money •  Legacy tools are high maintenance and brittle •  User expectations are for fast self-service

30/11/2016 OpenStack Switzerland Meetup 40

Page 41: OpenStack @ CERN, by Tim Bell

Innovation Dilemma

•  How can we avoid the sustainability trap ? •  Define requirements •  No solution available that meets those requirements •  Develop our own new solution •  Accumulate technical debt

•  How can we learn from others and share ? •  Find compatible open source communities •  Contribute back where there is missing functionality •  Stay mainstream

Are CERN computing needs really special ?

30/11/2016 OpenStack Switzerland Meetup 41

Page 42: OpenStack @ CERN, by Tim Bell

30/11/2016 42 OpenStack Switzerland Meetup

Page 43: OpenStack @ CERN, by Tim Bell

The largest detectors

30/11/2016 OpenStack Switzerland Meetup 43

Page 44: OpenStack @ CERN, by Tim Bell

30/11/2016 OpenStack Switzerland Meetup 44

Page 45: OpenStack @ CERN, by Tim Bell

Strategic Plan

�  Establish multi-tenant, multi-provider cloud infrastructure

�  Identify and adopt policies for trust, security and privacy

�  Create governance structure

�  Define funding schemes

To support the computing capacity

needs for the ATLAS experiment

Setting up a new service to simplify analysis of large genomes, for a deeper insight into

evolution and biodiversity

To create an Earth Observation platform,

focusing on earthquake and volcano research

To improve the speed

and quality of research for finding surrogate biomarkers based on

brain images

Adopters

Suppliers

Additional Users:

30/11/2016 OpenStack Switzerland Meetup 45

Page 46: OpenStack @ CERN, by Tim Bell

46 * EC co-funded joint Pre-Commercial Procurement (PCP) project: https://indico.cern.ch/event/319753 ** Other work has been conducted outside CERN, such as the Amazon Pilot project at BNL for ATLAS

HN - Helix Nebula •  Partnership between research

organization and European commercial cloud providers

Support CERN’s scientific computing programme

Investigations

Page 47: OpenStack @ CERN, by Tim Bell

Tier 0

30/11/2016 OpenStack Switzerland Meetup 47

Page 48: OpenStack @ CERN, by Tim Bell

Run 2 has only just started •  Hint of an excess with diphoton mass of 750 GeV

•  Seen by ATLAS and CMS – coincidence or a new signal?

OpenStack Switzerland Meetup 48

200 400 600 800 1000 1200 1400 1600

Even

ts /

20 G

eV

1−10

1

10

210

310

410ATLAS Preliminary

Spin-0 Selection-1 = 13 TeV, 3.2 fbs

Data

Background-only fit

[GeV]γγm200 400 600 800 1000 1200 1400 1600

Dat

a - f

itted

bac

kgro

und

10−

5−05

1015

Even

ts /

( 20

GeV

)

1

10

210

DataFit model

σ 1 ±

σ 2 ±

EBEB

(GeV)γ γm400 600 800 1000 1200 1400 1600

stat

σ(d

ata-

fit)/

-2

0

2

(13 TeV, 3.8T)-12.7 fbCMS Preliminary

30/11/2016

Page 49: OpenStack @ CERN, by Tim Bell

•  Open source collaboration sets model for in-house teams

•  External recognition by the community is highly rewarding for contributors

•  Reviews and being reviewed is a constant learning experience

•  Operator sharing is much more than just OpenStack

30/11/2016

Communities Encourage Change

Keystone 2013

Paris 2014

OpenStack Switzerland Meetup 49

Page 50: OpenStack @ CERN, by Tim Bell

Deployment Models

30/11/2016 OpenStack Switzerland Meetup 50

•  Skills •  Culture •  Need for Speed •  Risk Appetite

Page 51: OpenStack @ CERN, by Tim Bell

Keeping Up with Releases

30/11/2016 OpenStack Switzerland Meetup 51

Page 52: OpenStack @ CERN, by Tim Bell

Users and Technology

30/11/2016 OpenStack Switzerland Meetup 52

Page 53: OpenStack @ CERN, by Tim Bell

When it’s not on the menu?

30/11/2016 OpenStack Switzerland Meetup 53