High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life...

30
High Performance Computing in the Life Sciences or It came for us, it’s coming for you

Transcript of High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life...

Page 1: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

High Performance Computing in the Life Sciences

or

It came for us, it’s coming for you

Page 2: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

HPC questions

Who am I and why are you listening to me?What is High Performance computing?

What are the challenges of HPC?What happens when biomedical research meets HPC?

How we are meeting the challenges?

Page 3: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

A Scientific Computing CVPeter Maccallum BSc PhDHead of IT and Scientific ComputingCancer Research UK Cambridge Research Institute

Page 4: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

A Scientific Computing CVPeter Maccallum BSc PhDHead of IT and Scientific ComputingCancer Research UK Cambridge Research Institute

(Relevant) previous employers•Edinburgh Parallel Computing Centre•European Bioinformatics Institute

Page 5: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

High Performance Computing

Fast processors…or multiple processors…or multiple cores

High memory…or shared memory…or memory interconnect

Fast interconnectFast disk‘Fast’

… ‘High’

… these are relative terms

Page 6: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Cray T3D199319 Gflops

Page 7: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are
Page 8: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Dell Optiplex2011~10 Gflops

Page 9: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

My Desktop

Page 10: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

HTC Desire20111 GHz…

Page 11: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

My Phone

My Desktop

Page 12: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

‘e-Infrastructure’

2011 UK BIS/RCUK Review identified these key areas:

NetworksPeople & skillsDataComputationSoftwareAuthentication & Security

Page 13: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Networks

Connectivity between processorsGb EthernetInfiniBand10Gb EthernetNUMA memory architectures

Connectivity to the outside worldJANETParallel NFS, GridFTPThe last 10 yards…

Page 14: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

People & skills

Where do IT staff come from Desktop support –

MCSE –

Systems Architect

Computing Science –

Software Engineering –

ArchitectureResearch Science –

Systems Administration –

HPC

Some skills issues for HPC Breadth of understanding

• Storage, networking, processing, advanced programming

Parallel, multithreaded, distributed codeCareer paths

Page 15: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Data

Traditional problems were CPU driven• Simulation• Numerical analysis

Current drive to HPC has data as its source• Sensors• Image capture• Internet

HPC storageHigh speed, low-latency storageHigh parallel throughput storageScale-out storage

Page 16: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Computation

HPC is driven by commodity computingVector processors, SIMD devices no longer mainstream

Specialist considerations in commodity processorsFloating point performanceMulti-core architecturesGPU, hybrid GPU-CPU

ConstraintsPowerMemory bandwith, cacheing, I/OProgrammability–

peak vs observed performance

Page 17: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

SoftwareParallelism

Advanced topic in most programming coursesMultithreading is the barest minimum for multi-coreCompilers are clever, but not magic

ScalabilityMulti node, multi-core architecturesWith multilevel cachesAnd add in I/O scalability...

Is exascale programming even possible?Will we have to re-write our science to match the scale-out algorithms?

Page 18: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Authentication and SecurityYou own the computers

...so you give out the passwordsA surprising number of systems are still run this wayBut we need to federate systems, networks, storage

Certificates/public key infrastructuresBased on personal certificates, private/public keysManagement overheads with trust hierarchies

Federated authenticationI don’t know you and I don’t trust you...but I trust the organisation you work forEduroam, Shibboleth, JANET Moonshot

Page 19: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Cancer Research UK

Cambridge Research Institute

Page 20: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Cancer Research UK

IT & Scientific Computing

Cambridge Research Institute

Page 21: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Microscopy

Confocal microscopehigh definition, multidimensional image data

Depth –

z-stacks generate 3D data

Time –

time series of organelle migration

Colour –

differential fluorescence to Identify location of molecules

Page 22: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Histopathology

Robotic image scanner

Tissue microarrays –

hundreds of samples per slide

Each sample scanned at high resolution

Page 23: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

DNA sequencing

Illumina Genome Analyser

Parallel Sequencing of DNA fragments

Rapid sequencing of whole individuals,Detailed studies of cellular processes

Page 24: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Networks at the CRI

Gb Ethernet in HP blade enclosures4:1 oversubscriptionWould prefer IB…

100TB of sequence data…via sneakernet

Page 25: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

People and skills at the CRI

Traditional HPC routeChemistPhysicistMedical Physicist

Computer ScientistsEngineer …

computer scientist

Computer Scientist x3

Page 26: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Data at the CRI

LustreParallel distributed FS96TB

IbrixScale out FS450TB + + +

Page 27: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Computation at the CRI

Xeon Westmere64x2x4 core HP BL460768 cores in 60u

Xeon Sandy Bridge16x2x2x6 core HP BL2/220c384 cores in 10u

Platform LSF scheduler

Page 28: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Software at the CRICodes

Image processingSystems biologyPerl, Python, shell scripts…

How do we optimise without becoming a software house?

Resource optimisationCode optimisationAlgorithm designHardware selectionFrameworksTraining

Page 29: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

Authentication and Security at the CRI

LONDON

CAMBRIDGE

Option 1Guest accounts + ssh

Option 2JANET MoonshotAD federation via GSS

Page 30: High Performance Computing in the Life Sciences€¦ · High Performance Computing in the Life Sciences or It came for us, it’s coming for you. HPC questions Who am I and why are

High Performance ComputingIs a relative term

Today’s HPC developments will become mainstream

Depends on people as much as technologyThere’s never a stable, commodity HPC platform

Asks us all some hard questionsWill our current approaches to software scale to take

advantage of next generation HPC?