Scientific Computing at the SNS - bnl.gov

31
Scientific Computing at the SNS Stuart Campbell S i tifi C ti G Scientific Computing Group Neutron Scattering Science Division

Transcript of Scientific Computing at the SNS - bnl.gov

Page 1: Scientific Computing at the SNS - bnl.gov

Scientific Computing at the SNS

Stuart CampbellS i tifi C ti GScientific Computing GroupNeutron Scattering Science Division

Page 2: Scientific Computing at the SNS - bnl.gov

ORNL’s two world-leading facilities for neutron science

Spallation Neutron Source

High Flux Isotope Reactor

2 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 3: Scientific Computing at the SNS - bnl.gov

• Construction completed May 2006

The Spallation Neutron SourceConstruction completed May 2006

• User operations began in 2007• Design Power 1.4 MW: ~8× ISIS, the world’s leading pulsed spallation source

R f 25 i t t i h i h i t• Room for 25 instruments spanning physics, chemistry,biology, and materials science

• Upgradable to higher power and a second target station

3 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop3 Managed by UT-Battelle

for the Department of Energy

Page 4: Scientific Computing at the SNS - bnl.gov

13 Operating Instruments

4 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 5: Scientific Computing at the SNS - bnl.gov

9 Operating Instruments

5 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 6: Scientific Computing at the SNS - bnl.gov

Scientific Computing Group Responsibilities

• Creation and cataloging of NeXus files following an experiment run –Translation and Live Catalogingg g

• Data Reduction• Live Data Processingg• Visualization tools• Data Access – via the portal • Lines of Code developed:

– Data Reduction including GUIs (40/60): ~450k lines– Data Translation and Live Cataloging: ~50k– Portal: ~100k– Utilities: 35k

6 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 7: Scientific Computing at the SNS - bnl.gov

Strategies:• Generation I up to now• Generation I – up to now.

– Establish stable software platforms for operating instruments– Leverage and, as necessary, adapt existing softwareLeverage and, as necessary, adapt existing software

• Generation II – next 2 yearsy– Transition to produce and provide a coherent suite of

software across the instrument beam linesMore science focused– More science focused

• Generation III – after 2 yearsGeneration III after 2 years– Advanced Simulations– Incorporation of “blue skies” software projects

7 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

p p j– …

Page 8: Scientific Computing at the SNS - bnl.gov

Experimental Data FlowDAS Analysis Computer Data and Portal ServersDAS Analysis Computer Data and Portal Servers

Firewall

Live Data

Disk Data

Network DataUser Data

Experiment Shared Data

Experiment DataAccess

• Private Network

N t ll t d i• Translation service responds to end of

• NeXus and preNeXus files are copied to central disk

Local Storage

Local Storage

• Neutrons are collected in event mode

• Creates ~6 ‘pre-NeXus’ files (different formats)

responds to end of collection message

•NeXus file created from preNeXus + geometry

a e cop ed to ce t a d sarchive

• Data are available to users via computers and portal( )

• Read only mount

• DAS computer sends out a message once a run is

• Data is then catalogued (ICAT)

• Data are network mounted for other computers to access

D t i t d l t

8 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

gcomplete • Data is stored long term

and backed-up

Page 9: Scientific Computing at the SNS - bnl.gov

Translation Monitoring

• Enables viewings of the process of of the process of creating and cataloguing experimental data (NeXus files)

9 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 10: Scientific Computing at the SNS - bnl.gov

Data Archive for Facilities

10 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 11: Scientific Computing at the SNS - bnl.gov

NeXus Utilities

• Enable users to ‘control’ NeXus generation

O id G t– Override Geometry– Alter data binning

N X fil hi t• NeXus files are histograms• Event based NeXus coming soon.• Defining an ‘analysed’ NeXus

format.

11 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 12: Scientific Computing at the SNS - bnl.gov

A Look at SNS Data Sizes

Instrument Typical Histogram Size

NeXus File Size

Event File Size

Ref M 149MB 27.2MB 24.8MBRef_M 149MB 27.2MB 24.8MBBASIS 136MB 3.5MB 3.89MBPowgen 2.1GB 2.0MB 30.1MBo ge G

estimated0 30

ARCS 3.6GB 29.6MB 116.4MB Sequoia 4.3GB 34.69MB 136.2MBCNCS 0.8GB 14.8MB 4.3MBSNAP 7.7GB 83.1MB 118.9MBTOPAZ 14.9GB

estimated? ?

12 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

estimated

Page 13: Scientific Computing at the SNS - bnl.gov

Data Access/Download for Users (Data Portal))• Users get an XCAMS “account”

(username & password)

• Account gives them access to:Account gives them access to:- Beamline Analysis Computer- Central User Disk Space- Experimental Data

• Data Portal gives access to- User’s ‘Home’ area- Raw data - NeXus - Metadata browser for NeXusMetadata browser for NeXus- download/upload of files- 1D and 2D visualization tools

are built-in for NeXus files

• Data Search (ICAT)

13 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 14: Scientific Computing at the SNS - bnl.gov

Data Search

14 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 15: Scientific Computing at the SNS - bnl.gov

Data Access Requirement

• Strong requirement from users and beamline staff was for command line access to datafor command line access to data.

• We defined a storage hierarchy to support hosting data from multiple facilities with multiple beamlines from multiple facilities with multiple beamlines.

• Built infrastructure to support command line access and web portal accessand web portal access

• Used NFS with ACLs to restrict access to directories.W tl thi ki thi h!• We are currently rethinking this approach!

15 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 16: Scientific Computing at the SNS - bnl.gov

Data Policy

• Went for the simple option…

– The Principle Investigator owns the data.U l d t f i t th t f– Users can only see data for experiments they are part of.

– We permanently archive the data.

16 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 17: Scientific Computing at the SNS - bnl.gov

Live Data StreamingReduction GUIReduction GUI

DASAnalysis Computer Firewall

• Events and metadata are streamed from the DAS to the instrument analysis computer.

• Events are continuously processed and temporary

Computer Firewall

NeXus files and event lists are periodically produced.• Does not produce the data of record for the

experiment.• Can ‘pass through’ events to other programs.

Event Data Stream

17 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Can pass through events to other programs.

User Workstation

Page 18: Scientific Computing at the SNS - bnl.gov

Viewing the live event data (IsawEV)

18 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 19: Scientific Computing at the SNS - bnl.gov

Data Reduction

• C++ Core Algorithms• Python ‘Glue’• IDL user GUIs• Read NeXus files• Reduction can be split up by Reduction can be split up by

detector ‘bank’• Processed in parallel p

(SLURM) on analysis machines

19 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 20: Scientific Computing at the SNS - bnl.gov

Job Monitor

• Users can track the execution status of execution status of their jobs

20 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 21: Scientific Computing at the SNS - bnl.gov

Infrastructure Hardware

• 32 core, 256 GB RAM Machines (1 or 2 per beamline)– Physically located on beamlines

30” M it (1 2)– 30” Monitor (1 or 2)

• Gigabit Ethernet• Central Disks served by NFS

21 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 22: Scientific Computing at the SNS - bnl.gov

Community Codes – DAVE (NIST)

22 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Typical Data Size ~ 1-50 GB

Page 23: Scientific Computing at the SNS - bnl.gov

Community Codes - ISAW

Th ISAW li ti h • The ISAW application has been extended to support SNS instrumentsst u e ts– Reduces single crystal and

powder diffraction data C Q– Can reduce event data to Q-space directly (no corrections)

– Can perform data reduction in Can perform data reduction in distributed computing mode

– ISAW has 230K lines of code, 32K SCD specific 8K for event 32K SCD specific, 8K for event data processing, and 6K for 3D visualization

23 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 24: Scientific Computing at the SNS - bnl.gov

Other Software

• IDL• Matlab• Igor Pro• SigmaPlot• OriginOrigin

24 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 25: Scientific Computing at the SNS - bnl.gov

Remote Applications

• Applications hosted remotely and viewed locally

• No user software to install

• OS independent (Java + NX)

• Always have access to latest version of applicationof application

• Same software on Analysis computers

25 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 26: Scientific Computing at the SNS - bnl.gov

Data and Computing StatisticsCompute Jobsp

200000

250000

300000

350000

400000

Tota

l MonthlyCumulative

0

50000

100000

150000

Mar Apr May Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar

Month

• 10.3 TB of Experiment Data (to date)

• 4.6 TB in shared areas (reduced data)

• 6 5TB size of user home6.5TB size of user home areas

• 1,516,819 total files cataloged

26 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 27: Scientific Computing at the SNS - bnl.gov

Assessing from Here…

• More visualization tools needed – during and following an experimentan experiment

• Community produced analysis tools not always suited towards working with SNS data sizesg

• Better integration between experiment and analyses processesp

• More ‘commonality’ across science areas• Higher performance computing requested – this Higher performance computing requested – this

involves computers, disks, and network performance

27 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 28: Scientific Computing at the SNS - bnl.gov

Collaborations

• Software produced will be Open Source.R i d l f DOE d UT B tt ll /ORNL– Required approval from DOE and UT-Battelle/ORNL

– Required Export Control Approval

• Already working with the authors of ISAW and DAVE.C tl f i ft ll b ti ith ISIS b d • Currently forming a software collaboration with ISIS based upon the Mantid Python/C++ framework.

• Currently working with the DANSE team to develop the • Currently working with the DANSE team to develop the plan for how to transition DANSE software to ORNL.

28 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 29: Scientific Computing at the SNS - bnl.gov

Emerging Vision – Interconnected Facilities

• Support multi-technique data analysis

• SBIR Proposal• Thick Client Apps utilizing

web services at facilitiesf

SBIR Proposal– SNS, HFIR, APS, Lujan,

LCLS

29 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

• Inter-facility data movementSlide courtesy of Steve Miller

Page 30: Scientific Computing at the SNS - bnl.gov

Summary

• Data reduction software is available for all operating and commissioning instruments at SNS.

• Near-term software development focus areas are:– To provide additional data reduction visualization tools during

and following an experimentand following an experiment– To improve the performance for running data reduction jobs

• Development scope now includes providing new analysis • Development scope now includes providing new analysis software in addition to providing data reduction.

• The Scientific Computing Group is working more closely • The Scientific Computing Group is working more closely with science teams

30 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop

Page 31: Scientific Computing at the SNS - bnl.gov

Thank you.

Questions ?

31 Managed by UT-Battellefor the U.S. Department of Energy NSLS-II Workshop