NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high...

15
Katie Antypas, Brian Austin, Tina Butler, Richard Gerber, Cary Whitney, Nick Wright, Woo-Sun Yang, Zhengji Zhao NERSC Workload Analysis on Hopper 1 February 22, 2013

Transcript of NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high...

Page 1: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

Katie Antypas, Brian Austin, Tina Butler, Richard Gerber, Cary Whitney, Nick Wright, Woo-Sun Yang, Zhengji Zhao

NERSC Workload Analysis on Hopper

-­‐  1  -­‐  

February  22,  2013  

Page 2: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

Understanding the NERSC workload is key to procuring productive, high performing systems for science

•  Conducted  workload  analysis  to  understand  applica:on  requirements  and  guide  future  system  procurements  

•  Important  for  understanding  efforts  needed  to  transi:on  workload  to  future  architectures  

•  Analyzed  the  workload  by:  –  Science  area  –  Applica.on  code    –  3rd  party  applica.on  usage  –  Algorithm  –  Job  size  –  Memory  usage  –  Threading  usage  –  Library  usage  

•  I/O  workload  analysis  is  in  progress  and  will  be  ready  in  Mid-­‐March  2013  

-­‐  2  -­‐  

Page 3: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

Methods

•  Data  collected  in  this  presenta:on  came  from  a  variety  of  sources  –  System  accoun.ng  logs  

–  NIM  database  –  ALPS  command  line  capture  log    

–  Automa.c  Library  Tracking  Database  (ALTD)  –  Applica.on  Resource  Usage  (ARU)  Tool  

-­‐  3  -­‐  

Page 4: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

NERSC serves a broad range of science disciplines for the DOE Office of Science

NERSC  serves:  

•  Over  4500  users  •  Over  650  projects  

2012  Alloca*on  Breakdown    

Fusion  Energy  

Materials  Science  

LaKce  QCD  Chemistry  

Climate  

Astrophysics  

Biosciences  

Earth  Sciences  Math  and  CS  

Accelerator  Science  

Combus.on  

Nuclear  Physics  

Engineering  

Page 5: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

• 10 codes make up 50% of workload

• 25 codes make up 66% of workload

• 75 codes make up 85% of workload

• remaining codes make up bottom 15% of workload

Top  Applica*on  Codes  on  Hopper  by  Hours  Used  Jan  –  Nov  2012  

Approximately  80%  of  the  workload  needs  to  transfer  to  NERSC-­‐8    (20%  can  remain  on  Edison)  

Over 650 applications run on NERSC resources

Page 6: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

NERSC installs and directly supports software used by over 20% of workload

-­‐  6  -­‐  

Percent  of  workload  using  NERSC  installed  and  supported  applica*on  codes  (On  Hopper  by  Number  of  Hours)    

Page 7: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

In addition to science area and code diversity, "NERSC supports a broad range of algorithms

LaKce  QCD  

Density  Func.onal  Theory  

Fusion  PIC  

Climate  Quantum  Chemistry  QMC  

Fast  Math  

CMB  

Bioinforma.cs  

Molecular  Dynamics  

Fusion  Con.nuum  

Other  codes  

Accelerator  PIC  

Top  Codes  by  Algorithm  

Page 8: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

NERSC users run applications at every scale to conduct their research

2012  Job  Size  Breakdown  on  Hopper  

Jan  2012   Dec  2012  

Frac.o

n  of  hou

rs  

1  

0.8  

0.6  

0.4  

0.2  

0  

Approximately  15%  of  computa.onal  cycles  use  more  than  40%  of  the  compute  nodes  and  40%  of  cycles  use  more  than  10%  of  the  system.  

Page 9: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

80% of the Hopper workload uses less than or equal to 1 GB of memory per task

-­‐  9  -­‐  

Maximum  Memory  Usage  per  Task  on  Hopper  (11/2012  –  2/2013)  

GB  And  >90%  of  the  Hopper  workload  uses    2GBs  or  less  per  task    

Page 10: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

Over 20% of hours used on Hopper run with more than a single thread

-­‐  10  -­‐  

1  Thread;  79%  

2  Threads  3%  

3  Threads  5%  

4  Threads  1%  

6  Threads  10%  

24  Threads  1%  

Node  Hours  used  by  Applica*on  Thread  Count    (Hopper  system  11/2012-­‐2/2013)  

%  of  Hours  Used  by  Thread  Count  (Hopper  system  11/2012-­‐2/2013)  

Other  

Page 11: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

NERSC users require an array of software to conduct their research

-­‐  11  -­‐  

MPICH,  parallel-­‐I/O  and  math  libraries  are  crucial  packages  to  a  large  number  of  NERSC  users  

Uniqu

e  Users  

Page 12: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

Summary

•  NERSC  has  new  tools  to  capture  memory,  threading,  and  library  usage  on  Cray  systems  

•  New  tools  will  allow  workload  characteriza:on  on  both  Hopper  and  Edison  systems  

•  Understanding  workload  is  key  to  quan:fying  the  effort  that  will  be  required  to  transi:on  applica:ons  to  future  architectures    

•  Stay  tuned  for  I/O  workload  analysis  

-­‐  12  -­‐  

Page 13: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

Journal Cover Stories from NERSC-Enabled Research 2012

Devanathan,  PNNL:  BES   Jiang,  ORNL:  BES   Dagge`,  U.  Washington:  BER  Jiang,  ORNL:  BES  

Dupuis,  PNNL:  BES   Petrik,  PNNL:  BES   Jiang,  ORNL:  BES  

Liang,  U.  Maryland:  BER  

Smit,  UCB:  BES   Varga,  Vanderbilt:  BES  

Sugiyama,  MIT:  FES   Snurr,  Northwestern:  BES   Striolo,  U.  Oklahoma:  BES  

Page 14: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

In addition to science area and code diversity, "NERSC supports a broad range of algorithms

LaKce  QCD  Proxy:  MILC  

DFT  Proxies:  Quantum  Espresso  Berkeley  GW  

Fusion  PIC  Proxy:    GTC  

Climate  Proxies:  POP,  SE-­‐CAM,  MPAS,  WRF  

Quantum  Chemistry  Proxy:  NWChem  QMC  Proxy:  Zori  

Fast  Math  Proxies:  FLASH,  MAESTRO  

CMB  Proxy:  MADAM  

Bioinforma.cs  Proxy:  BLAST  

Molecular  Dynamics  Proxies:  NAMD,  Amber  

Fusion  Con.nuum  Proxy:  GYRO  

Other  codes  

Accelerator  PIC:  Impact  

Page 15: NERSC Workload Analysis v3Understanding the NERSC workload is key to procuring productive, high performing systems for science • Conducted(workload(analysis(to(understand(applica:on(

LaKce  QCD  Proxy:  MILC  

DFT  Proxies:  Quantum  Espresso  Berkeley  GW  

Fusion  PIC  Proxy:    GTC  

Climate  Proxies:  POP,  SE-­‐CAM,  MPAS,  WRF  

Quantum  Chemistry  Proxy:  NWChem  

QMC  Proxy:  Zori  

CMB  Proxy:  MADAM  Bioinforma.cs  Proxy:  BLAST  

Molecular  Dynamics  Proxies:  NAMD,  Amber  

Fusion  Con.nuum  Proxy:  GYRO  

Accelerator  PIC:  Impact  

Fast  Math  Proxies:  FLASH,  MAESTRO  

In addition to science area and code diversity, "NERSC supports a broad range of algorithms