A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case:...

18
A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics Magdalena Slawinska 1 Tanja Bode 2 Jeremy Logan 3 Michael Clark 2 Hongbo Zou 1 Matthew Kinsey 2 Matthew Wolf 1 Pablo Laguna 2 Scott Klasky 4 1 College of Computing Georgia Institute of Technology, Atlanta, GA 2 School of Physics Georgia Institute of Technology, Atlanta, GA 3 Remote Data Analysis and Visualization Center University of Tennessee, Oak Ridge, TN 4 Scientific Data Group Oak Ridge National Laboratory, Oak Ridge, TN 2013-07-23, XSEDE 2013, San Diego, CA

Transcript of A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case:...

Page 1: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

A Maya Use Case: Adaptable Scientific Workflowswith ADIOS for General Relativistic Astrophysics

Magdalena Slawinska1 Tanja Bode2 Jeremy Logan3

Michael Clark2 Hongbo Zou1 Matthew Kinsey2

Matthew Wolf1 Pablo Laguna2 Scott Klasky4

1College of ComputingGeorgia Institute of Technology, Atlanta, GA

2School of PhysicsGeorgia Institute of Technology, Atlanta, GA

3Remote Data Analysis and Visualization CenterUniversity of Tennessee, Oak Ridge, TN

4Scientific Data GroupOak Ridge National Laboratory, Oak Ridge, TN

2013-07-23, XSEDE 2013, San Diego, CA

Page 2: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Outline

Maya-RDAV Introduction

Numerical Relativistic Simulations with MayaMaya CodeData Processing and Analysis

Our Approach to Maya Workflow ImprovementIncorporating ADIOS into Maya as Fast I/O System

Incorporating ADIOS into MayaMagneto-hydrodynamicsParticles

How to Make Maya Simulations Interactive and Steerable at RuntimeExploiting Flexpath/ADIOS in Maya

Summary

Page 3: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Maya-RDAV Introduction

High-level Research Goals

I Enrich runtime for, and user’sexperience with, numerical relativistic(NR) simulations

I Provide apparatus for monitoring andinteractive steering to astrophysicssimulations at runtime

I Target XSEDE HPC and visualizationsystems to allow the user to connectKraken, Keeneland, and Nautilus

I Collaboration effort: active computational astrophysics andcomputer science

I Part of RDAV (Remote Data Analysis and Visualization) project

Page 4: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Maya Code

I 3D numerical relativistic calculations simulating creation andcollision of black holes

I Solves Einstein’s equations for black holes, stars, etc

I Based on Einstein Toolkit, Cactus, and Kranc; uses Carpet AMRmodule

I Uses Georgia Tech’s custom spacetime and hydrodynamicsevolution code

Page 5: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Maya Workflow

ADIOS Thorn

Maya Simulation

Simulation Machine

Parallel Storage

Stage Area

runview.sh h5movie.sh VisIt

AD

IOSB

P

plu

gin

Ca

rpet

IO

plu

gin

Summary directory:

contains .png, .html, movies etc

Website publish http://numrel.org

run-summary.sh

processed data

User

cron

submit runview job

run periodically

the data with basic processing

all the images of timeseries & 1d data, tarred data, .html

2d hdf5 data

ADIOSBPmovie.sh

Python script

movie frames

animation, poster frame images

Browse, check, if correct, etc; stop simulation if catastrophic

Data Processing Nodes Control path

Data path Data processing

Others

run control

generate

Data Visualization Importing/Exporting

Raw Data Data Processing

Data Factory

Page 6: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Data Processing and Analysis in Maya

Data

I Physical quantities of interest,e.g., matter density, magneticfield components,gravitational radiation intensity

I Periodic checkpoints

ADIOS Thorn

Maya Simulation

Simulation Machine

Parallel Storage

Stage Area

runview.sh h5movie.sh VisIt

AD

IOSB

P

plu

gin

Ca

rpet

IO

plu

gin

Summary directory:

contains .png, .html, movies etc

Website publish http://numrel.org

run-summary.sh

processed data

User

cron

submit runview job

run periodically

the data with basic processing

all the images of timeseries & 1d data, tarred data, .html

2d hdf5 data

ADIOSBPmovie.sh

Python script

movie frames

animation, poster frame images

Browse, check, if correct, etc; stop simulation if catastrophic

Data Processing Nodes Control path

Data path Data processing

Others

run control

generate

Data Visualization Importing/Exporting

Raw Data Data Processing

Data Factory

Processing and analysis

I Some physical quantities scrutinized duringin-simulation analysis (analysis thorns)

I Post-processing including interactiveanalysis, scripted plots, and movies

Page 7: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Maya I/O Characteristics and ChallengesI Expensive I/O: assumption of keeping I/O overhead at 40% of

walltime → write out a small number of variables with a lowerdimensionality

I Mostly static, shell-based, post-processing-based data analysisand visualization

I A typical simulation:I A couple of weeks on a local clusterI Recording 15 2D physical variables of interestI 21GB output data (15 3D variables would generate 4.5TB)I Utilize a combination of HDF5 and ASCII for writes

As a consequence

I Not all variables get written out. Experimenters pre-select a smallsubset for each run

I Requires reruns of the simulations if correct variables were notoutput

I Simulation data sets prone to corruptions with large files

Page 8: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Our Approach to Maya Workflow ImprovementI Incorporating new staging I/O technologies to reduce I/O

overheadI Enable outputting more physical variables of interestI Enable outputting higher dimensionality of variables

I Expanding the existing Maya workflow to address inlinevisualization and steering

Fast IO

Numerical Relativistic Simulation

HPC Simulation Machine

Parallel Storage

Stage Area

Fa

st IO

p

lug

in

Dynamic, in-line visualization

Visualization Machine

Control path

Data path

Data processing

Run, data stage-in

control

Data Processing and

Data Visualization

VisIt client

Processed Data Analysis

User’s Desktop

control control

VisIt

Data Factory

User

Page 9: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

ADIOS – Adaptable I/O System as Our Fast I/O

I A componentization of the I/O layer with a selection of various I/Omethods

I Designed to provide a scalable, portable, and efficient I/Osolution for various computer infrastructures

I In certain cases demonstrated I/O performance improvements offactor 1000 over established parallel file formats

I Successfully incorporated into several scientific codes includingGTC fusion code, GTS, Chimera, XGC0, XGC1, Flash, S3D

Page 10: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Advantages of Using ADIOS

I Ease switching to other transport method (MPI, MPI_AMR,Flexpath, etc)

I Offers memory-to-memory coupling solutions (Flexpath)I Self-describing BP file format

I Ability to avoid runtime costs related to file consistency validationI Footer index–does not necessitate moving it whenever data length

change occursI Resilience support

I More infowww.olcf.ornl.gov/center-projects/adios/

Page 11: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

How to Improve Maya Workflow with ADIOS

ADIOS Thorn

Maya Simulation

Simulation Machine

Parallel Storage

Stage Area

runview.sh h5movie.sh VisIt

AD

IOSB

P

plu

gin

Ca

rpet

IO

plu

gin

Summary directory:

contains .png, .html, movies etc

Website publish http://numrel.org

run-summary.sh

processed data

User

cron

submit runview job

run periodically

the data with basic processing

all the images of timeseries & 1d data, tarred data, .html

2d hdf5 data

ADIOSBPmovie.sh

Python script

movie frames

animation, poster frame images

Browse, check, if correct, etc; stop simulation if catastrophic

Data Processing Nodes Control path

Data path Data processing

Others

run control

generate

Data Visualization Importing/Exporting

Raw Data Data Processing

Data Factory

Page 12: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Incorporating ADIOS into Maya: CarpetIOADIOS ThornMagneto-hydrodynamics simulations

I Early prototype demonstrated animprovement in output performancefrom 3-5 minutes for a full I/Ocheckpoint using HDF5 to 6 secondsusing ADIOS on Kraken

I After a number of tuning operationson NICS machines, the equivalentHDF5 runs take 15 secs on Kraken

I This is still a 60% improvement inoutput performance

I Hopefully, the CarpetIOADIOS thornwill be released in a couple ofmonths

Page 13: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Incorporating ADIOS into Maya: ParticlesParticle-based simulations

I Early prototype shows 35%improvement over standardI/O methods in the simulationof 1 million particles runningon 192 threads with 8OpenMP threads per MPIprocess

I Ease of adding ADIOSsupport to particle basedsimulations (one afternoon +accelerators ;) )

Page 14: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Incorporating ADIOS into Maya: Current Progress

I Due to recent change in ADIOS API,we had to modify our ADIOS-basedcheckpoint file format

I Validating the new file formatI Debugging and performance

evaluation of the newestCarpetIOADIOS thorn on Krakenand Nautilus

Page 15: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Exploiting Flexpath – ADIOS Transport Method – in MayaI Included in ADIOS spec and distributionI One of standard ADIOS transport methods – switching to

Flexpath is easy:

// config.xml<method group="temperature" method="FLEXPATH">QUEUE_SIZE=4;</method>

// writeradios_init ("config.xml", comm);

// readeradios_read_init_method(ADIOS_READ_METHOD_FLEXPATH, comm, "");ADIOS_FILE* afile = adios_read_open("config.xml",

ADIOS_READ_METHOD_FLEXPATH, comm, ADIOS_LOCKMODE_NONE, 0.0);.....adios_read_finalize_method(ADIOS_READ_METHOD_FLEXPATH);

Advantages of using Flexpath

I Will allow to gain on-demand insights into simulations at runtime

I Will offer dynamic exploration of data

I Fast I/O and binary filtering will enable the scientist to formulatevery specific queries related to NR systems being evolved

Page 16: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Flexpath Technical Details

I Provides memory-to-memory coupling

I Allows for memory-to-disk and disk-to-memory operations usingefficient buffered I/O in the staging area (as part of ADIOS)

I An event-based method (EVPath) with inline transformationsupport

I Uses in-system JIT compiler to deploy binary filters at runtime

I Offers multi-protocol overlay networking, including multi-machine,advanced shared memory support, networking over RDMA(Remote Direct Memory Access)

Page 17: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Summary

I Maya-RDAV projectI Enrich runtime for numerical relativistic simulations and allow

physicists to interactively monitor and steer computations atruntime

I ApproachI Improve Maya workflow by incorporating ADIOS

I ImplementationI Cataloged existing Maya workflowI ADIOS integration: Carpet I/O thorn + VisIt ADIOS plugins for

(magneto-)hydrodynamics and particle-based NR simulationsI Flexpath: providing memory-to-memory coupling + advanced

binary filtering to enable dynamic exploration of data

Page 18: A Maya Use Case: Adaptable Scientific Workflows with ADIOS ... · 7/23/2013  · A Maya Use Case: Adaptable Scientific Workflows with ADIOS for General Relativistic Astrophysics

Acknowledgments

I NSF grant:ARRA-NSF-OCI-0906324

Future Work

I Performing more aggressiveanalytics with large datasets

I Reslicing data across all space-time

I Adding HTML5 extensions to theexisting workflow

THANK YOU? || #