TAU Performance System tau.uoregon
TAU Performance System, IBM Blue Gene Consortium
TAU Parallel Performance System
http://tau.uoregon.edu/
• Multi-level performance instrumentation
  • Multi-language automatic source instrumentation
• Flexible and configurable performance measurement
• Widely-ported parallel performance profiling system
  • Computer system architectures and operating systems
  • Different programming languages and compilers
• Support for multiple parallel programming paradigms
  • Multi-threading, message passing, mixed-mode, hybrid
TAU Port to IBM BG/P
• Supports automatic instrumentation at:
  • Source level (PDT, tau_instrumentor; KOJAK, opari)
  • MPI
• Flexible and configurable performance measurement
  • Support for profiling and tracing
  • Support for PAPI counters on BG/P
• Uses bgxlC_r, bgxlc_r, bgxlf90_r as compilers
• To configure TAU:
  ./installtau -arch=bgp -mpi -pdt=<dir> -pdt_c++=xlC -papi=<dir>
  ./tau_validate --html --build bgp >& results.html
• Parallel Profile Analysis:
  • Paraprof profile browser
  • PerfDMF profile database
  • Perfexplorer cross-experiment data analysis toolkit
Using TAU on IBM BGP (surveyor.alcf.anl.gov)
• Choose measurement configuration
% ls /soft/apps/tau/tau_latest/bgp/lib/Makefile.*
Makefile.tau-mpi-pdt
Makefile.tau-mpi-pdt-trace
Makefile.tau-callpath-mpi-pdt
Makefile.tau-callpath-mpi-compensate-pdt
Makefile.tau-depthlimit-mpi-pdt
Makefile.tau-mpi-compensate-pdt
Makefile.tau-multiplecounters-mpi-papi-pdt
Makefile.tau-multiplecounters-mpi-papi-pdt-trace
Makefile.tau-multiplecounters-papi-pdt
Makefile.tau-multiplecounters-pthread-papi-pdt
Makefile.tau-pdt
Makefile.tau-phase-multiplecounters-mpi-compensate-papi-pdt
Makefile.tau-phase-multiplecounters-mpi-papi-pdt
Makefile.tau-pthread-pdt
…
% setenv TAU_MAKEFILE /soft/apps/tau/tau-2.17/bgp/lib/Makefile.tau-mpi-pdt
% set path=(/soft/apps/tau/tau-2.17/ppc64/bin $path) # Front-end binaries
• Replace mpixlf90_r with tau_f90.sh and compile your application
• Use tau_cxx.sh and tau_cc.sh for C++ and C compilers respectively
• Visualize performance data with paraprof, pprof, vampir, jumpshot
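The stub makefile names in the listing above appear to encode the measurement configuration as hyphen-separated tags (mpi, pdt, papi, trace, callpath, and so on). As a minimal sketch of how one might pick a value for TAU_MAKEFILE from that listing, the Python helper below filters the names by required tags. It is not part of TAU: the function names options_of and find_makefiles are hypothetical, and treating every hyphen-separated token as an independent tag is an assumption made for illustration.

```python
# Stub makefile names copied from the `ls` listing above (the listing is
# truncated with an ellipsis in the source, so this list may be incomplete).
STUB_MAKEFILES = [
    "Makefile.tau-mpi-pdt",
    "Makefile.tau-mpi-pdt-trace",
    "Makefile.tau-callpath-mpi-pdt",
    "Makefile.tau-callpath-mpi-compensate-pdt",
    "Makefile.tau-depthlimit-mpi-pdt",
    "Makefile.tau-mpi-compensate-pdt",
    "Makefile.tau-multiplecounters-mpi-papi-pdt",
    "Makefile.tau-multiplecounters-mpi-papi-pdt-trace",
    "Makefile.tau-multiplecounters-papi-pdt",
    "Makefile.tau-multiplecounters-pthread-papi-pdt",
    "Makefile.tau-pdt",
    "Makefile.tau-phase-multiplecounters-mpi-compensate-papi-pdt",
    "Makefile.tau-phase-multiplecounters-mpi-papi-pdt",
    "Makefile.tau-pthread-pdt",
]

PREFIX = "Makefile.tau-"


def options_of(name):
    """Return the set of hyphen-separated tags in a stub makefile name.

    Assumption: every token after the 'Makefile.tau-' prefix is one tag.
    """
    if not name.startswith(PREFIX):
        raise ValueError(f"not a TAU stub makefile name: {name!r}")
    return frozenset(name[len(PREFIX):].split("-"))


def find_makefiles(required, names=STUB_MAKEFILES):
    """Return the names that carry every required tag.

    Results are ordered by total tag count (fewest first), then by name,
    so the plainest matching configuration comes first.
    """
    required = frozenset(required)
    matches = [n for n in names if required <= options_of(n)]
    return sorted(matches, key=lambda n: (len(options_of(n)), n))


if __name__ == "__main__":
    # Plain MPI profiling with PDT source instrumentation.
    print(find_makefiles({"mpi", "pdt"})[0])
    # MPI tracing with PAPI counters.
    print(find_makefiles({"mpi", "papi", "trace"}))
```

The first match for {"mpi", "pdt"} is Makefile.tau-mpi-pdt, the same file the setenv line above selects. The full path prefix still has to be supplied by hand, since the directory depends on the installed TAU version.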
Sedov 2D Auto
• Initial test run did not include a load-balanced problem
• Small problem: too little work for 1024 processors
• Proof of concept to validate porting of tools
TAU PerfExplorer: Runtime Breakdown
[Figure: runtime breakdown chart; visible labels include MPI_Barrier and IO_OUTPUT]
TAU Portal
• TAU portal supports FLASH regression testing
• Allows groups to share profiling data in a secure way
• Allows users to launch TAU performance tools (paraprof, perfexplorer)
• Nightly regression test cases uploaded to the database automatically
• SVN checkout each night
TAU: http://www.cs.uoregon.edu/research/tau
TAU Portal: http://tau.nic.uoregon.edu