The SpeedShop Profiler
- No recompilation or relinking is necessary
- Multiple performance experiments and views
- The following performance experiments are supported:
- Program Counter (PC) sampling -pcsamp
- to get exclusive time per subroutine and per source line
- no call stack information
- User Time -usertime
- to get exclusive & inclusive times per subroutine (call stack)
- no source line information
- Hardware counter profiling -”event”_hwc
- Communication overhead of MPI -mpi
- Floating point exception tracing -fpe
- Tracing of I/O system call -io
- malloc/free tracing -heap
- basic-block counting experiment -ideal
see man speedshop for explanation of the experiments
- usage:
- ssrun -exp [options] program [args]
- prof “ssrun output file”