2 materials found
Keywords: ROCm
AMD Profiling
https://www.youtube.com/playlist?list=PLmu61dgAX-aaQOCG5Jlw8oLBORJfoQC2o
https://dresa.org.au/materials/amd-profiling
The AMD profiling workshop covers the AMD suite of tools for development of HPC applications on AMD GPUs.
You will learn how to use the rocprof profiler and trace visualization tool that has long been available as part of the ROCm software suite.
You will also learn how to use the new Omnitools, Omnitrace and Omniperf, which were introduced at the end of 2022. Omnitrace is a powerful tracing profiler for both CPU and GPU code. It collects data from a much wider range of sources than rocprof, including hardware counters and sampling. Omniperf is a performance analysis tool that shows how your application is performing: it gives a visual view of the GPU memory hierarchy and reports the percentage of peak achieved for many different measurements.
training@pawsey.org.au
AMD
Pawsey Supercomputing Research Centre
supercomputing, performance, GPUs, CPUs, AMD, HPC, ROCm
Evaluate Application Performance using TAU and E4S
https://www.youtube.com/playlist?list=PLmu61dgAX-aakuGnuVPiWVaqCLgm3kdRG
https://dresa.org.au/materials/evaluate-application-performance-using-tau-and-e4s
In this workshop, you will learn about the Extreme-scale Scientific Software Stack (E4S) and the TAU Performance System®, including its interfaces to other tools and libraries. The workshop includes sample codes that illustrate the different instrumentation and measurement choices.
Topics covered include generating performance profiles and traces with memory utilization and headroom, I/O, and interfaces to ROCm, including ROCProfiler and ROCTracer with support for collecting hardware performance data.
The workshop also covers instrumentation of OpenMP programs using the OpenMP Tools Interface (OMPT), including support for target offload, and measurement of a program’s memory footprint.
During the session, there are hands-on activities on scalable tracing using OTF2 and visualization using the Vampir trace analysis tool. Performance data analysis with ParaProf and PerfExplorer is demonstrated using TAUdb, the performance data management framework that includes TAU’s performance database.
training@pawsey.org.au
Sameer Shende
Pawsey Supercomputing Research Centre
supercomputing, TAU, E4S, Performance, ROCm, OpenMP