Register training material
2 materials found

Keywords: ROCm 


AMD Profiling

The AMD profiling workshop covers the AMD suite of tools for development of HPC applications on AMD GPUs.

You will learn how to use the rocprof profiler and trace visualization tool that has long been available as part of the ROCm software suite.

You will also learn how to use the new...

Keywords: supercomputing, performance, GPUs, CPUs, AMD, HPC, ROCm

Resource type: activity

AMD Profiling https://dresa.org.au/materials/amd-profiling The AMD profiling workshop covers the AMD suite of tools for development of HPC applications on AMD GPUs. You will learn how to use the rocprof profiler and trace visualization tool that has long been available as part of the ROCm software suite. You will also learn how to use the new Omnitools - Omnitrace and Omniperf - that were introduced at the end of 2022. Omnitrace is a powerful tracing profiler for both CPU and GPU. It can collect data from a much wider range of sources and includes hardware counters and sampling approaches. Omniperf is a performance analysis tool that can help you pinpoint how your application is performing with a visual view of the memory hierarchy on the GPU as well as reporting the percentage of peak for many different measurements. training@pawsey.org.au supercomputing, performance, GPUs, CPUs, AMD, HPC, ROCm
Evaluate Application Performance using TAU and E4S

In this workshop, you learn about the Extreme-scale Scientific Software Stack and the TAU Performance System® and its interfaces to other tools and libraries. The workshop includes sample codes that illustrate the different instrumentation and measurement choices.

Topics covered include...

Keywords: supercomputing, TAU, E4S, Performance, ROCm, OpenMP

Resource type: activity

Evaluate Application Performance using TAU and E4S https://dresa.org.au/materials/evaluate-application-performance-using-tau-and-e4s In this workshop, you learn about the Extreme-scale Scientific Software Stack and the TAU Performance System® and its interfaces to other tools and libraries. The workshop includes sample codes that illustrate the different instrumentation and measurement choices. Topics covered include generating performance profiles and traces with memory utilization and headroom, I/O, and interfaces to ROCm, including ROCProfiler and ROCTracer with support for collecting hardware performance data. The workshop also covers instrumentation of OpenMP programs using OpenMP Tools Interface (OMPT), including support for target offload and measurement of a program’s memory footprint. During the session, there are hands-on activities on scalable tracing using OTF2 and visualization using the Vampir trace analysis tool. Performance data analysis using ParaProf and PerfExplorer are demonstrated using the performance data management framework (TAUdb) that includes TAU’s performance database. training@pawsey.org.au supercomputing, TAU, E4S, Performance, ROCm, OpenMP