Tutorials to learn how to use STAN
Stan tutorials offer links to exceptional tutorial papers, videos and statistics to learn Bayesian statistical methods and applied statistics.
Keywords: Statistics, applied statistics, Bayesian statistics, R software, Python, MATLAB
Tutorials to learn how to use STAN
https://mc-stan.org/users/documentation/tutorials.html
https://dresa.org.au/materials/tutorials-to-learn-how-to-use-stan
Stan tutorials offer links to exceptional tutorial papers, videos and statistics to learn Bayesian statistical methods and applied statistics.
https://mc-stan.org/about/team/
Statistics, applied statistics, Bayesian statistics, R software, Python, MATLAB
Species Distribution Modelling in R
This set of scripts and videos provide an introduction to running SDMs in R and include some steps to consider that go beyond what's available in the EcoCommons SDM point-and-click tools.
Five videos include: 1. An introduction to SDM in R, 2. occurrence data, 3. environmental data, 4. fitting...
Keywords: Species Distribution Modelling, Ecology, R software, EcoCommons
Species Distribution Modelling in R
https://www.ecocommons.org.au/educational-material4-mastering-species-distribution-modelling-in-r/
https://dresa.org.au/materials/species-distribution-modelling-in-r
This set of scripts and videos provide an introduction to running SDMs in R and include some steps to consider that go beyond what's available in the EcoCommons SDM point-and-click tools.
Five videos include: 1. An introduction to SDM in R, 2. occurrence data, 3. environmental data, 4. fitting your model, 5. model evaluation
Scripts and files are available here:
https://github.com/EcoCommons-Australia/educational_material/tree/main/SDMs_in_R/Scripts
Scripts for all four modules are here: https://www.ecocommons.org.au/wp-content/uploads/EcoCommons_steps_1_to_4.html
https://www.ecocommons.org.au/contact/
https://orcid.org/0000-0002-1359-5133
Species Distribution Modelling, Ecology, R software, EcoCommons
ugrad
mbr
phd
WORKSHOP: Single cell RNAseq analysis in R
This record includes training materials associated with the Australian BioCommons workshop ‘Single cell RNAseq analysis in R’. This workshop took place over two, 3.5 hour sessions on 22 and 3 August 2022.
Event description
Analysis and interpretation of single cell RNAseq (scRNAseq) data...
Keywords: Bioinformatics, Analysis, Transcriptomics, R software, Single cell RNAseq, scRNAseq
WORKSHOP: Single cell RNAseq analysis in R
https://zenodo.org/records/7072910
https://dresa.org.au/materials/workshop-single-cell-rnaseq-analysis-in-r-4f60b82d-2f1e-4021-9569-6955878dd945
This record includes training materials associated with the Australian BioCommons workshop ‘Single cell RNAseq analysis in R’. This workshop took place over two, 3.5 hour sessions on 22 and 3 August 2022.
Event description
Analysis and interpretation of single cell RNAseq (scRNAseq) data requires dedicated workflows. In this hands-on workshop we will show you how to perform single cell analysis using Seurat - an R package for QC, analysis, and exploration of single-cell RNAseq data.
We will discuss the ‘why’ behind each step and cover reading in the count data, quality control, filtering, normalisation, clustering, UMAP layout and identification of cluster markers. We will also explore various ways of visualising single cell expression data.
This workshop is presented by the Australian BioCommons and Queensland Cyber Infrastructure Foundation (QCIF) with the assistance of a network of facilitators from the national Bioinformatics Training Cooperative.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
scRNAseq_Slides (PDF): Slides used to introduce topics
scRNAseq_Schedule (PDF): A breakdown of the topics and timings for the workshop
scRNAseq_Resources (PDF): A list of resources recommended by trainers and participants
scRNAseq_QandA(PDF): Archive of questions and their answers from the workshop Slack Channel.
Materials shared elsewhere:
This workshop follows the tutorial ‘scRNAseq Analysis in R with Seurat’
https://swbioinf.github.io/scRNAseqInR_Doco/index.html
This material is based on the introductory Guided Clustering Tutorial tutorial from Seurat.
It is also drawing from a similar workshop held by Monash Bioinformatics Platform Single-Cell-Workshop, with material here.
Melissa Burke (melissa@biocommons.org.au)
Williams, Sarah
Mehdi, Ahmed (orcid: 0000-0002-9300-2341)
Matigan, Nick
Barugahare, Adele (orcid: 0000-0002-8976-0094)
Harrison, Paul (orcid: 0000-0002-3980-268X)
Morgan, Steven (orcid: 0000-0001-6038-6126)
Whitfield, Holly (orcid: 0000-0002-7282-387X)
Bioinformatics, Analysis, Transcriptomics, R software, Single cell RNAseq, scRNAseq
WORKSHOP: R: fundamental skills for biologists
This record includes training materials associated with the Australian BioCommons workshop ‘R: fundamental skills for biologists’. This workshop took place over four, three-hour sessions on 1, 8, 15 and 22 June 2022.
Event description
Biologists need data analysis skills to be able to...
Keywords: Bioinformatics, Analysis, Statistics, R software, RStudio, Data visualisation
WORKSHOP: R: fundamental skills for biologists
https://zenodo.org/records/6766951
https://dresa.org.au/materials/workshop-r-fundamental-skills-for-biologists-81aa00db-63ad-4962-a7ac-b885bf9f676b
This record includes training materials associated with the Australian BioCommons workshop ‘R: fundamental skills for biologists’. This workshop took place over four, three-hour sessions on 1, 8, 15 and 22 June 2022.
Event description
Biologists need data analysis skills to be able to interpret, visualise and communicate their research results. While Excel can cover some data analysis needs, there is a better choice, particularly for large and complex datasets.
R is a free, open-source software and programming language that enables data exploration, statistical analysis, visualisation and more. The large variety of R packages available for analysing biological data make it a robust and flexible option for data of all shapes and sizes.
Getting started can be a little daunting for those without a background in statistics and programming. In this workshop we will equip you with the foundations for getting the most out of R and RStudio, an interactive way of structuring and keeping track of your work in R. Using biological data from a model of influenza infection, you will learn how to efficiently and reproducibly organise, read, wrangle, analyse, visualise and generate reports from your data in R.
Topics covered in this workshop include:
Spreadsheets, organising data and first steps with R
Manipulating and analysing data with dplyr
Data visualisation
Summarized experiments and getting started with Bioconductor
This workshop is presented by the Australian BioCommons and Saskia Freytag from WEHI with the assistance of a network of facilitators from the national Bioinformatics Training Cooperative.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Schedule (PDF): A breakdown of the topics and timings for the workshop
Recommended resources (PDF): A list of resources recommended by trainers and participants
Q_and_A(PDF): Archive of questions and their answers from the workshop Slack Channel.
Materials shared elsewhere:
This workshop follows the tutorial ‘Introduction to data analysis with R and Bioconductor’ which is publicly available.
https://saskiafreytag.github.io/biocommons-r-intro/
This is derived from material produced as part of The Carpentries Incubator project
https://carpentries-incubator.github.io/bioc-intro/
Melissa Burke (melissa@biocommons.org.au)
Freytag, Saskia (orcid: 0000-0002-2185-7068)
Barugahare, Adele (orcid: 0000-0002-8976-0094)
Doyle, Maria
Ansell, Brendan (orcid: 0000-0003-0297-897X)
Varshney, Akriti
Bourke, Caitlin (orcid: 0000-0002-4466-6563)
Conradsen, Cara (orcid: 0000-0001-9797-3412)
Jung, Chol-Hee (orcid: 0000-0002-2992-3162)
Sandoval, Claudia
Chandrananda, Dineika (orcid: 0000-0002-8834-9500)
Zhang, Eden (orcid: 0000-0003-0294-3734)
Rosello, Fernando (orcid: 0000-0003-3885-8777)
Iacono, Giulia (orcid: 0000-0002-1527-0754)
Tarasova, Ilariya (orcid: 0000-0002-0895-9385)
Chung, Jessica (orcid: 0000-0002-0627-0955)
Moffet, Joel
Gustafsson, Johan (orcid: 0000-0002-2977-5032)
Ding, Ke
Feher, Kristen
Perlaza-Jimenez, Laura (orcid: 0000-0002-8511-1134)
Crowe, Mark (orcid: 0000-0002-9514-2487)
Ma, Mengyao
Kandhari, Nitika (orcid: 0000-0002-0261-727X)
Williams, Sarah
Nelson, Tiffanie (orcid: 0000-0002-5341-312X)
Schreiber, Veronika (orcid: 0000-0001-6088-7828)
Pinzon Perez, William
Bioinformatics, Analysis, Statistics, R software, RStudio, Data visualisation
WORKSHOP: Working with genomics sequences and features in R with Bioconductor
This record includes training materials associated with the Australian BioCommons workshop ‘Working with genomics sequences and features in R with Bioconductor’. This workshop took place on 23 September 2021.
Workshop description
Explore the many useful functions that the Bioconductor...
Keywords: R software, Bioconductor, Bioinformatics, Analysis, Genomics, Sequence analysis
WORKSHOP: Working with genomics sequences and features in R with Bioconductor
https://zenodo.org/records/5781776
https://dresa.org.au/materials/workshop-working-with-genomics-sequences-and-features-in-r-with-bioconductor-8399bf0d-1e9e-48f3-a840-3f70f23254bb
This record includes training materials associated with the Australian BioCommons workshop ‘Working with genomics sequences and features in R with Bioconductor’. This workshop took place on 23 September 2021.
Workshop description
Explore the many useful functions that the Bioconductor environment offers for working with genomic data and other biological sequences.
DNA and proteins are often represented as files containing strings of nucleic acids or amino acids. They are associated with text files that provide additional contextual information such as genome annotations.
This workshop provides hands-on experience with tools, software and packages available in R via Bioconductor for manipulating, exploring and extracting information from biological sequences and annotation files. We will look at tools for working with some commonly used file formats including FASTA, GFF3, GTF, methods for identifying regions of interest, and easy methods for obtaining data packages such as genome assemblies.
This workshop is presented by the Australian BioCommons and Monash Bioinformatics Platform with the assistance of a network of facilitators from the national Bioinformatics Training Cooperative.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Schedule (PDF): schedule for the workshop providing a breakdown of topics and timings
Materials shared elsewhere:
This workshop follows the tutorial ‘Working with DNA sequences and features in R with Bioconductor - version 2’ developed for Monash Bioinformatics Platform and Monash Data Fluency by Paul Harrison.
https://monashdatafluency.github.io/r-bioc-2/
Melissa Burke (melissa@biocommons.org.au)
Harrison, Paul (orcid: 0000-0002-3980-268X)
Deshpande, Nandan (orcid: 0000-0002-0324-8728)
Barugahare, Adele (orcid: 0000-0002-8976-0094)
Perry, Andrew (orcid: 0000-0001-9256-6068)
Wong, Nick (orcid: 0000-0003-4393-7541)
Reames, Benjamin
R software, Bioconductor, Bioinformatics, Analysis, Genomics, Sequence analysis
PCon Preparing applications for El Capitan and beyond
As Lawrence Livermore National Laboratories (LLNL) prepares to stand up its next supercomputer, El Capitan, application teams prepare to pivot to another GPU architecture.
This talk presents how the LLNL application teams made the transition from distributed-memory, CPU-only architectures to...
Keywords: GPUs, supercomputing, HPC, PaCER
PCon Preparing applications for El Capitan and beyond
https://www.youtube.com/watch?v=cj7a7gWgt8o&list=PLmu61dgAX-aZ_aa6SmmExSJtXGS7L_BF9&index=4
https://dresa.org.au/materials/pcon-preparing-applications-for-el-capitan-and-beyond
As Lawrence Livermore National Laboratories (LLNL) prepares to stand up its next supercomputer, El Capitan, application teams prepare to pivot to another GPU architecture.
This talk presents how the LLNL application teams made the transition from distributed-memory, CPU-only architectures to GPUs. They share institutional best practices. They discuss new open-source software products as tools for porting and profiling applications and as avenues for collaboration across the computational science community.
Join LLNL's Erik Draeger and Jane Herriman, who presented this talk at Pawsey's PaCER Conference in September 2023.
training@pawsey.org.au
Erik Draeger
Jane Herriman
Pawsey Supercomputing Research Centre
GPUs, supercomputing, HPC, PaCER
masters
phd
researcher
ecr
support
professional
ugrad
OpenCL
Supercomputers make use of accelerators from a variety of different hardware vendors, using devices such as multi-core CPU’s, GPU’s and even FPGA’s. OpenCL is a way for your HPC application to make effective use of heterogeneous computing devices, and to avoid code refactoring for new HPC...
Keywords: supercomputing, Pawsey Supercomputing Centre, CPUs, GPUs, OpenCL, FPGAs
Resource type: activity
OpenCL
https://www.youtube.com/playlist?list=PLmu61dgAX-aa_lk5fby5PjuS49snHpyYL
https://dresa.org.au/materials/opencl
Supercomputers make use of accelerators from a variety of different hardware vendors, using devices such as multi-core CPU’s, GPU’s and even FPGA’s. OpenCL is a way for your HPC application to make effective use of heterogeneous computing devices, and to avoid code refactoring for new HPC infrastructure.
training@pawsey.org.au
Toby Potter
Pawsey Supercomputing Research Centre
Pelagos
Toby Potter
supercomputing, Pawsey Supercomputing Centre, CPUs, GPUs, OpenCL, FPGAs
masters
ecr
researcher
support
AMD Profiling
The AMD profiling workshop covers the AMD suite of tools for development of HPC applications on AMD GPUs.
You will learn how to use the rocprof profiler and trace visualization tool that has long been available as part of the ROCm software suite.
You will also learn how to use the new...
Keywords: supercomputing, performance, GPUs, CPUs, AMD, HPC, ROCm
Resource type: activity
AMD Profiling
https://www.youtube.com/playlist?list=PLmu61dgAX-aaQOCG5Jlw8oLBORJfoQC2o
https://dresa.org.au/materials/amd-profiling
The AMD profiling workshop covers the AMD suite of tools for development of HPC applications on AMD GPUs.
You will learn how to use the rocprof profiler and trace visualization tool that has long been available as part of the ROCm software suite.
You will also learn how to use the new Omnitools - Omnitrace and Omniperf - that were introduced at the end of 2022. Omnitrace is a powerful tracing profiler for both CPU and GPU. It can collect data from a much wider range of sources and includes hardware counters and sampling approaches. Omniperf is a performance analysis tool that can help you pinpoint how your application is performing with a visual view of the memory hierarchy on the GPU as well as reporting the percentage of peak for many different measurements.
training@pawsey.org.au
AMD
Pawsey Supercomputing Research Centre
supercomputing, performance, GPUs, CPUs, AMD, HPC, ROCm
HIP Workshop
The Heterogeneous Interface for Portability (HIP) provides a programming framework for harnessing the compute capabilities of multicore processors, such as the MI250X GPU’s on Setonix.
In this course we focus on the essentials of developing HIP applications with a focus on...
Keywords: HIP, supercomputing, Programming, GPUs, MPI, debugging
Resource type: full-course
HIP Workshop
https://support.pawsey.org.au/documentation/display/US/Pawsey+Training+Resources
https://dresa.org.au/materials/hip-workshop
The Heterogeneous Interface for Portability (HIP) provides a programming framework for harnessing the compute capabilities of multicore processors, such as the MI250X GPU’s on Setonix.
In this course we focus on the essentials of developing HIP applications with a focus on supercomputing.
Agenda
- Introduction to HIP and high level features
- How to build and run applications on Setonix with HIP and MPI
- A complete line-by-line walkthrough of a HIP-enabled application
- Tools and techniques for debugging and measuring the performance of HIP applications
training@pawsey.org.au
Pelagos
Pawsey Supercomputing Research Centre
HIP, supercomputing, Programming, GPUs, MPI, debugging
VOSON Lab Code Blog
The VOSON Lab Code Blog is a space to share methods, tips, examples and code. Blog posts provide techniques to construct and analyse networks from various API and other online data sources, using the VOSON open-source software and other R based packages.
Keywords: visualisation, Data analysis, data collections, R software, Social network analysis, social media data, Computational Social Science, quantitative, Text Analytics
Resource type: tutorial, other
VOSON Lab Code Blog
https://vosonlab.github.io/
https://dresa.org.au/materials/voson-lab-code-blog
The VOSON Lab Code Blog is a space to share methods, tips, examples and code. Blog posts provide techniques to construct and analyse networks from various API and other online data sources, using the VOSON open-source software and other R based packages.
robert.ackland@anu.edu.au
visualisation, Data analysis, data collections, R software, Social network analysis, social media data, Computational Social Science, quantitative, Text Analytics
researcher
support
phd
masters
Porting the multi-GPU SELF-Fluids code to HIPFort
In this presentation by Dr. Joseph Schoonover of Fluid Numerics LLC, Joe shares their experience with the porting process for SELF-Fluids from multi-GPU CUDA-Fortran to multi-GPU HIPFort.
The presentation covers the design principles and roadmap for SELF and the strategy to port from...
Keywords: AMD, GPUs, supercomputer, supercomputing
Resource type: presentation
Porting the multi-GPU SELF-Fluids code to HIPFort
https://docs.google.com/presentation/d/1JUwFkrHLx5_hgjxsix8h498_YqvFkkcefNYbu-DsHio/edit#slide=id.g10626504d53_0_0
https://dresa.org.au/materials/porting-the-multi-gpu-self-fluids-code-to-hipfort
In this presentation by Dr. Joseph Schoonover of Fluid Numerics LLC, Joe shares their experience with the porting process for SELF-Fluids from multi-GPU CUDA-Fortran to multi-GPU HIPFort.
The presentation covers the design principles and roadmap for SELF and the strategy to port from Nvidia-only platforms to AMD & Nvidia GPUs. Also discussed are the hurdles encountered along the way and considerations for developing multi-GPU accelerated applications in Fortran.
SELF is an object-oriented Fortran library that supports the implementation of Spectral Element Methods for solving partial differential equations. SELF-Fluids is an implementation of SELF that solves the compressible Navier Stokes equations on CPU only and GPU accelerated compute platforms using the Discontinuous Galerkin Spectral Element Method. The SELF API is designed based on the assumption that SEM developers and researchers need to be able to implement derivatives in 1-D and divergence, gradient, and curl in 2-D and 3-D on scalar, vector, and tensor functions using spectral collocation, continuous Galerkin, and discontinuous Galerkin spectral element methods.
The presentation discussion is placed in context of the Exascale era, where we're faced with a zoo of available compute hardware. Because of this, SELF routines provide support for GPU acceleration through AMD’s HIP and support for multi-core, multi-node, and multi-GPU platforms with MPI.
training@pawsey.org.au
Joe Schoonover
AMD, GPUs, supercomputer, supercomputing
Embracing new solutions for in-situ visualisation
This PPT was used by Jean Favre, senior visualisation software engineer at CSCS, the Swiss National Supercomputing Centre during his presentation at P'Con '21 (Pawsey's first PaCER Conference).
This material discusses the upcoming release of ParaView v5.10, a leading scientific visualisation...
Keywords: ParaView, GPUs, supercomputer, supercomputing, visualisation, data visualisation
Resource type: presentation
Embracing new solutions for in-situ visualisation
https://github.com/jfavre/InSitu/blob/master/InSitu-Revisited.pdf
https://dresa.org.au/materials/embracing-new-solutions-for-in-situ-visualisation
This PPT was used by Jean Favre, senior visualisation software engineer at CSCS, the Swiss National Supercomputing Centre during his presentation at P'Con '21 (Pawsey's first PaCER Conference).
This material discusses the upcoming release of ParaView v5.10, a leading scientific visualisation application. In this release ParaView consolidates its implementation of the Catalyst API, a specification developed for simulations and scientific data producers to analyse and visualise data in situ.
The material reviews some of the terminology and issues of different in-situ visualisation scenarios, then reviews early Data Adaptors for tight-coupling of simulations and visualisation solutions. This is followed by an introduction of Conduit, an intuitive model for describing hierarchical scientific data. Both ParaView-Catalyst and Ascent use Conduit’s Mesh Blueprint, a set of conventions to describe computational simulation meshes.
Finally, the materials present CSCS’ early experience in adopting ParaView-Catalyst and Ascent via two concrete examples of instrumentation of some proxy numerical applications.
training@pawsey.org.au
Jean Favre
ParaView, GPUs, supercomputer, supercomputing, visualisation, data visualisation