Network Know-how and Data Handling Workshop
This workshop is a ‘train-the-trainer’ session that covers topics such as jargon busting, network literacy and data movement solutions. The workshop will also provide a peek at some collaborative research tools such as Jupyter Notebooks and CloudStor. You will learn about networks, integrated...
Keywords: Networks, data handling
Resource type: lesson, presentation
Network Know-how and Data Handling Workshop
https://zenodo.org/record/6403757#.Yk-Gl8gza70
https://dresa.org.au/materials/network-know-how-and-data-handling-workshop
This workshop is a ‘train-the-trainer’ session that covers topics such as jargon busting, network literacy and data movement solutions. The workshop will also provide a peek at some collaborative research tools such as Jupyter Notebooks and CloudStor. You will learn about networks, integrated tools, data and storage and where all these things fit in the researcher’s toolkit.
This workshop is targeted at staff who would like to be more confident in giving advice to researchers about the options available to them. It is especially tailored for those with little to no technical knowledge and includes a hands-on component, using basic programming commands, but requires no previous knowledge of programming.
Sara King - sara.king@aarnet.edu.au
King, Sara (orcid: 0000-0003-3199-5592)
Mason, Ingrid (orcid: 0000-0002-0658-6095)
Burke, Melissa (orcid: 0000-0002-5571-8664)
Networks, data handling
Porting the multi-GPU SELF-Fluids code to HIPFort
In this presentation by Dr. Joseph Schoonover of Fluid Numerics LLC, Joe shares their experience with the porting process for SELF-Fluids from multi-GPU CUDA-Fortran to multi-GPU HIPFort.
The presentation covers the design principles and roadmap for SELF and the strategy to port from...
Keywords: AMD, GPUs, supercomputer, supercomputing
Resource type: presentation
Porting the multi-GPU SELF-Fluids code to HIPFort
https://docs.google.com/presentation/d/1JUwFkrHLx5_hgjxsix8h498_YqvFkkcefNYbu-DsHio/edit#slide=id.g10626504d53_0_0
https://dresa.org.au/materials/porting-the-multi-gpu-self-fluids-code-to-hipfort
In this presentation by Dr. Joseph Schoonover of Fluid Numerics LLC, Joe shares their experience with the porting process for SELF-Fluids from multi-GPU CUDA-Fortran to multi-GPU HIPFort.
The presentation covers the design principles and roadmap for SELF and the strategy to port from Nvidia-only platforms to AMD & Nvidia GPUs. Also discussed are the hurdles encountered along the way and considerations for developing multi-GPU accelerated applications in Fortran.
SELF is an object-oriented Fortran library that supports the implementation of Spectral Element Methods for solving partial differential equations. SELF-Fluids is an implementation of SELF that solves the compressible Navier Stokes equations on CPU only and GPU accelerated compute platforms using the Discontinuous Galerkin Spectral Element Method. The SELF API is designed based on the assumption that SEM developers and researchers need to be able to implement derivatives in 1-D and divergence, gradient, and curl in 2-D and 3-D on scalar, vector, and tensor functions using spectral collocation, continuous Galerkin, and discontinuous Galerkin spectral element methods.
The presentation discussion is placed in context of the Exascale era, where we're faced with a zoo of available compute hardware. Because of this, SELF routines provide support for GPU acceleration through AMD’s HIP and support for multi-core, multi-node, and multi-GPU platforms with MPI.
training@pawsey.org.au
Joe Schoonover
AMD, GPUs, supercomputer, supercomputing
Embracing new solutions for in-situ visualisation
This PPT was used by Jean Favre, senior visualisation software engineer at CSCS, the Swiss National Supercomputing Centre during his presentation at P'Con '21 (Pawsey's first PaCER Conference).
This material discusses the upcoming release of ParaView v5.10, a leading scientific visualisation...
Keywords: ParaView, GPUs, supercomputer, supercomputing, visualisation, data visualisation
Resource type: presentation
Embracing new solutions for in-situ visualisation
https://github.com/jfavre/InSitu/blob/master/InSitu-Revisited.pdf
https://dresa.org.au/materials/embracing-new-solutions-for-in-situ-visualisation
This PPT was used by Jean Favre, senior visualisation software engineer at CSCS, the Swiss National Supercomputing Centre during his presentation at P'Con '21 (Pawsey's first PaCER Conference).
This material discusses the upcoming release of ParaView v5.10, a leading scientific visualisation application. In this release ParaView consolidates its implementation of the Catalyst API, a specification developed for simulations and scientific data producers to analyse and visualise data in situ.
The material reviews some of the terminology and issues of different in-situ visualisation scenarios, then reviews early Data Adaptors for tight-coupling of simulations and visualisation solutions. This is followed by an introduction of Conduit, an intuitive model for describing hierarchical scientific data. Both ParaView-Catalyst and Ascent use Conduit’s Mesh Blueprint, a set of conventions to describe computational simulation meshes.
Finally, the materials present CSCS’ early experience in adopting ParaView-Catalyst and Ascent via two concrete examples of instrumentation of some proxy numerical applications.
training@pawsey.org.au
Jean Favre
ParaView, GPUs, supercomputer, supercomputing, visualisation, data visualisation
HPC file systems and what users need to consider for appropriate and efficient usage
Three videos on miscellaneous aspects of HPC usage - useful reference for new users of HPC systems.
1 – General overview of different file systems that might be available on HPC. The video goes through shared file systems such as /home and /scratch, local compute node file systems (local...
Keywords: HPC, high performance computer, File systems
Resource type: video, presentation
HPC file systems and what users need to consider for appropriate and efficient usage
https://www.youtube.com/watch?v=cNW7F9V1plA&list=PLjlLx279X4yO62jHF4rd7I9iEfbnz3Ts1
https://dresa.org.au/materials/hpc-file-systems-and-what-users-need-to-consider-for-appropriate-and-efficient-usage
Three videos on miscellaneous aspects of HPC usage - useful reference for new users of HPC systems.
1 – General overview of different file systems that might be available on HPC. The video goes through shared file systems such as /home and /scratch, local compute node file systems (local scratch or $TMPDIR) and storage file system. It outlines what users need to consider if they wish to use any of these in their workflows.
2 – Overview of the different directories that might be present on HPC. These could include /home, /scratch, /opt, /lib and lib64, /sw and others.
3 – Overview of the Message-of-the-day file and the message that is displayed to users every time they log in. This displays info about general help and often current problems or upcoming outages.
QCIF Training (training@qcif.edu.au)
Marlies Hankel
HPC, high performance computer, File systems
Deep Learning for Natural Language Processing
This workshop is designed to be instructor led and consists of two parts.
Part 1 consists of a lecture-demo about text processing and a hands-on session for attendees to learn how to clean a dataset.
Part 2 consists of a lecture introducing Recurrent Neural Networks and a hands-on session for...
Keywords: Deep learning, NLP, Machine learning
Resource type: presentation, tutorial
Deep Learning for Natural Language Processing
https://doi.org/10.26180/13100513
https://dresa.org.au/materials/deep-learning-for-natural-language-processing
This workshop is designed to be instructor led and consists of two parts.
Part 1 consists of a lecture-demo about text processing and a hands-on session for attendees to learn how to clean a dataset.
Part 2 consists of a lecture introducing Recurrent Neural Networks and a hands-on session for attendees to train their own RNN.
The Powerpoints contain the lecture slides, while the Jupyter notebooks (.ipynb) contain the hands-on coding exercises.
This workshop introduces natural language as data for deep learning. We discuss various techniques and software packages (e.g. python strings, RegEx, NLTK, Word2Vec) that help us convert, clean, and formalise text data “in the wild” for use in a deep learning model. We then explore the training and testing of a Recurrent Neural Network on the data to complete a real world task. We will be using TensorFlow v2 for this purpose.
datascienceplatform@monash.edu
Titus Tang
Deep learning, NLP, Machine learning
Getting Started with Deep Learning
This lecture provides a high level overview of how you could get started with developing deep learning applications. It introduces deep learning in a nutshell and then provides advice relating to the concepts and skill sets you would need to know and have in order to build a deep learning...
Keywords: Deep learning, Machine learning
Resource type: presentation
Getting Started with Deep Learning
https://doi.org/10.26180/15032688
https://dresa.org.au/materials/getting-started-with-deep-learning
This lecture provides a high level overview of how you could get started with developing deep learning applications. It introduces deep learning in a nutshell and then provides advice relating to the concepts and skill sets you would need to know and have in order to build a deep learning application. The lecture also provides pointers to various resources you could use to gain a stronger foothold in deep learning.
This lecture is targeted at researchers who may be complete beginners in machine learning, deep learning, or even with programming, but who would like to get into the space to build AI systems hands-on.
datascienceplatform@monash.edu
Titus Tang
Deep learning, Machine learning
Visualisation and Storytelling
This workshop explores how data visualisation techniques could be utilised to better understand data and to communicate research efforts and outcomes. The workshop covers a broad range of techniques from simple and static 2D graphics to advanced 3D visualisations in order to provide a broad...
Keywords: data visualisation, storytelling
Resource type: presentation, tutorial
Visualisation and Storytelling
https://doi.org/10.26180/13100510
https://dresa.org.au/materials/visualisation-and-storytelling
This workshop explores how data visualisation techniques could be utilised to better understand data and to communicate research efforts and outcomes. The workshop covers a broad range of techniques from simple and static 2D graphics to advanced 3D visualisations in order to provide a broad overview of the tools available for data analysis, presentation and storytelling. We explore, among others, animated charts and graphs, web visualisation tools such as scrollytellers, and the possibilities of 3D, interactive, and even immersive visualisations. We use real world, concrete examples along the way in order to tangibly illustrate how these visualisations can be created and how viewers perceive and interact with them. We also introduce the various tools and skill sets you would need to be proficient at presenting your data to the world.
By the conclusion of this workshop, you would gain familiarity with the various possibilities for presenting your own research data and outcomes. You would have a more intuitive understanding of the strengths and weaknesses of various modes of data visualisation and storytelling, and would have a starting point to obtain the right skill sets relevant to developing your visualisations of choice.
datascienceplatform@monash.edu
Daniel Waghorn
Nora Hamacher
Owen Kaluza
data visualisation, storytelling
Semi-Supervised Deep Learning
Modern deep neural networks require large amounts of labelled data to train. Obtaining the required labelled data is often an expensive and time consuming process. Semi-supervised deep learning involves the use of various creative techniques to train deep neural networks on partially labelled...
Keywords: Deep learning, Machine learning, semi-supervised
Resource type: presentation, tutorial
Semi-Supervised Deep Learning
https://doi.org/10.26180/14176805
https://dresa.org.au/materials/semi-supervised-deep-learning
Modern deep neural networks require large amounts of labelled data to train. Obtaining the required labelled data is often an expensive and time consuming process. Semi-supervised deep learning involves the use of various creative techniques to train deep neural networks on partially labelled data. If successful, it allows better training of a model despite the limited amount of labelled data available.
This workshop is designed to be instructor led and covers various semi-supervised learning techniques available in the literature. The workshop consists of a lecture introducing at a high level a selection of techniques that are suitable for semi-supervised deep learning. We discuss how these techniques can be implemented and the underlying assumptions they require. The lecture is followed by a hands-on session where attendees implement a semi-supervised learning technique to train a neural network. We observe and discuss the changing performance and behaviour of the network as varying degrees of labelled and unlabelled data is provided to the network during training.
datascienceplatform@monash.edu
Titus Tang
Deep learning, Machine learning, semi-supervised
Introduction to Deep Learning and TensorFlow
This workshop is intended to run as an instructor guided live event and consists of two parts. Each part consists of a lecture and a hands-on coding exercise.
Part 1 - Introduction to Deep Learning and TensorFlow
Part 2 - Introduction to Convolutional Neural Networks
The Powerpoints contain...
Keywords: Deep learning, convolutional neural network, tensorflow, Machine learning
Resource type: presentation, tutorial
Introduction to Deep Learning and TensorFlow
https://doi.org/10.26180/13100519
https://dresa.org.au/materials/introduction-to-deep-learning-and-tensorflow
This workshop is intended to run as an instructor guided live event and consists of two parts. Each part consists of a lecture and a hands-on coding exercise.
Part 1 - Introduction to Deep Learning and TensorFlow
Part 2 - Introduction to Convolutional Neural Networks
The Powerpoints contain the lecture slides, while the Jupyter notebooks (.ipynb) contain the hands-on coding exercises.
This workshop is an introduction to how deep learning works and how you could create a neural network using TensorFlow v2. We start by learning the basics of deep learning including what a neural network is, how information passes through the network, and how the network learns from data through the automated process of gradient descent. Workshop attendees would build, train and evaluate a neural network using a cloud GPU (Google Colab).
In part 2, we look at image data and how we could train a convolution neural network to classify images. Workshop attendees will extend their knowledge from the first part to design, train and evaluate this convolutional neural network.
datascienceplatform@monash.edu
Titus Tang
Deep learning, convolutional neural network, tensorflow, Machine learning
ARDC FAIR Data 101 self-guided
FAIR Data 101 v3.0 is a self-guided course covering the FAIR Data principles
The FAIR Data 101 virtual course was designed and delivered by the ARDC Skilled Workforce Program twice in 2020 and has now been reworked as a self-guided course.
Keywords: training material, FAIR data, research data, data management, FAIR
Resource type: presentation, quiz, activity
ARDC FAIR Data 101 self-guided
https://zenodo.org/record/5094034#.YQyLbY4zaUk
https://dresa.org.au/materials/ardc-fair-data-101-self-guided
FAIR Data 101 v3.0 is a self-guided course covering the FAIR Data principles
The FAIR Data 101 virtual course was designed and delivered by the ARDC Skilled Workforce Program twice in 2020 and has now been reworked as a self-guided course.
ARDC Contact us: https://ardc.edu.au/contact-us/
Liz Stokes
Matthias Liffers
Nichola Burton
Paula A. Martinez
Natasha Simons
Keith Russell
Siobhann McCafferty
Richard Ferrers
Steve McEachern
Melanie Barlow
Tom Honeyman
Maria del Mar Quiroga
training material, FAIR data, research data, data management, FAIR
phd
ecr
researcher
support
Software publishing, licensing, and citation
A short presentation for reuse includes speaker notes.
Making software citable using a code repository, an ORCID and a licence.
Cite as
Liffers, Matthias. (2021, July 12). Software publishing, licensing, and citation. Zenodo. https://doi.org/10.5281/zenodo.5091717
Keywords: software citation, software publishing, software registry, software repository, research software
Resource type: presentation
Software publishing, licensing, and citation
https://zenodo.org/record/5091717#.YQyPtY4zaUk
https://dresa.org.au/materials/software-publishing-licensing-and-citation
A short presentation for reuse includes speaker notes.
Making software citable using a code repository, an ORCID and a licence.
**Cite as**
Liffers, Matthias. (2021, July 12). Software publishing, licensing, and citation. Zenodo. https://doi.org/10.5281/zenodo.5091717
ARDC Contact us: https://ardc.edu.au/contact-us/
Matthias Liffers
software citation, software publishing, software registry, software repository, research software
phd
ecr
researcher
support
WEBINAR: Getting started with deep learning
This record includes training materials associated with the Australian BioCommons webinar ‘Getting started with deep learning’. This webinar took place on 21 July 2021.
Are you wondering what deep learning is and how it might be useful in your research? This high level overview introduces...
Keywords: Deep learning, Bioinformatics, Machine learning
Resource type: video, presentation
WEBINAR: Getting started with deep learning
https://zenodo.org/record/5121004#.YQN_QlMzY3Q
https://dresa.org.au/materials/webinar-getting-started-with-deep-learning
This record includes training materials associated with the Australian BioCommons webinar ‘Getting started with deep learning’. This webinar took place on 21 July 2021.
Are you wondering what deep learning is and how it might be useful in your research? This high level overview introduces deep learning ‘in a nutshell’ and provides tips on which concepts and skills you will need to know to build a deep learning application. The presentation also provides pointers to various resources you can use to get started in deep learning.
The webinar is followed by a short Q&A session.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Getting Started with Deep Learning - Slides (PDF): Slides used in the presentation
Materials shared elsewhere:
A recording of the webinar is available on the Australian BioCommons YouTube Channel:
https://youtu.be/I1TmpnZUuiQ
Melissa Burke (melissa@biocommons.org.au)
Titus Tang
Deep learning, Bioinformatics, Machine learning
WEBINAR: Getting started with command line bioinformatics
This record includes training materials associated with the Australian BioCommons webinar ‘Getting started with command line bioinformatics’. This webinar took place on 22 June 2021.
Bioinformatics skills are in demand like never before and biologists are stepping up to the challenge of...
Keywords: Command line, Bioinformatics
Resource type: video, presentation
WEBINAR: Getting started with command line bioinformatics
https://zenodo.org/record/5068997#.YQN4mlMzY3Q
https://dresa.org.au/materials/webinar-getting-started-with-command-line-bioinformatics
This record includes training materials associated with the Australian BioCommons webinar ‘Getting started with command line bioinformatics’. This webinar took place on 22 June 2021.
Bioinformatics skills are in demand like never before and biologists are stepping up to the challenge of learning to analyse large and ever growing datasets. Learning how to use the command line can open up many options for data analysis but getting started can be a little daunting for those without a background in computer science.
Parice Brandies and Carolyn Hogg have recently put together ten simple rules for getting started with command-line bioinformatics to help biologists begin their computational journeys. In this webinar Parice walks you through their hints and tips for getting started with the command line. She covers topics like learning tech speak, evaluating your data and workflows, assessing computational requirements, computing options, the basics of software installation, curating and testing scripts, a bit of bash and keeping good records. The webinar will be followed by a short Q&A session.
The slides were created by Parice Brandies and are based on the publication ‘Ten simple rules for getting started with command-line bioinformatics’ (https://doi.org/10.1371/journal.pcbi.1008645). The slides are shared under a Creative Commons Attribution 4.0 International unless otherwise specified and were current at the time of the webinar.
**Files and materials included in this record:**
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Getting started with command line bioinformatics - slides (PDF): Slides presented during the webinar
**Materials shared elsewhere:**
A recording of the webinar is available on the Australian BioCommons YouTube Channel
https://youtu.be/p7pA4OLB2X4
Melissa (melissa@biocommons.org.au)
Parice Brandies
Command line, Bioinformatics