AWS Ramp-Up Guide: Academic Research
AWS Ramp-Up Guides offer a variety of resources to help you build your skills and knowledge of the AWS Cloud. Each guide features carefully selected digital training, classroom courses, videos, whitepapers, certifications, and more. AWS now offers four ramp-up guides that help academic...
Keywords: Machine learning, machine learning, aws, AWS, cloud, Cloud computing, cloud computing, training material, HPC training, HPC, training registry, training partnerships
AWS Ramp-Up Guide: Academic Research
https://d1.awsstatic.com/training-and-certification/ramp-up_guides/Ramp-Up_Guide_Academic_Research.pdf
https://dresa.org.au/materials/aws-ramp-up-guide-academic-research
AWS Ramp-Up Guides offer a variety of resources to help you build your skills and knowledge of the AWS Cloud. Each guide features carefully selected digital training, classroom courses, videos, whitepapers, certifications, and more. AWS now offers four ramp-up guides that help academic researchers who use AI, ML, Generative AI, and HPC in their research activities, as well as the essential AWS knowledge for Statistician Researchers and Research IT professionals. The guides help learners decide where to start, and how to navigate, their learning journey. Some resources will be more relevant than others based on each learner’s specific research tasks.
AI, ML, Generative AI ramp-up guide (page 2) is for academic researchers who are exploring using AWS AI, ML, and Generative AI tools to improve efficiency and productivity in their research tasks. This course introduces seven components on AI and ML and ten components on Generative AI. The course starts with an introduction to AI, and covers AWS AI/ML services, such as Amazon SageMaker. The Generative AI content covers topics such as planning a Generative AI project, responsible AI Practices, security, compliance, and governance for AI solutions. The Generative AI topics also cover how to get started with Amazon Bedrock. Recommended prerequisites: basic understanding of Python.
High Performance Computing ramp-up guide (page 3) is designed for academic researchers who seek to use HPC on AWS. In this course, you will be introduced to eleven components that are essential about Higher Performance Computing on AWS. The course starts with an overview of HPC on AWS, followed by topics including AWS ParallelCluster and Research HPC Workloads on AWS Batch. Recommended prerequisites: complete AWS Cloud Essentials.
Statistician Researcher ramp-up guide (page 4) is specifically catered for researchers in the fields of statistics and quantum analysis. The course covers topics such as building with Amazon Redshift clusters, getting started with Amazon EMR, Machine Learning for Data Scientists, authoring visual analytics using Amazon QuickSight, Batch analytics on AWS, and Amazon Lightsail for Research. Recommended prerequisites: complete AWS Cloud Essentials.
Research IT ramp-up guide (page 5) is an extension of the Foundational Researcher Learning Plan, and enables Research IT leaders and professionals to dive deeper into specific topics. The goal of this extension for Research IT professionals is to dive deeper on fundamentals, understand management capabilities and implementing guardrails, cost optimization for research workloads, become familiar with platforms for research and research partners, and learn more about AWS Landing Zone and AWS Control Tower for Research. Recommended prerequisites: Foundational Researcher Learning Plan.
emmarrig@amazon.com
Machine learning, machine learning, aws, AWS, cloud, Cloud computing, cloud computing, training material, HPC training, HPC, training registry, training partnerships
Tutorials to learn how to use STAN
Stan tutorials offer links to exceptional tutorial papers, videos and statistics to learn Bayesian statistical methods and applied statistics.
Keywords: Statistics, applied statistics, Bayesian statistics, R software, Python, MATLAB
Tutorials to learn how to use STAN
https://mc-stan.org/users/documentation/tutorials.html
https://dresa.org.au/materials/tutorials-to-learn-how-to-use-stan
Stan tutorials offer links to exceptional tutorial papers, videos and statistics to learn Bayesian statistical methods and applied statistics.
https://mc-stan.org/about/team/
Statistics, applied statistics, Bayesian statistics, R software, Python, MATLAB
Species Distribution Modelling in R
This set of scripts and videos provide an introduction to running SDMs in R and include some steps to consider that go beyond what's available in the EcoCommons SDM point-and-click tools.
Five videos include: 1. An introduction to SDM in R, 2. occurrence data, 3. environmental data, 4. fitting...
Keywords: Species Distribution Modelling, Ecology, R software, EcoCommons
Species Distribution Modelling in R
https://www.ecocommons.org.au/educational-material4-mastering-species-distribution-modelling-in-r/
https://dresa.org.au/materials/species-distribution-modelling-in-r
This set of scripts and videos provide an introduction to running SDMs in R and include some steps to consider that go beyond what's available in the EcoCommons SDM point-and-click tools.
Five videos include: 1. An introduction to SDM in R, 2. occurrence data, 3. environmental data, 4. fitting your model, 5. model evaluation
Scripts and files are available here:
https://github.com/EcoCommons-Australia/educational_material/tree/main/SDMs_in_R/Scripts
Scripts for all four modules are here: https://www.ecocommons.org.au/wp-content/uploads/EcoCommons_steps_1_to_4.html
https://www.ecocommons.org.au/contact/
https://orcid.org/0000-0002-1359-5133
Species Distribution Modelling, Ecology, R software, EcoCommons
ugrad
mbr
phd
WEBINAR: Getting started with deep learning
This record includes training materials associated with the Australian BioCommons webinar ‘Getting started with deep learning’. This webinar took place on 21 July 2021.
Are you wondering what deep learning is and how it might be useful in your research? This high level overview introduces deep...
Keywords: Deep learning, Neural networks, Machine learning
WEBINAR: Getting started with deep learning
https://zenodo.org/records/5121004
https://dresa.org.au/materials/webinar-getting-started-with-deep-learning-986aa2d2-594a-4a7f-836c-44d6e9d5d017
This record includes training materials associated with the Australian BioCommons webinar ‘Getting started with deep learning’. This webinar took place on 21 July 2021.
Are you wondering what deep learning is and how it might be useful in your research? This high level overview introduces deep learning ‘in a nutshell’ and provides tips on which concepts and skills you will need to know to build a deep learning application. The presentation also provides pointers to various resources you can use to get started in deep learning.
The webinar is followed by a short Q&A session.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Getting Started with Deep Learning - Slides (PDF): Slides used in the presentation
Materials shared elsewhere:
A recording of the webinar is available on the Australian BioCommons YouTube Channel:
https://youtu.be/I1TmpnZUuiQ
Melissa Burke (melissa@biocommons.org.au)
Tang, Titus (orcid: 0000-0001-7496-1152)
Deep learning, Neural networks, Machine learning
WORKSHOP: Single cell RNAseq analysis in R
This record includes training materials associated with the Australian BioCommons workshop ‘Single cell RNAseq analysis in R’. This workshop took place over two, 3.5 hour sessions on 22 and 3 August 2022.
Event description
Analysis and interpretation of single cell RNAseq (scRNAseq) data...
Keywords: Bioinformatics, Analysis, Transcriptomics, R software, Single cell RNAseq, scRNAseq
WORKSHOP: Single cell RNAseq analysis in R
https://zenodo.org/records/7072910
https://dresa.org.au/materials/workshop-single-cell-rnaseq-analysis-in-r-4f60b82d-2f1e-4021-9569-6955878dd945
This record includes training materials associated with the Australian BioCommons workshop ‘Single cell RNAseq analysis in R’. This workshop took place over two, 3.5 hour sessions on 22 and 3 August 2022.
Event description
Analysis and interpretation of single cell RNAseq (scRNAseq) data requires dedicated workflows. In this hands-on workshop we will show you how to perform single cell analysis using Seurat - an R package for QC, analysis, and exploration of single-cell RNAseq data.
We will discuss the ‘why’ behind each step and cover reading in the count data, quality control, filtering, normalisation, clustering, UMAP layout and identification of cluster markers. We will also explore various ways of visualising single cell expression data.
This workshop is presented by the Australian BioCommons and Queensland Cyber Infrastructure Foundation (QCIF) with the assistance of a network of facilitators from the national Bioinformatics Training Cooperative.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
scRNAseq_Slides (PDF): Slides used to introduce topics
scRNAseq_Schedule (PDF): A breakdown of the topics and timings for the workshop
scRNAseq_Resources (PDF): A list of resources recommended by trainers and participants
scRNAseq_QandA(PDF): Archive of questions and their answers from the workshop Slack Channel.
Materials shared elsewhere:
This workshop follows the tutorial ‘scRNAseq Analysis in R with Seurat’
https://swbioinf.github.io/scRNAseqInR_Doco/index.html
This material is based on the introductory Guided Clustering Tutorial tutorial from Seurat.
It is also drawing from a similar workshop held by Monash Bioinformatics Platform Single-Cell-Workshop, with material here.
Melissa Burke (melissa@biocommons.org.au)
Williams, Sarah
Mehdi, Ahmed (orcid: 0000-0002-9300-2341)
Matigan, Nick
Barugahare, Adele (orcid: 0000-0002-8976-0094)
Harrison, Paul (orcid: 0000-0002-3980-268X)
Morgan, Steven (orcid: 0000-0001-6038-6126)
Whitfield, Holly (orcid: 0000-0002-7282-387X)
Bioinformatics, Analysis, Transcriptomics, R software, Single cell RNAseq, scRNAseq
WORKSHOP: R: fundamental skills for biologists
This record includes training materials associated with the Australian BioCommons workshop ‘R: fundamental skills for biologists’. This workshop took place over four, three-hour sessions on 1, 8, 15 and 22 June 2022.
Event description
Biologists need data analysis skills to be able to...
Keywords: Bioinformatics, Analysis, Statistics, R software, RStudio, Data visualisation
WORKSHOP: R: fundamental skills for biologists
https://zenodo.org/records/6766951
https://dresa.org.au/materials/workshop-r-fundamental-skills-for-biologists-81aa00db-63ad-4962-a7ac-b885bf9f676b
This record includes training materials associated with the Australian BioCommons workshop ‘R: fundamental skills for biologists’. This workshop took place over four, three-hour sessions on 1, 8, 15 and 22 June 2022.
Event description
Biologists need data analysis skills to be able to interpret, visualise and communicate their research results. While Excel can cover some data analysis needs, there is a better choice, particularly for large and complex datasets.
R is a free, open-source software and programming language that enables data exploration, statistical analysis, visualisation and more. The large variety of R packages available for analysing biological data make it a robust and flexible option for data of all shapes and sizes.
Getting started can be a little daunting for those without a background in statistics and programming. In this workshop we will equip you with the foundations for getting the most out of R and RStudio, an interactive way of structuring and keeping track of your work in R. Using biological data from a model of influenza infection, you will learn how to efficiently and reproducibly organise, read, wrangle, analyse, visualise and generate reports from your data in R.
Topics covered in this workshop include:
Spreadsheets, organising data and first steps with R
Manipulating and analysing data with dplyr
Data visualisation
Summarized experiments and getting started with Bioconductor
This workshop is presented by the Australian BioCommons and Saskia Freytag from WEHI with the assistance of a network of facilitators from the national Bioinformatics Training Cooperative.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Schedule (PDF): A breakdown of the topics and timings for the workshop
Recommended resources (PDF): A list of resources recommended by trainers and participants
Q_and_A(PDF): Archive of questions and their answers from the workshop Slack Channel.
Materials shared elsewhere:
This workshop follows the tutorial ‘Introduction to data analysis with R and Bioconductor’ which is publicly available.
https://saskiafreytag.github.io/biocommons-r-intro/
This is derived from material produced as part of The Carpentries Incubator project
https://carpentries-incubator.github.io/bioc-intro/
Melissa Burke (melissa@biocommons.org.au)
Freytag, Saskia (orcid: 0000-0002-2185-7068)
Barugahare, Adele (orcid: 0000-0002-8976-0094)
Doyle, Maria
Ansell, Brendan (orcid: 0000-0003-0297-897X)
Varshney, Akriti
Bourke, Caitlin (orcid: 0000-0002-4466-6563)
Conradsen, Cara (orcid: 0000-0001-9797-3412)
Jung, Chol-Hee (orcid: 0000-0002-2992-3162)
Sandoval, Claudia
Chandrananda, Dineika (orcid: 0000-0002-8834-9500)
Zhang, Eden (orcid: 0000-0003-0294-3734)
Rosello, Fernando (orcid: 0000-0003-3885-8777)
Iacono, Giulia (orcid: 0000-0002-1527-0754)
Tarasova, Ilariya (orcid: 0000-0002-0895-9385)
Chung, Jessica (orcid: 0000-0002-0627-0955)
Moffet, Joel
Gustafsson, Johan (orcid: 0000-0002-2977-5032)
Ding, Ke
Feher, Kristen
Perlaza-Jimenez, Laura (orcid: 0000-0002-8511-1134)
Crowe, Mark (orcid: 0000-0002-9514-2487)
Ma, Mengyao
Kandhari, Nitika (orcid: 0000-0002-0261-727X)
Williams, Sarah
Nelson, Tiffanie (orcid: 0000-0002-5341-312X)
Schreiber, Veronika (orcid: 0000-0001-6088-7828)
Pinzon Perez, William
Bioinformatics, Analysis, Statistics, R software, RStudio, Data visualisation
WORKSHOP: Working with genomics sequences and features in R with Bioconductor
This record includes training materials associated with the Australian BioCommons workshop ‘Working with genomics sequences and features in R with Bioconductor’. This workshop took place on 23 September 2021.
Workshop description
Explore the many useful functions that the Bioconductor...
Keywords: R software, Bioconductor, Bioinformatics, Analysis, Genomics, Sequence analysis
WORKSHOP: Working with genomics sequences and features in R with Bioconductor
https://zenodo.org/records/5781776
https://dresa.org.au/materials/workshop-working-with-genomics-sequences-and-features-in-r-with-bioconductor-8399bf0d-1e9e-48f3-a840-3f70f23254bb
This record includes training materials associated with the Australian BioCommons workshop ‘Working with genomics sequences and features in R with Bioconductor’. This workshop took place on 23 September 2021.
Workshop description
Explore the many useful functions that the Bioconductor environment offers for working with genomic data and other biological sequences.
DNA and proteins are often represented as files containing strings of nucleic acids or amino acids. They are associated with text files that provide additional contextual information such as genome annotations.
This workshop provides hands-on experience with tools, software and packages available in R via Bioconductor for manipulating, exploring and extracting information from biological sequences and annotation files. We will look at tools for working with some commonly used file formats including FASTA, GFF3, GTF, methods for identifying regions of interest, and easy methods for obtaining data packages such as genome assemblies.
This workshop is presented by the Australian BioCommons and Monash Bioinformatics Platform with the assistance of a network of facilitators from the national Bioinformatics Training Cooperative.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Schedule (PDF): schedule for the workshop providing a breakdown of topics and timings
Materials shared elsewhere:
This workshop follows the tutorial ‘Working with DNA sequences and features in R with Bioconductor - version 2’ developed for Monash Bioinformatics Platform and Monash Data Fluency by Paul Harrison.
https://monashdatafluency.github.io/r-bioc-2/
Melissa Burke (melissa@biocommons.org.au)
Harrison, Paul (orcid: 0000-0002-3980-268X)
Deshpande, Nandan (orcid: 0000-0002-0324-8728)
Barugahare, Adele (orcid: 0000-0002-8976-0094)
Perry, Andrew (orcid: 0000-0001-9256-6068)
Wong, Nick (orcid: 0000-0003-4393-7541)
Reames, Benjamin
R software, Bioconductor, Bioinformatics, Analysis, Genomics, Sequence analysis
VOSON Lab Code Blog
The VOSON Lab Code Blog is a space to share methods, tips, examples and code. Blog posts provide techniques to construct and analyse networks from various API and other online data sources, using the VOSON open-source software and other R based packages.
Keywords: visualisation, Data analysis, data collections, R software, Social network analysis, social media data, Computational Social Science, quantitative, Text Analytics
Resource type: tutorial, other
VOSON Lab Code Blog
https://vosonlab.github.io/
https://dresa.org.au/materials/voson-lab-code-blog
The VOSON Lab Code Blog is a space to share methods, tips, examples and code. Blog posts provide techniques to construct and analyse networks from various API and other online data sources, using the VOSON open-source software and other R based packages.
robert.ackland@anu.edu.au
visualisation, Data analysis, data collections, R software, Social network analysis, social media data, Computational Social Science, quantitative, Text Analytics
researcher
support
phd
masters
Deep Learning for Natural Language Processing
This workshop is designed to be instructor led and consists of two parts.
Part 1 consists of a lecture-demo about text processing and a hands-on session for attendees to learn how to clean a dataset.
Part 2 consists of a lecture introducing Recurrent Neural Networks and a hands-on session for...
Keywords: Deep learning, NLP, Machine learning
Resource type: presentation, tutorial
Deep Learning for Natural Language Processing
https://doi.org/10.26180/13100513
https://dresa.org.au/materials/deep-learning-for-natural-language-processing
This workshop is designed to be instructor led and consists of two parts.
Part 1 consists of a lecture-demo about text processing and a hands-on session for attendees to learn how to clean a dataset.
Part 2 consists of a lecture introducing Recurrent Neural Networks and a hands-on session for attendees to train their own RNN.
The Powerpoints contain the lecture slides, while the Jupyter notebooks (.ipynb) contain the hands-on coding exercises.
This workshop introduces natural language as data for deep learning. We discuss various techniques and software packages (e.g. python strings, RegEx, NLTK, Word2Vec) that help us convert, clean, and formalise text data “in the wild” for use in a deep learning model. We then explore the training and testing of a Recurrent Neural Network on the data to complete a real world task. We will be using TensorFlow v2 for this purpose.
datascienceplatform@monash.edu
Titus Tang
Deep learning, NLP, Machine learning
Getting Started with Deep Learning
This lecture provides a high level overview of how you could get started with developing deep learning applications. It introduces deep learning in a nutshell and then provides advice relating to the concepts and skill sets you would need to know and have in order to build a deep learning...
Keywords: Deep learning, Machine learning
Resource type: presentation
Getting Started with Deep Learning
https://doi.org/10.26180/15032688
https://dresa.org.au/materials/getting-started-with-deep-learning
This lecture provides a high level overview of how you could get started with developing deep learning applications. It introduces deep learning in a nutshell and then provides advice relating to the concepts and skill sets you would need to know and have in order to build a deep learning application. The lecture also provides pointers to various resources you could use to gain a stronger foothold in deep learning.
This lecture is targeted at researchers who may be complete beginners in machine learning, deep learning, or even with programming, but who would like to get into the space to build AI systems hands-on.
datascienceplatform@monash.edu
Titus Tang
Deep learning, Machine learning
Semi-Supervised Deep Learning
Modern deep neural networks require large amounts of labelled data to train. Obtaining the required labelled data is often an expensive and time consuming process. Semi-supervised deep learning involves the use of various creative techniques to train deep neural networks on partially labelled...
Keywords: Deep learning, Machine learning, semi-supervised
Resource type: presentation, tutorial
Semi-Supervised Deep Learning
https://doi.org/10.26180/14176805
https://dresa.org.au/materials/semi-supervised-deep-learning
Modern deep neural networks require large amounts of labelled data to train. Obtaining the required labelled data is often an expensive and time consuming process. Semi-supervised deep learning involves the use of various creative techniques to train deep neural networks on partially labelled data. If successful, it allows better training of a model despite the limited amount of labelled data available.
This workshop is designed to be instructor led and covers various semi-supervised learning techniques available in the literature. The workshop consists of a lecture introducing at a high level a selection of techniques that are suitable for semi-supervised deep learning. We discuss how these techniques can be implemented and the underlying assumptions they require. The lecture is followed by a hands-on session where attendees implement a semi-supervised learning technique to train a neural network. We observe and discuss the changing performance and behaviour of the network as varying degrees of labelled and unlabelled data is provided to the network during training.
datascienceplatform@monash.edu
Titus Tang
Deep learning, Machine learning, semi-supervised
Introduction to Deep Learning and TensorFlow
This workshop is intended to run as an instructor guided live event and consists of two parts. Each part consists of a lecture and a hands-on coding exercise.
Part 1 - Introduction to Deep Learning and TensorFlow
Part 2 - Introduction to Convolutional Neural Networks
The Powerpoints contain...
Keywords: Deep learning, convolutional neural network, tensorflow, Machine learning
Resource type: presentation, tutorial
Introduction to Deep Learning and TensorFlow
https://doi.org/10.26180/13100519
https://dresa.org.au/materials/introduction-to-deep-learning-and-tensorflow
This workshop is intended to run as an instructor guided live event and consists of two parts. Each part consists of a lecture and a hands-on coding exercise.
Part 1 - Introduction to Deep Learning and TensorFlow
Part 2 - Introduction to Convolutional Neural Networks
The Powerpoints contain the lecture slides, while the Jupyter notebooks (.ipynb) contain the hands-on coding exercises.
This workshop is an introduction to how deep learning works and how you could create a neural network using TensorFlow v2. We start by learning the basics of deep learning including what a neural network is, how information passes through the network, and how the network learns from data through the automated process of gradient descent. Workshop attendees would build, train and evaluate a neural network using a cloud GPU (Google Colab).
In part 2, we look at image data and how we could train a convolution neural network to classify images. Workshop attendees will extend their knowledge from the first part to design, train and evaluate this convolutional neural network.
datascienceplatform@monash.edu
Titus Tang
Deep learning, convolutional neural network, tensorflow, Machine learning