Collecting Web Data
Web scraping is a technique for extracting information from websites. This can be done manually but it is usually faster, more efficient and less error-prone if it can be automated.
Web scraping allows you to convert non-tabular or poorly structured data into a usable, structured format,...
Collecting Web Data
https://intersect.org.au/training/course/webdata201
https://dresa.org.au/materials/collecting-web-data
Web scraping is a technique for extracting information from websites. This can be done manually but it is usually faster, more efficient and less error-prone if it can be automated.
Web scraping allows you to convert non-tabular or poorly structured data into a usable, structured format, such as a .csv file or spreadsheet. But scraping is about more than just acquiring data: it can help you track changes to data online, and help you archive data. In short, it’s a skill worth learning.
So join us for this web scraping workshop to learn web scraping, using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
The concept of structured data
The use of XPath queries on HTML document
How to scrape data using browser extensions
How to scrape using Python and Scrapy
How to automate the scraping of multiple web pages
A good knowledge of the basic concepts and techniques in Python. Consider taking our \Learn to Program: Python\ and \Python for Research\ courses to come up to speed beforehand.
training@intersect.org.au
Intersect Australia
Python
Mastering text with Regular Expressions
Have you ever wanted to extract phone numbers out of a block of unstructured text? Or email addresses. Or find all the words that start with “e” and end with “ed”, no matter their length? Or search through DNA sequences for a pattern? Or extract coordinates from GPS data?
Regular...
Keywords: Regular Expressions
Mastering text with Regular Expressions
https://intersect.org.au/training/course/regex101
https://dresa.org.au/materials/mastering-text-with-regular-expressions
Have you ever wanted to extract phone numbers out of a block of unstructured text? Or email addresses. Or find all the words that start with “e” and end with “ed”, no matter their length? Or search through DNA sequences for a pattern? Or extract coordinates from GPS data?
Regular Expressions (regexes) are a powerful way to handle a multitude of different types of data. They can be used to find patterns in text and make sophisticated replacements. Think of them as find and replace on steroids. Come along to this workshop to learn what they can do and how to apply them to your research.
Comprehend and apply the syntax of regular expressions
Use the http://regexr.com tool to test a regular expression against some text
Construct simple regular expressions to find capitalised words; all numbers; all words that start with a specific set of letters, etc. in a block of text
Craft and test a progressively more complex regular expression
Find helpful resources covering regular expressions on the web
Comprehend and apply the syntax of regular expressions
Use the http://regexr.com tool to test a regular expression against some text
Construct simple regular expressions to find capitalised words; all numbers; all words that start with a specific set of letters, etc. in a block of text
Craft and test a progressively more complex regular expression
Find helpful resources covering regular expressions on the web
training@intersect.org.au
Intersect Australia
Regular Expressions
Unix Shell and Command Line Basics
The Unix environment is incredibly powerful but quite daunting to the newcomer. Command line confidence unlocks powerful computing resources beyond the desktop, including virtual machines and High Performance Computing. It enables repetitive tasks to be automated. And it comes with a swag of...
Unix Shell and Command Line Basics
https://intersect.org.au/training/course/unix101
https://dresa.org.au/materials/unix-shell-and-command-line-basics
The Unix environment is incredibly powerful but quite daunting to the newcomer. Command line confidence unlocks powerful computing resources beyond the desktop, including virtual machines and High Performance Computing. It enables repetitive tasks to be automated. And it comes with a swag of handy tools that can be combined in powerful ways. Getting started is the hardest part, but our helpful instructors are there to demystify Unix as you get to work running programs and writing scripts on the command line.
Every attendee is given a dedicated training environment for the duration of the workshop, with all software and data fully loaded and ready to run.
We teach this course within a GNU/Linux environment. This is best characterised as a Unix-like environment. We teach how to run commands within the Bash Shell. The skills you’ll learn at this course are generally transferable to other Unix environments.
Navigate and work with files and directories (folders)
Use a selection of essential tools
Combine data and tools to build a processing workflow
Automate repetitive analysis using the command line
The course has no prerequisites.
training@intersect.org.au
The Carpentries
Unix
Data Manipulation and Visualisation in R
R is quickly gaining popularity as a programming language for statisticians, data scientists and researchers. It has an excellent ecosystem including the powerful RStudio and the Shiny web application framework.
In this workshop, you will learn how to manipulate, explore and get insights...
Data Manipulation and Visualisation in R
https://intersect.org.au/training/course/r203
https://dresa.org.au/materials/data-manipulation-and-visualisation-in-r
R is quickly gaining popularity as a programming language for statisticians, data scientists and researchers. It has an excellent ecosystem including the powerful RStudio and the Shiny web application framework.
In this workshop, you will learn how to manipulate, explore and get insights from your data (Data Manipulation using the dplyr package), as well as how to convert your data from one format to another (Data Transformation using the tidyr package). You will also explore different types of graphs and learn how to customise them using one of the most popular plotting packages in R, ggplot2 (Data Visualisation).
We teach using RStudio, which allows program code, results, visualisations and documentation to be blended seamlessly.
Join us for a live coding workshop where we write programs that produce results, using the researcher-focused training modules from Intersect and the highly regarded Software Carpentry Foundation.
DataFrame Manipulation using the dplyr package
DataFrame Transformation using the tidyr package
Using the Grammar of Graphics to convert data into figures using the ggplot2 package
Configuring plot elements within ggplot2
Exploring different types of plots using ggplot2
Either \Learn to Program: R\ or \Learn to Program: R\ and \R for Research\ needed to attend this course. If you already have experience with programming, please check the topics covered in the \Learn to Program: R\ and \R for Research\ courses to ensure that you are familiar with the knowledge needed for this course.
training@intersect.org.au
Intersect Australia
R
Traversing t tests in R
R has become a popular programming language for statisticians, data scientists and researchers. It has an excellent ecosystem including the powerful RStudio and the Shiny web application framework.
The primary goal of this workshop is to familiarise you with basic statistical concepts in R...
Traversing t tests in R
https://intersect.org.au/training/course/r211
https://dresa.org.au/materials/traversing-t-tests-in-r
R has become a popular programming language for statisticians, data scientists and researchers. It has an excellent ecosystem including the powerful RStudio and the Shiny web application framework.
The primary goal of this workshop is to familiarise you with basic statistical concepts in R from reading in and manipulating data, checking assumptions, statistical tests and visualisations. This is not an advanced statistics course, but is instead designed to gently introduce you to statistical comparisons and hypothesis testing in R.
Read in and manipulate data
Check assumptions of t tests
Perform one-sample t tests
Perform two-sample t tests (Independent-samples, Paired-samples)
Perform nonparametric t tests (One-sample Wilcoxon Signed Rank test, Independent-samples Mann-Whitney U test)
This course assumes familiarity with R and RStudio. You should have a good understanding of R syntax and basic programming concepts. Please consider attending Intersect’s following courses to get up to speed: \Learn to Program: R\, \Data Manipulation and Visualisation in R\
training@intersect.org.au
Intersect Australia
R
Introduction to Machine Learning using R: Classification
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore...
Introduction to Machine Learning using R: Classification
https://intersect.org.au/training/course/r206
https://dresa.org.au/materials/introduction-to-machine-learning-using-r-classification
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore the fundamentals of Machine Learning from a practical perspective with the help of the R programming language and its scientific computing packages.
Comprehensive introduction to Machine Learning models and techniques such as Logistic Regression, Decision Trees and Ensemble Learning.
Know the differences between various core Machine Learning models.
Understand the Machine Learning modelling workflows.
Use R and its relevant packages to process real datasets, train and apply Machine Learning models.
\\Either \Learn to Program: R\ and \Data Manipulation in R\ or \Learn to Program: R\ and \Data Manipulation and Visualisation in R\needed to attend this course. If you already have experience with programming, please check the topics covered in courses above and \Introduction to ML using R: Introduction & Linear Regression\ to ensure that you are familiar with the knowledge needed for this course, such as good understanding of R syntax and basic programming concepts, familiarity with dplyr, tidyr and ggplot2 packages, and basic understanding of Machine Learning and Model Training.\\Maths knowledge is not required. There are only a few Math formula that you are going to see in this course, however references to Mathematics required for learning about Machine Learning will be provided. Having an understanding of the Mathematics behind each Machine Learning algorithms is going to make you appreciate the behaviour of the model and know its pros/cons when using them.\\
training@intersect.org.au
Intersect Australia
R
Data Visualisation in R
R is quickly gaining popularity as a programming language for statisticians, data scientists and researchers. It has an excellent ecosystem including the powerful RStudio and the Shiny web application framework.
In this workshop, you will explore different types of graphs and learn how to...
Data Visualisation in R
https://intersect.org.au/training/course/r202
https://dresa.org.au/materials/data-visualisation-in-r
R is quickly gaining popularity as a programming language for statisticians, data scientists and researchers. It has an excellent ecosystem including the powerful RStudio and the Shiny web application framework.
In this workshop, you will explore different types of graphs and learn how to customise them using one of the most popular plotting packages in R, ggplot2 (Data Visualisation).
We teach using RStudio, which allows program code, results, visualisations and documentation to be blended seamlessly.
Join us for a live coding workshop where we write programs that produce results, using the researcher-focused training modules from Intersect and the highly regarded Software Carpentry Foundation.
Using the Grammar of Graphics to convert data into figures using the ggplot2 package
Configuring plot elements within ggplot2
Exploring different types of plots using ggplot2
Either \Learn to Program: R\ or \Learn to Program: R\ and \R for Research\ needed to attend this course. If you already have experience with programming, please check the topics covered in the \Learn to Program: R\ and \R for Research\ courses to ensure that you are familiar with the knowledge needed for this course.
We also strongly recommend attending the \Data Manipulation in R\ course.
training@intersect.org.au
Intersect Australia
R
Exploring Chi-square and correlation in R
This hands-on training is designed to familiarise you with the data analysis environment of the R programming. In this session, we will traverse into the realm of inferential statistics, beginning with correlation and reliability. We will present a brief conceptual overview and the R procedures...
Exploring Chi-square and correlation in R
https://intersect.org.au/training/course/r210
https://dresa.org.au/materials/exploring-chi-square-and-correlation-in-r
This hands-on training is designed to familiarise you with the data analysis environment of the R programming. In this session, we will traverse into the realm of inferential statistics, beginning with correlation and reliability. We will present a brief conceptual overview and the R procedures for computing reliability and correlation (Pearson’s r, Spearman’s Rho and Kendall’s tau) in real world datasets.
Obtain inferential statistics and assess data normality
Manipulate data and create graphs
Perform Chi-Square tests (Goodness of Fit test and Test of Independence)
Perform correlations on continuous and categorical data (Pearson’s r, Spearman’s Rho and Kendall’s tau)
This course assumes familiarity with R and RStudio. You should have a good understanding of R syntax and basic programming concepts, as well as familiarity with data manipulation (dplyr) and visualisation (ggplot2 package).
Please consider attending Intersect’s following courses to get up to speed: \Learn to Program: R\, \Data Manipulation and Visualisation in R\
training@intersect.org.au
Intersect Australia
R
Regular Expressions on the Command Line
Would you like to use regular expressions with the classic command line utilities find, grep, sed and awk? These venerable Unix utilities allow you to search, filter and transform large amounts of text (including many common data formats) efficiently and repeatably.
find to locate files and...
Keywords: Regular Expressions
Regular Expressions on the Command Line
https://intersect.org.au/training/course/regex201
https://dresa.org.au/materials/regular-expressions-on-the-command-line
Would you like to use regular expressions with the classic command line utilities find, grep, sed and awk? These venerable Unix utilities allow you to search, filter and transform large amounts of text (including many common data formats) efficiently and repeatably.
find to locate files and directories matching regexes.
grep to filter lines in files based on pattern matches.
sed to find and replace using regular expressions and captures.
awk to work with row- and column-oriented data.
This course assumes prior knowledge of the basic syntax of regular expressions. If you’re new to regular expressions or would like a refresher, take our Mastering text with Regular Expressions course first.
This course also assumes basic familiarity with the Bash command line environment found on GNU/Linux and other Unix-like environments. Take our Unix Shell and Command Line Basics course to get up to speed quickly.
training@intersect.org.au
Intersect Australia
Regular Expressions
Learn to Program: Python
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you'd expect of a modern programming language, and also a rich set of libraries for working with data.
We teach using Jupyter notebooks, which allow program code, results,...
Keywords: Programming, Python
Learn to Program: Python
https://intersect.org.au/training/course/python101
https://dresa.org.au/materials/learn-to-program-python
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you'd expect of a modern programming language, and also a rich set of libraries for working with data.
We teach using Jupyter notebooks, which allow program code, results, visualisations and documentation to be blended seamlessly. Perfect for sharing insights with others while producing reproducible research.
Join us for this live coding workshop where we write programs that produce results, using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
#### You'll learn:
- Introduction to the JupyterLab interface for programming
- Basic syntax and data types in Python
- How to load external data into Python
- Creating functions (FUNCTIONS)
- Repeating actions and analysing multiple data sets (LOOPS)
- Making choices (IF STATEMENTS - CONDITIONALS)
- Ways to visualise data in Python
#### Prerequisites:
No prior experience with programming is needed to attend this course.
We strongly recommend attending the Start Coding without Hesitation: Programming Languages Showdown and Thinking like a computer: The Fundamentals of Programming webinars. Recordings of previously delivered webinars can be found [here](https://intersect.org.au/training/webinars/).
**For more information, please click [here](https://intersect.org.au/training/course/python101).**
training@intersect.org.au
Programming, Python
Data Manipulation in R
R is quickly gaining popularity as a programming language for statisticians, data scientists and researchers. It has an excellent ecosystem including the powerful RStudio and the Shiny web application framework.
In this workshop, you will learn how to manipulate, explore and get insights...
Data Manipulation in R
https://intersect.org.au/training/course/r201
https://dresa.org.au/materials/data-manipulation-in-r
R is quickly gaining popularity as a programming language for statisticians, data scientists and researchers. It has an excellent ecosystem including the powerful RStudio and the Shiny web application framework.
In this workshop, you will learn how to manipulate, explore and get insights from your data (Data Manipulation using the dplyr package), as well as how to convert your data from one format to another (Data Transformation using the tidyr package).
We teach using RStudio, which allows program code, results, visualisations and documentation to be blended seamlessly.
Join us for a live coding workshop where we write programs that produce results, using the researcher-focused training modules from Intersect and the highly regarded Software Carpentry Foundation.
DataFrame Manipulation using the dplyr package
DataFrame Transformation using the tidyr package
Either \Learn to Program: R\ or \Learn to Program: R\ and \R for Research\ needed to attend this course. If you already have experience with programming, please check the topics covered in the \Learn to Program: R\ and \R for Research\ courses to ensure that you are familiar with the knowledge needed for this course.
training@intersect.org.au
The Carpentries
R
Excel for Researchers
Data rarely comes in the form you require. Often it is messy. Sometimes it is incomplete. And sometimes there’s too much of it. Frequently, it has errors. We’ll use one of the most widespread data wrangling tools, Microsoft Excel, to import, sort, filter, copy, protect, transform, summarise,...
Excel for Researchers
https://intersect.org.au/training/course/excel101
https://dresa.org.au/materials/excel-for-researchers
Data rarely comes in the form you require. Often it is messy. Sometimes it is incomplete. And sometimes there’s too much of it. Frequently, it has errors. We’ll use one of the most widespread data wrangling tools, Microsoft Excel, to import, sort, filter, copy, protect, transform, summarise, merge, and visualise research data.
While aimed at novice Excel users, most attendees will walk away with new tricks to work more efficiently with their research data.
‘Clean up’ messy research data
Organise, format and name your data
Interpret your data (SORTING, FILTERING, CONDITIONAL FORMATTING)
Perform calculations on your data using functions (MAX, MIN, AVERAGE)
Extract significant findings from your data (PIVOT TABLE, VLOOKUP)
Manipulate your data (convert data format, work with DATES and TIMES)
Create graphs and charts to visualise your data (CHARTS)
Handy tips to speed up your work
In order to participate, attendees must have a licensed copy of Microsoft Excel installed on their computer. Speak to your local university IT or Research Office for assistance in obtaining a license and installing the software.
training@intersect.org.au
Intersect Australia
Excel
From PC to Cloud or High Performance Computing
Most of you would have heard of Cloud and High Performance Computing (HPC), or you may already be using it. HPC is not the same as cloud computing. Both technologies differ in a number of ways, and have some similarities as well.
We may refer to both types as “large scale computing” – but...
From PC to Cloud or High Performance Computing
https://intersect.org.au/training/course/compute001
https://dresa.org.au/materials/from-pc-to-cloud-or-high-performance-computing
Most of you would have heard of Cloud and High Performance Computing (HPC), or you may already be using it. HPC is not the same as cloud computing. Both technologies differ in a number of ways, and have some similarities as well.
We may refer to both types as “large scale computing” – but what is the difference? Both systems target scalability of computing, but in different ways.
This webinar will give a good overview to the researchers thinking to make a move from their local computer to Cloud of High Performance Computing Cluster.
Introduction
HPC vs Cloud computing
When to use HPC
When to use the Cloud
The Cloud – Pros and Cons
HPC – Pros and Cons
The webinar has no prerequisites.
training@intersect.org.au
Intersect Australia
HPC
Learn to Program: R
R is quickly gaining popularity as a programming language of choice for statisticians, data scientists and researchers. It has an excellent ecosystem including the powerful RStudio development environment and the Shiny web application framework.
But getting started with R can be...
Learn to Program: R
https://intersect.org.au/training/course/r101
https://dresa.org.au/materials/learn-to-program-r
R is quickly gaining popularity as a programming language of choice for statisticians, data scientists and researchers. It has an excellent ecosystem including the powerful RStudio development environment and the Shiny web application framework.
But getting started with R can be challenging, particularly if you’ve never programmed before. That’s where this introductory course comes in.
We teach using RStudio, which allows program code, results, visualisations and documentation to be blended seamlessly.
Join us for a live coding workshop where we write programs that produce results, using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
Introduction to the RStudio interface for programming
Basic syntax and data types in R
How to load external data into R
Creating functions (FUNCTIONS)
Repeating actions and analysing multiple data sets (LOOPS)
Making choices (IF STATEMENTS – CONDITIONALS)
Ways to visualise data in R
No prior experience with programming needed to attend this course.
We strongly recommend attending the Start Coding without Hesitation: Programming Languages Showdown and Thinking like a computer: The Fundamentals of Programming webinars. Recordings of previously delivered webinars can be found \here\.
training@intersect.org.au
The Carpentries
R
Data Manipulation and Visualisation in Python
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
In this workshop, you will explore DataFrames in depth (using...
Data Manipulation and Visualisation in Python
https://intersect.org.au/training/course/python203
https://dresa.org.au/materials/data-manipulation-and-visualisation-in-python
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
In this workshop, you will explore DataFrames in depth (using the pandas library), learn how to manipulate, explore and get insights from your data (Data Manipulation), as well as how to deal with missing values and how to combine multiple datasets. You will also explore different types of graphs and learn how to customise them using two of the most popular plotting libraries in Python, matplotlib and seaborn (Data Visualisation).
We teach using Jupyter notebooks, which allow program code, results, visualisations and documentation to be blended seamlessly. Perfect for sharing insights with others while producing reproducible research.
Join us for this live coding workshop where we write programs that produce results, using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
Working with pandas DataFrames
Indexing, slicing and subsetting in pandas DataFrames
Missing data values
Combine multiple pandas DataFrames
Using the Grammar of Graphics to convert data into figures using the seaborn and matplotlib libraries
Configuring plot elements within seaborn and matplotlib
Exploring different types of plots using seaborn
Either \Learn to Program: Python\ or \Learn to Program: Python\ and \Python for Research\ needed to attend this course. If you already have experience with programming, please check the topics covered in the \Learn to Program: Python\ and \Python for Research\ courses to ensure that you are familiar with the knowledge needed for this course.
training@intersect.org.au
Intersect Australia
Python
Introduction to Machine Learning using Python: Classification
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore...
Introduction to Machine Learning using Python: Classification
https://intersect.org.au/training/course/python206
https://dresa.org.au/materials/introduction-to-machine-learning-using-python-classification
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore the fundamentals of Machine Learning from a practical perspective with the help of the Python programming language and its scientific computing libraries.
Comprehensive introduction to Machine Learning models and techniques such as Logistic Regression, Decision Trees and Ensemble Learning.
Know the differences between various core Machine Learning models.
Understand the Machine Learning modelling workflows.
Use Python and scikit-learn to process real datasets, train and apply Machine Learning models.
Either \Learn to Program: Python\, \Data Manipulation in Python\ and \Introduction to ML using Python: Introduction & Linear Regression\ or \Learn to Program: Python\, \Data Manipulation and Visualisation in Python\ and \Introduction to ML using Python: Introduction & Linear Regression\ needed to attend this course.
If you already have experience with programming, please check the topics covered in courses above to ensure that you are familiar with the knowledge needed for this course, such as good understanding of Python syntax, basic programming concepts and familiarity with Pandas, Numpy and Seaborn libraries, and basic understanding of Machine Learning and Model Training.
Maths knowledge is not required. However, there is a few Math formula covered in this course and the references will be provided. Having an understanding of the Mathematics behind each Machine Learning algorithms is going to make you appreciate the behaviour of the model and know its pros/cons when using them.
training@intersect.org.au
Intersect Australia
Python
Getting started with HPC using PBS Pro
Is your computer’s limited power throttling your research ambitions? Are your analysis scripts pushing your laptop’s processor to its limits? Is your software crashing because you’ve run out of memory? Would you like to unleash to power of the Unix command line to automate and run your analysis...
Getting started with HPC using PBS Pro
https://intersect.org.au/training/course/hpc201
https://dresa.org.au/materials/getting-started-with-hpc-using-pbs-pro
Is your computer’s limited power throttling your research ambitions? Are your analysis scripts pushing your laptop’s processor to its limits? Is your software crashing because you’ve run out of memory? Would you like to unleash to power of the Unix command line to automate and run your analysis on supercomputers that you can access for free?
High-Performance Computing (HPC) allows you to accomplish your analysis faster by using many parallel CPUs and huge amounts of memory simultaneously. This course provides a hands on introduction to running software on HPC infrastructure using PBS Pro.
Connect to an HPC cluster
Use the Unix command line to operate a remote computer and create job scripts
Submit and manage jobs on a cluster using a scheduler
Transfer files to and from a remote computer
Use software through environment modules
Use parallelisation to speed up data analysis
Access the facilities available to you as a researcher
This is the PBS Pro version of the Getting Started with HPC course.
This course assumes basic familiarity with the Bash command line environment found on GNU/Linux and other Unix-like environments. To come up to speed, consider taking our \Unix Shell and Command Line Basics\ course.
training@intersect.org.au
Intersect Australia
HPC
Introduction to Machine Learning using Python: SVM & Unsupervised Learning
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore...
Introduction to Machine Learning using Python: SVM & Unsupervised Learning
https://intersect.org.au/training/course/python207
https://dresa.org.au/materials/introduction-to-machine-learning-using-python-svm-unsupervised-learning
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore the fundamentals of Machine Learning from a practical perspective with the help of the Python programming language and its scientific computing libraries.
Comprehensive introduction to Machine Learning models and techniques such as Support Vector Machine, K-Nearest Neighbor and Dimensionality Reduction.
Know the differences between various core Machine Learning models.
Understand the Machine Learning modelling workflows.
Use Python and scikit-learn to process real datasets, train and apply Machine Learning models.
Either \Learn to Program: Python\, \Data Manipulation in Python\ and \Introduction to ML using Python: Introduction & Linear Regression\ or \Learn to Program: Python\, \Data Manipulation and Visualisation in Python\ and \Introduction to ML using Python: Introduction & Linear Regression\ needed to attend this course.
If you already have experience with programming, please check the topics covered in courses above to ensure that you are familiar with the knowledge needed for this course, such as good understanding of Python syntax, basic programming concepts and familiarity with Pandas, Numpy and Seaborn libraries, and basic understanding of Machine Learning and Model Training.
Maths knowledge is not required. However, there is a few Math formula covered in this course and the references will be provided. Having an understanding of the Mathematics behind each Machine Learning algorithms is going to make you appreciate the behaviour of the model and know its pros/cons when using them.
training@intersect.org.au
Intersect Australia
Python
Getting started with HPC using Slurm
Is your computer’s limited power throttling your research ambitions? Are your analysis scripts pushing your laptop’s processor to its limits? Is your software crashing because you’ve run out of memory? Would you like to unleash to power of the Unix command line to automate and run your analysis...
Getting started with HPC using Slurm
https://intersect.org.au/training/course/hpc202
https://dresa.org.au/materials/getting-started-with-hpc-using-slurm
Is your computer’s limited power throttling your research ambitions? Are your analysis scripts pushing your laptop’s processor to its limits? Is your software crashing because you’ve run out of memory? Would you like to unleash to power of the Unix command line to automate and run your analysis on supercomputers that you can access for free?
High-Performance Computing (HPC) allows you to accomplish your analysis faster by using many parallel CPUs and huge amounts of memory simultaneously. This course provides a hands on introduction to running software on HPC infrastructure using Slurm.
Connect to an HPC cluster
Use the Unix command line to operate a remote computer and create job scripts
Submit and manage jobs on a cluster using a scheduler
Transfer files to and from a remote computer
Use software through environment modules
Use parallelisation to speed up data analysis
Access the facilities available to you as a researcher
This is the Slurm version of the Getting Started with HPC course.
This course assumes basic familiarity with the Bash command line environment found on GNU/Linux and other Unix-like environments. To come up to speed, consider taking our \Unix Shell and Command Line Basics\ course.
training@intersect.org.au
Intersect Australia
HPC
Data Visualisation in Python
Course Materials
Using the Grammar of Graphics to convert data into figures using the seaborn and matplotlib libraries
Configuring plot elements within seaborn and matplotlib
Exploring different types of...
Data Visualisation in Python
https://intersect.org.au/training/course/python202
https://dresa.org.au/materials/data-visualisation-in-python
[Course Materials](https://intersectaustralia.github.io/training/PYTHON203/sources/Data-Adv_Python.zip)
Using the Grammar of Graphics to convert data into figures using the seaborn and matplotlib libraries
Configuring plot elements within seaborn and matplotlib
Exploring different types of plots using seaborn
Either \Learn to Program: Python\ or \Learn to Program: Python\ and \Python for Research\ needed to attend this course. If you already have experience with programming, please check the topics covered in the \Learn to Program: Python\ and \Python for Research\ courses to ensure that you are familiar with the knowledge needed for this course.
We also strongly recommend attending the \Data Manipulation in Python\.
training@intersect.org.au
Intersect Australia
Python
Python for Research
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
This workshop is an introduction to data structures (DataFrames...
Python for Research
https://intersect.org.au/training/course/python110
https://dresa.org.au/materials/python-for-research
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
This workshop is an introduction to data structures (DataFrames using the pandas library) and visualisation (using the matplotlib library) in Python. The targeted audience for this workshop is researchers who are already familiar with the basic concepts in programming such as loops, functions, and conditionals.
We teach using Jupyter notebooks, which allow program code, results, visualisations and documentation to be blended seamlessly. Perfect for sharing insights with others while producing reproducible research.
Join us for this live coding workshop where we write programs that produce results, using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
Introduction to Libraries and Built-in Functions in Python
Introduction to DataFrames using the pandas library
Reading and writing data in DataFrames
Selecting values in DataFrames
Quick introduction to Plotting using the matplotlib library
\Learn to Program: Python\ or any of the \Learn to Program: R\, \Learn to Program: MATLAB\ or \Learn to Program: Julia\, needed to attend this course. If you already have some experience with programming, please check the topics covered in the \Learn to Program: Python\ course to ensure that you are familiar with the knowledge needed for this course.
training@intersect.org.au
The Carpentries
Python
Data Manipulation in Python
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
In this workshop, you will explore DataFrames in depth (using...
Data Manipulation in Python
https://intersect.org.au/training/course/python201
https://dresa.org.au/materials/data-manipulation-in-python
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
In this workshop, you will explore DataFrames in depth (using the pandas library), learn how to manipulate, explore and get insights from your data (Data Manipulation), as well as how to deal with missing values and how to combine multiple datasets.
We teach using Jupyter notebooks, which allow program code, results, visualisations and documentation to be blended seamlessly. Perfect for sharing insights with others while producing reproducible research.
Join us for this live coding workshop where we write programs that produce results, using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
Working with pandas DataFrames
Indexing, slicing and subsetting in pandas DataFrames
Missing data values
Combine multiple pandas DataFrames
Either \Learn to Program: Python\ or \Learn to Program: Python\ and \Python for Research\ needed to attend this course. If you already have experience with programming, please check the topics covered in the \Learn to Program: Python\ and \Python for Research\ courses to ensure that you are familiar with the knowledge needed for this course.
training@intersect.org.au
The Carpentries
Python
Introduction to Machine Learning using Python: Introduction & Linear Regression
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore...
Introduction to Machine Learning using Python: Introduction & Linear Regression
https://intersect.org.au/training/course/python205
https://dresa.org.au/materials/introduction-to-machine-learning-using-python-introduction-linear-regression
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore the fundamentals of Machine Learning from a practical perspective with the help of the Python programming language and its scientific computing libraries.
Understand the difference between supervised and unsupervised Machine Learning.
Understand the fundamentals of Machine Learning.
Comprehensive introduction to Machine Learning models and techniques such as Linear Regression and Model Training.
Understand the Machine Learning modelling workflows.
Use Python and scikit-learn to process real datasets, train and apply Machine Learning models
Either \Learn to Program: Python\ and \Data Manipulation in Python\ or \Learn to Program: Python\ and \Data Manipulation and Visualisation in Python\ needed to attend this course.
If you already have experience with programming, please check the topics covered in courses above to ensure that you are familiar with the knowledge needed for this course, such as good understanding of Python syntax and basic programming concepts and familiarity with Pandas, Numpy and Seaborn libraries.
Maths knowledge is not required. However, there is a few Math formula covered in this course and the references will be provided. Having an understanding of the Mathematics behind each Machine Learning algorithms is going to make you appreciate the behaviour of the model and know its pros/cons when using them.
training@intersect.org.au
Intersect Australia
Python
Version Control with Git
Have you mistakenly overwritten programs or data and want to learn techniques to avoid repeating the loss? Version control systems are one of the most powerful tools available for avoiding data loss and enabling reproducible research. While the learning curve can be steep, our trainers are there...
Version Control with Git
https://intersect.org.au/training/course/git101
https://dresa.org.au/materials/version-control-with-git
Have you mistakenly overwritten programs or data and want to learn techniques to avoid repeating the loss? Version control systems are one of the most powerful tools available for avoiding data loss and enabling reproducible research. While the learning curve can be steep, our trainers are there to answer all your questions while you gain hands on experience in using Git, one of the most popular version control systems available.
Join us for this workshop where we cover the fundamentals of version control using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
keep versions of data, scripts, and other files
examine commit logs to find which files were changed when
restore earlier versions of files
compare changes between versions of a file
push your versioned files to a remote location, for backup and to facilitate collaboration
The course has no prerequisites.
training@intersect.org.au
The Carpentries
Git
Thinking like a computer: The Fundamentals of Programming
Human brains are extremely good at evaluating a small amount of information simultaneously, ignoring anomalies and coming up with an answer to a problem without much in the way of conscious thought. Computers on the other hand are extremely good at performing individual calculations, one at a...
Thinking like a computer: The Fundamentals of Programming
https://intersect.org.au/training/course/coding003
https://dresa.org.au/materials/thinking-like-a-computer-the-fundamentals-of-programming
Human brains are extremely good at evaluating a small amount of information simultaneously, ignoring anomalies and coming up with an answer to a problem without much in the way of conscious thought. Computers on the other hand are extremely good at performing individual calculations, one at a time, and can keep the results in a large bank of short-term memory for quick recall. These two approaches are fundamentally different.
Humans can only reasonably retain seven plus or minus two pieces of information in short-term memory, and new items push older items out, whereas a computer is hopeless when given multiple pieces of information simultaneously.
Understanding this fact is key to being able to write instructions for computers – also known as programs – in a way that takes advantage of their strengths, and overcomes their drawbacks.
Suitable for the programming novice, this webinar is good preparation for researchers wanting to learn how to program.
How a human solves tasks
How a computer solves tasks
Overview of programming concepts:
Variables
Loops
Conditionals
Functions
Data types
The webinar has no prerequisites.
training@intersect.org.au
Intersect Australia
Python
Parallel Programming for HPC
You have written, compiled and run functioning programs in C and/or Fortran. You know how HPC works and you’ve submitted batch jobs.
Now you want to move from writing single-threaded programs into the parallel programming paradigm, so you can truly harness the full power of High Performance...
Parallel Programming for HPC
https://intersect.org.au/training/course/hpc301
https://dresa.org.au/materials/parallel-programming-for-hpc
You have written, compiled and run functioning programs in C and/or Fortran. You know how HPC works and you’ve submitted batch jobs.
Now you want to move from writing single-threaded programs into the parallel programming paradigm, so you can truly harness the full power of High Performance Computing.
OpenMP (Open Multi-Processing): a widespread method for shared memory programming
MPI (Message Passing Interface): a leading distributed memory programming model
To do this course you need to have:
A good working knowledge of HPC. Consider taking our
Getting Started with HPC using PBS Pro course to come up to speed beforehand.
Prior experience of writing programs in either C or Fortran.
training@intersect.org.au
Intersect Australia
HPC
Start Coding without Hesitation: Programming Languages Showdown
Programming is becoming more and more popular, with many researchers using programming to perform data cleaning, data manipulation, data analytics, as well as creating publication quality plots. Programming can be really beneficial for automating processes and workflows. In this webinar, we are...
Keywords: Python, R, Matlab, Julia
Start Coding without Hesitation: Programming Languages Showdown
https://intersect.org.au/training/course/coding001
https://dresa.org.au/materials/start-coding-without-hesitation-programming-languages-showdown
Programming is becoming more and more popular, with many researchers using programming to perform data cleaning, data manipulation, data analytics, as well as creating publication quality plots. Programming can be really beneficial for automating processes and workflows. In this webinar, we are exploring four of the most popular programming languages that are widely used in academia, namely Python, R, MATLAB, and Julia.
Why use Programming
An overview of Python, R, MATLAB, and Julia
Code comparison of the four programming languages
Popularity and job opportunities
Intersect’s comparison
General guidelines on how to choose the best programming language for your research
The webinar has no prerequisites.
training@intersect.org.au
Intersect Australia
Python, R, Matlab, Julia
R for Social Scientists
R is quickly gaining popularity as a programming language of choice for researchers. It has an excellent ecosystem including the powerful RStudio development environment.
But getting started with R can be challenging, particularly if you’ve never programmed before. That’s where this...
R for Social Scientists
https://intersect.org.au/training/course/r103
https://dresa.org.au/materials/r-for-social-scientists
R is quickly gaining popularity as a programming language of choice for researchers. It has an excellent ecosystem including the powerful RStudio development environment.
But getting started with R can be challenging, particularly if you’ve never programmed before. That’s where this introductory course comes in.
Join us for a live coding workshop where we write programs that produce results, using the researcher-focused training modules from the highly regarded Data Carpentry.
Basic syntax and data types in R
RStudio interface
How to import CSV files into R
The structure of data frames
A brief introduction to data wrangling and data transformation
How to calculate summary statistics
A brief introduction to visualise data
No prior experience with programming needed to attend this course.
training@intersect.org.au
The Carpentries
R
Getting Started with Excel
We rarely receive the research data in an appropriate form. Often data is messy. Sometimes it is incomplete. And sometimes there’s too much of it. Frequently, it has errors.
This webinar targets beginners and presents a quick demonstration of using the most widespread data wrangling tool,...
Getting Started with Excel
https://intersect.org.au/training/course/excel001
https://dresa.org.au/materials/getting-started-with-excel
We rarely receive the research data in an appropriate form. Often data is messy. Sometimes it is incomplete. And sometimes there’s too much of it. Frequently, it has errors.
This webinar targets beginners and presents a quick demonstration of using the most widespread data wrangling tool, Microsoft Excel, to sort, filter, copy, protect, transform, aggregate, summarise, and visualise research data.
Introduction to Microsoft Excel user interface
Interpret data using sorting, filtering, and conditional formatting
Summarise data using functions
Analyse data using pivot tables
Manipulate and visualise data
Handy tips to speed up your work
The webinar has no prerequisites.
training@intersect.org.au
Intersect Australia
Excel
Survey Tools in Research: REDCap and Qualtrics
Now more than ever researchers are needing to embrace electronic data capture methods to keep their research moving in the midst of social distancing restrictions and decreased access to survey participants. Using a research specific survey tool can not only solve this problem, but also set your...
Keywords: REDCap, Qualtrics
Survey Tools in Research: REDCap and Qualtrics
https://intersect.org.au/training/course/surveys001
https://dresa.org.au/materials/survey-tools-in-research-redcap-and-qualtrics
Now more than ever researchers are needing to embrace electronic data capture methods to keep their research moving in the midst of social distancing restrictions and decreased access to survey participants. Using a research specific survey tool can not only solve this problem, but also set your research up for success through intuitive data collection and validation, scheduling and reporting.
This webinar will introduce and compare two of the most popular research tools for the collection of survey data and patient records: REDCap and Qualtrics.
Electronic Data Capture: Surveys vs Forms
Confidential vs Anonymous data collection
Strengths and weaknesses of Qualtrics and REDCap
Real-life use cases for each tool
Using survey tools for longitudinal studies
The webinar has no prerequisites.
training@intersect.org.au
Intersect Australia
REDCap, Qualtrics