7 Steps towards Reproducible Research
This workshop aims to take you further down your reproducibility path, by providing concepts and tools you can use in your everyday workflows. It is discipline and experience agnostic, and no coding experience is needed.
We will also examine how Reproducible Research builds business continuity...
Keywords: reproducibility, Reproducibility, reproducible workflows
Resource type: full-course, tutorial
7 Steps towards Reproducible Research
https://amandamiotto.github.io/ReproducibleResearch/
https://dresa.org.au/materials/7-steps-towards-reproducible-research
This workshop aims to take you further down your reproducibility path, by providing concepts and tools you can use in your everyday workflows. It is discipline and experience agnostic, and no coding experience is needed.
We will also examine how Reproducible Research builds business continuity into your research group, how the culture in your institute ecosystem can affect Reproducibility and how you can identify and address risks to your knowledge.
The workshop can be used as self-paced or as an instructor
Amanda Miotto - a.miotto@griffith.edu.au
Amanda Miotto
reproducibility, Reproducibility, reproducible workflows
phd
support
ALA Labs
ALA Labs provides resources and articles from the Atlas of Living Australia's Science and Decision Support team. On the website, you can find:
- Posts: Code, articles, analyses and visualisations that will hopefully help you in your own work
- Research: Highlighted summaries of scientific...
Keywords: Ecology, R, Python, Rstats, Biodiversity data, Open science, Reproducibility, Coding, Data cleaning, Data visualisation, Species Distribution Modelling, Beginner R coding
ALA Labs
https://labs.ala.org.au/
https://dresa.org.au/materials/ala-labs
ALA Labs provides resources and articles from the Atlas of Living Australia's Science and Decision Support team. On the website, you can find:
- Posts: Code, articles, analyses and visualisations that will hopefully help you in your own work
- Research: Highlighted summaries of scientific research that has used data from the Atlas of Living Australia
- Software: R & Python packages that the Science & Decision Support team manage
- Books: Long-form resources with best-practice data wrangling and visualisation
- Gallery: Showcasing external work that uses tools from ALA Labs
Atlas of Living Australia support@ala.org.au
Ecology, R, Python, Rstats, Biodiversity data, Open science, Reproducibility, Coding, Data cleaning, Data visualisation, Species Distribution Modelling, Beginner R coding
How can software containers help your research?
This video explains software containers to a research audience. It is an introduction to why containers are beneficial for research. These benefits are standardisation, portability, reliability and reproducibility.
Software Containers in research are a solution that addresses the challenge of a...
Keywords: containers, software, research, reproducibility, RSE, standard, agility, portable, reusable, code, application, reproducible, standardisation, package, system, cloud, server, version, reliability, program, collaborator, ARDC_AU, training material
How can software containers help your research?
https://zenodo.org/records/5091260
https://dresa.org.au/materials/how-can-software-containers-help-your-research-ca0f9d41-d83b-463b-a548-402c6c642fbf
This video explains software containers to a research audience. It is an introduction to why containers are beneficial for research. These benefits are standardisation, portability, reliability and reproducibility.
Software Containers in research are a solution that addresses the challenge of a replicable computational environment and supports reproducibility of research results. Understanding the concept of software containers enables researchers to better communicate their research needs with their colleagues and other researchers using and developing containers.
Watch the video here: https://www.youtube.com/watch?v=HelrQnm3v4g
If you want to share this video please use this:
Australian Research Data Commons, 2021. How can software containers help your research?. [video] Available at: https://www.youtube.com/watch?v=HelrQnm3v4g DOI: http://doi.org/10.5281/zenodo.5091260 [Accessed dd Month YYYY].
contact@ardc.edu.au
Australian Research Data Commons
Martinez, Paula Andrea (type: ProjectLeader)
Sam Muirhead (type: Producer)
The ARDC Communications Team (type: Editor)
The ARDC Skills and Workforce Development Team (type: ProjectMember)
The ARDC eResearch Infrastructure & Services (type: ProjectMember)
The ARDC Nectar Cloud Services team (type: ProjectMember)
containers, software, research, reproducibility, RSE, standard, agility, portable, reusable, code, application, reproducible, standardisation, package, system, cloud, server, version, reliability, program, collaborator, ARDC_AU, training material
CheckEM User Guide
CheckEM is an open-source web based application which provides quality control assessments on metadata and image annotations of fish stereo-imagery. It is available at marine-ecology.shinyapps.io/CheckEM. The application can assess a range of sampling methods and annotation data formats for...
Keywords: stereo-video, fish, annotation
CheckEM User Guide
https://globalarchivemanual.github.io/CheckEM/articles/manuals/CheckEM_user_guide.html
https://dresa.org.au/materials/checkem-user-guide
CheckEM is an open-source web based application which provides quality control assessments on metadata and image annotations of fish stereo-imagery. It is available at marine-ecology.shinyapps.io/CheckEM. The application can assess a range of sampling methods and annotation data formats for common inaccuracies made whilst annotating stereo imagery. CheckEM creates interactive plots and tables in a graphical interface, and provides summarised data and a report of potential errors to download.
brooke.gibbons@uwa.edu.au
Brooke Gibbons
stereo-video, fish, annotation
EventMeasure Annotation Guide
EventMeasure annotation guide for baited remote underwater stereo video systems (stereo-BRUVs) for count and length
Keywords: fish, stereo-video, annotation
EventMeasure Annotation Guide
https://globalarchivemanual.github.io/CheckEM/articles/manuals/EventMeasure_annotation_guide.html
https://dresa.org.au/materials/eventmeasure-annotation-guide
EventMeasure annotation guide for baited remote underwater stereo video systems (stereo-BRUVs) for count and length
tim.langlois@uwa.edu.au
Brooke Gibbons
Tim Langlois
Claude Spencer
fish, stereo-video, annotation
Stereo-video workflows for fish and benthic ecologists
Stereo imagery is widely used by research institutions and management bodies around the world as a cost-effective and non-destructive method to research and monitor fish and habitats (Whitmarsh, Fairweather and Huveneers, 2017). Stereo-video can provide accurate and precise size and range...
Keywords: stereo-video, fish, sharks, habitats
Resource type: tutorial
Stereo-video workflows for fish and benthic ecologists
https://globalarchivemanual.github.io/CheckEM/index.html
https://dresa.org.au/materials/stereo-video-workflows-for-fish-and-benthic-ecologists
Stereo imagery is widely used by research institutions and management bodies around the world as a cost-effective and non-destructive method to research and monitor fish and habitats (Whitmarsh, Fairweather and Huveneers, 2017). Stereo-video can provide accurate and precise size and range measurements and can be used to study spatial and temporal patterns in fish assemblages (McLean et al., 2016), habitat composition and complexity (Collins et al., 2017), behaviour (Goetze et al., 2017), responses to anthropogenic pressures (Bosch et al., 2022) and the recovery and growth of benthic fauna (Langlois et al. 2020). It is important that users of stereo-video collect, annotate, quality control and store their data in a consistent manner, to ensure data produced is of the highest quality possible and to enable large scale collaborations. Here we collate existing best practices and propose new tools to equip ecologists to ensure that all aspects of the stereo-video workflow are performed in a consistent way.
tim.langlois@uwa.edu.au
Tim Langlois
Brooke Gibbons
Claude Spencer
stereo-video, fish, sharks, habitats
Collecting Web Data
Web scraping is a technique for extracting information from websites. This can be done manually but it is usually faster, more efficient and less error-prone if it can be automated.
Web scraping allows you to convert non-tabular or poorly structured data into a usable, structured format,...
Collecting Web Data
https://intersect.org.au/training/course/webdata201
https://dresa.org.au/materials/collecting-web-data
Web scraping is a technique for extracting information from websites. This can be done manually but it is usually faster, more efficient and less error-prone if it can be automated.
Web scraping allows you to convert non-tabular or poorly structured data into a usable, structured format, such as a .csv file or spreadsheet. But scraping is about more than just acquiring data: it can help you track changes to data online, and help you archive data. In short, it’s a skill worth learning.
So join us for this web scraping workshop to learn web scraping, using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
The concept of structured data
The use of XPath queries on HTML document
How to scrape data using browser extensions
How to scrape using Python and Scrapy
How to automate the scraping of multiple web pages
A good knowledge of the basic concepts and techniques in Python. Consider taking our \Learn to Program: Python\ and \Python for Research\ courses to come up to speed beforehand.
training@intersect.org.au
Intersect Australia
Python
Data Manipulation and Visualisation in Python
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
In this workshop, you will explore DataFrames in depth (using...
Data Manipulation and Visualisation in Python
https://intersect.org.au/training/course/python203
https://dresa.org.au/materials/data-manipulation-and-visualisation-in-python
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
In this workshop, you will explore DataFrames in depth (using the pandas library), learn how to manipulate, explore and get insights from your data (Data Manipulation), as well as how to deal with missing values and how to combine multiple datasets. You will also explore different types of graphs and learn how to customise them using two of the most popular plotting libraries in Python, matplotlib and seaborn (Data Visualisation).
We teach using Jupyter notebooks, which allow program code, results, visualisations and documentation to be blended seamlessly. Perfect for sharing insights with others while producing reproducible research.
Join us for this live coding workshop where we write programs that produce results, using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
Working with pandas DataFrames
Indexing, slicing and subsetting in pandas DataFrames
Missing data values
Combine multiple pandas DataFrames
Using the Grammar of Graphics to convert data into figures using the seaborn and matplotlib libraries
Configuring plot elements within seaborn and matplotlib
Exploring different types of plots using seaborn
Either \Learn to Program: Python\ or \Learn to Program: Python\ and \Python for Research\ needed to attend this course. If you already have experience with programming, please check the topics covered in the \Learn to Program: Python\ and \Python for Research\ courses to ensure that you are familiar with the knowledge needed for this course.
training@intersect.org.au
Intersect Australia
Python
Introduction to Machine Learning using Python: Classification
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore...
Introduction to Machine Learning using Python: Classification
https://intersect.org.au/training/course/python206
https://dresa.org.au/materials/introduction-to-machine-learning-using-python-classification
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore the fundamentals of Machine Learning from a practical perspective with the help of the Python programming language and its scientific computing libraries.
Comprehensive introduction to Machine Learning models and techniques such as Logistic Regression, Decision Trees and Ensemble Learning.
Know the differences between various core Machine Learning models.
Understand the Machine Learning modelling workflows.
Use Python and scikit-learn to process real datasets, train and apply Machine Learning models.
Either \Learn to Program: Python\, \Data Manipulation in Python\ and \Introduction to ML using Python: Introduction & Linear Regression\ or \Learn to Program: Python\, \Data Manipulation and Visualisation in Python\ and \Introduction to ML using Python: Introduction & Linear Regression\ needed to attend this course.
If you already have experience with programming, please check the topics covered in courses above to ensure that you are familiar with the knowledge needed for this course, such as good understanding of Python syntax, basic programming concepts and familiarity with Pandas, Numpy and Seaborn libraries, and basic understanding of Machine Learning and Model Training.
Maths knowledge is not required. However, there is a few Math formula covered in this course and the references will be provided. Having an understanding of the Mathematics behind each Machine Learning algorithms is going to make you appreciate the behaviour of the model and know its pros/cons when using them.
training@intersect.org.au
Intersect Australia
Python
Introduction to Machine Learning using Python: SVM & Unsupervised Learning
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore...
Introduction to Machine Learning using Python: SVM & Unsupervised Learning
https://intersect.org.au/training/course/python207
https://dresa.org.au/materials/introduction-to-machine-learning-using-python-svm-unsupervised-learning
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore the fundamentals of Machine Learning from a practical perspective with the help of the Python programming language and its scientific computing libraries.
Comprehensive introduction to Machine Learning models and techniques such as Support Vector Machine, K-Nearest Neighbor and Dimensionality Reduction.
Know the differences between various core Machine Learning models.
Understand the Machine Learning modelling workflows.
Use Python and scikit-learn to process real datasets, train and apply Machine Learning models.
Either \Learn to Program: Python\, \Data Manipulation in Python\ and \Introduction to ML using Python: Introduction & Linear Regression\ or \Learn to Program: Python\, \Data Manipulation and Visualisation in Python\ and \Introduction to ML using Python: Introduction & Linear Regression\ needed to attend this course.
If you already have experience with programming, please check the topics covered in courses above to ensure that you are familiar with the knowledge needed for this course, such as good understanding of Python syntax, basic programming concepts and familiarity with Pandas, Numpy and Seaborn libraries, and basic understanding of Machine Learning and Model Training.
Maths knowledge is not required. However, there is a few Math formula covered in this course and the references will be provided. Having an understanding of the Mathematics behind each Machine Learning algorithms is going to make you appreciate the behaviour of the model and know its pros/cons when using them.
training@intersect.org.au
Intersect Australia
Python
Data Visualisation in Python
Course Materials
Using the Grammar of Graphics to convert data into figures using the seaborn and matplotlib libraries
Configuring plot elements within seaborn and matplotlib
Exploring different types of...
Data Visualisation in Python
https://intersect.org.au/training/course/python202
https://dresa.org.au/materials/data-visualisation-in-python
[Course Materials](https://intersectaustralia.github.io/training/PYTHON203/sources/Data-Adv_Python.zip)
Using the Grammar of Graphics to convert data into figures using the seaborn and matplotlib libraries
Configuring plot elements within seaborn and matplotlib
Exploring different types of plots using seaborn
Either \Learn to Program: Python\ or \Learn to Program: Python\ and \Python for Research\ needed to attend this course. If you already have experience with programming, please check the topics covered in the \Learn to Program: Python\ and \Python for Research\ courses to ensure that you are familiar with the knowledge needed for this course.
We also strongly recommend attending the \Data Manipulation in Python\.
training@intersect.org.au
Intersect Australia
Python
Python for Research
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
This workshop is an introduction to data structures (DataFrames...
Python for Research
https://intersect.org.au/training/course/python110
https://dresa.org.au/materials/python-for-research
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
This workshop is an introduction to data structures (DataFrames using the pandas library) and visualisation (using the matplotlib library) in Python. The targeted audience for this workshop is researchers who are already familiar with the basic concepts in programming such as loops, functions, and conditionals.
We teach using Jupyter notebooks, which allow program code, results, visualisations and documentation to be blended seamlessly. Perfect for sharing insights with others while producing reproducible research.
Join us for this live coding workshop where we write programs that produce results, using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
Introduction to Libraries and Built-in Functions in Python
Introduction to DataFrames using the pandas library
Reading and writing data in DataFrames
Selecting values in DataFrames
Quick introduction to Plotting using the matplotlib library
\Learn to Program: Python\ or any of the \Learn to Program: R\, \Learn to Program: MATLAB\ or \Learn to Program: Julia\, needed to attend this course. If you already have some experience with programming, please check the topics covered in the \Learn to Program: Python\ course to ensure that you are familiar with the knowledge needed for this course.
training@intersect.org.au
The Carpentries
Python
Data Manipulation in Python
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
In this workshop, you will explore DataFrames in depth (using...
Data Manipulation in Python
https://intersect.org.au/training/course/python201
https://dresa.org.au/materials/data-manipulation-in-python
Python has deservedly become a popular language for scientific computing. It has all the friendly features and conveniences you’d expect of a modern programming language, and also a rich set of libraries for working with data.
In this workshop, you will explore DataFrames in depth (using the pandas library), learn how to manipulate, explore and get insights from your data (Data Manipulation), as well as how to deal with missing values and how to combine multiple datasets.
We teach using Jupyter notebooks, which allow program code, results, visualisations and documentation to be blended seamlessly. Perfect for sharing insights with others while producing reproducible research.
Join us for this live coding workshop where we write programs that produce results, using the researcher-focused training modules from the highly regarded Software Carpentry Foundation.
Working with pandas DataFrames
Indexing, slicing and subsetting in pandas DataFrames
Missing data values
Combine multiple pandas DataFrames
Either \Learn to Program: Python\ or \Learn to Program: Python\ and \Python for Research\ needed to attend this course. If you already have experience with programming, please check the topics covered in the \Learn to Program: Python\ and \Python for Research\ courses to ensure that you are familiar with the knowledge needed for this course.
training@intersect.org.au
The Carpentries
Python
Introduction to Machine Learning using Python: Introduction & Linear Regression
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore...
Introduction to Machine Learning using Python: Introduction & Linear Regression
https://intersect.org.au/training/course/python205
https://dresa.org.au/materials/introduction-to-machine-learning-using-python-introduction-linear-regression
Machine Learning (ML) is a new way to program computers to solve real world problems. It has gained popularity over the last few years by achieving tremendous success in tasks that we believed only humans could solve, from recognising images to self-driving cars. In this course, we will explore the fundamentals of Machine Learning from a practical perspective with the help of the Python programming language and its scientific computing libraries.
Understand the difference between supervised and unsupervised Machine Learning.
Understand the fundamentals of Machine Learning.
Comprehensive introduction to Machine Learning models and techniques such as Linear Regression and Model Training.
Understand the Machine Learning modelling workflows.
Use Python and scikit-learn to process real datasets, train and apply Machine Learning models
Either \Learn to Program: Python\ and \Data Manipulation in Python\ or \Learn to Program: Python\ and \Data Manipulation and Visualisation in Python\ needed to attend this course.
If you already have experience with programming, please check the topics covered in courses above to ensure that you are familiar with the knowledge needed for this course, such as good understanding of Python syntax and basic programming concepts and familiarity with Pandas, Numpy and Seaborn libraries.
Maths knowledge is not required. However, there is a few Math formula covered in this course and the references will be provided. Having an understanding of the Mathematics behind each Machine Learning algorithms is going to make you appreciate the behaviour of the model and know its pros/cons when using them.
training@intersect.org.au
Intersect Australia
Python
Thinking like a computer: The Fundamentals of Programming
Human brains are extremely good at evaluating a small amount of information simultaneously, ignoring anomalies and coming up with an answer to a problem without much in the way of conscious thought. Computers on the other hand are extremely good at performing individual calculations, one at a...
Thinking like a computer: The Fundamentals of Programming
https://intersect.org.au/training/course/coding003
https://dresa.org.au/materials/thinking-like-a-computer-the-fundamentals-of-programming
Human brains are extremely good at evaluating a small amount of information simultaneously, ignoring anomalies and coming up with an answer to a problem without much in the way of conscious thought. Computers on the other hand are extremely good at performing individual calculations, one at a time, and can keep the results in a large bank of short-term memory for quick recall. These two approaches are fundamentally different.
Humans can only reasonably retain seven plus or minus two pieces of information in short-term memory, and new items push older items out, whereas a computer is hopeless when given multiple pieces of information simultaneously.
Understanding this fact is key to being able to write instructions for computers – also known as programs – in a way that takes advantage of their strengths, and overcomes their drawbacks.
Suitable for the programming novice, this webinar is good preparation for researchers wanting to learn how to program.
How a human solves tasks
How a computer solves tasks
Overview of programming concepts:
Variables
Loops
Conditionals
Functions
Data types
The webinar has no prerequisites.
training@intersect.org.au
Intersect Australia
Python
Start Coding without Hesitation: Programming Languages Showdown
Programming is becoming more and more popular, with many researchers using programming to perform data cleaning, data manipulation, data analytics, as well as creating publication quality plots. Programming can be really beneficial for automating processes and workflows. In this webinar, we are...
Keywords: Python, R, Matlab, Julia
Start Coding without Hesitation: Programming Languages Showdown
https://intersect.org.au/training/course/coding001
https://dresa.org.au/materials/start-coding-without-hesitation-programming-languages-showdown
Programming is becoming more and more popular, with many researchers using programming to perform data cleaning, data manipulation, data analytics, as well as creating publication quality plots. Programming can be really beneficial for automating processes and workflows. In this webinar, we are exploring four of the most popular programming languages that are widely used in academia, namely Python, R, MATLAB, and Julia.
Why use Programming
An overview of Python, R, MATLAB, and Julia
Code comparison of the four programming languages
Popularity and job opportunities
Intersect’s comparison
General guidelines on how to choose the best programming language for your research
The webinar has no prerequisites.
training@intersect.org.au
Intersect Australia
Python, R, Matlab, Julia
A showcase of Data Analysis in Python and R: A case study using COVID-19 data
In all fields of research we are being confronted with a deluge of data; data that needs cleaning and transformation to be used in further analysis. This webinar demonstrates the effective use of programming tools for an initial analysis of COVID-19 datasets, with examples using both R and...
A showcase of Data Analysis in Python and R: A case study using COVID-19 data
https://intersect.org.au/training/course/coding002
https://dresa.org.au/materials/a-showcase-of-data-analysis-in-python-and-r-a-case-study-using-covid-19-data
In all fields of research we are being confronted with a deluge of data; data that needs cleaning and transformation to be used in further analysis. This webinar demonstrates the effective use of programming tools for an initial analysis of COVID-19 datasets, with examples using both R and Python.
Cleaning up a dataset for analysis
Using Jupyter lab for interactive analysis
Making the most of the tidyverse (R) and pandas (python)
Simple data visualisation using ggplot (R) and seaborn (python)
Best practices for readable code
The webinar has no prerequisites.
training@intersect.org.au
Intersect Australia
Python, R
10 Reproducible Research things - Building Business Continuity
The idea that you can duplicate an experiment and get the same conclusion is the basis for all scientific discoveries. Reproducible research is data analysis that starts with the raw data and offers a transparent workflow to arrive at the same results and conclusions. However not all studies are...
Keywords: reproducibility, data management
Resource type: tutorial, video
10 Reproducible Research things - Building Business Continuity
https://guereslib.github.io/ten-reproducible-research-things/
https://dresa.org.au/materials/9-reproducible-research-things-building-business-continuity
The idea that you can duplicate an experiment and get the same conclusion is the basis for all scientific discoveries. Reproducible research is data analysis that starts with the raw data and offers a transparent workflow to arrive at the same results and conclusions. However not all studies are replicable due to lack of information on the process. Therefore, reproducibility in research is extremely important.
Researchers genuinely want to make their research more reproducible, but sometimes don’t know where to start and often don’t have the available time to investigate or establish methods on how reproducible research can speed up every day work. We aim for the philosophy “Be better than you were yesterday”. Reproducibility is a process, and we highlight there is no expectation to go from beginner to expert in a single workshop. Instead, we offer some steps you can take towards the reproducibility path following our Steps to Reproducible Research self paced program.
Video:
https://www.youtube.com/watch?v=bANTr9RvnGg
Tutorial:
https://guereslib.github.io/ten-reproducible-research-things/
a.miotto@griffith.edu.au; s.stapleton@griffith.edu.au; i.jennings@griffith.edu.au;
Amanda Miotto
Julie Toohey
Sharron Stapleton
Isaac Jennings
reproducibility, data management
masters
phd
ecr
researcher
support