WORKSHOP: R: fundamental skills for biologists
This record includes training materials associated with the Australian BioCommons workshop ‘R: fundamental skills for biologists’. This workshop took place over four three-hour sessions on 1, 8, 15 and 22 June 2022.
Event description
Biologists need data analysis skills to be able to...
Keywords: Bioinformatics, Analysis, Statistics, R software, RStudio, Data visualisation
WORKSHOP: R: fundamental skills for biologists
https://zenodo.org/records/6766951
https://dresa.org.au/materials/workshop-r-fundamental-skills-for-biologists-81aa00db-63ad-4962-a7ac-b885bf9f676b
This record includes training materials associated with the Australian BioCommons workshop ‘R: fundamental skills for biologists’. This workshop took place over four three-hour sessions on 1, 8, 15 and 22 June 2022.
Event description
Biologists need data analysis skills to be able to interpret, visualise and communicate their research results. While Excel can cover some data analysis needs, there is a better choice, particularly for large and complex datasets.
R is free, open-source software and a programming language that enables data exploration, statistical analysis, visualisation and more. The large variety of R packages available for analysing biological data makes it a robust and flexible option for data of all shapes and sizes.
Getting started can be a little daunting for those without a background in statistics and programming. In this workshop we will equip you with the foundations for getting the most out of R and RStudio, an interactive way of structuring and keeping track of your work in R. Using biological data from a model of influenza infection, you will learn how to efficiently and reproducibly organise, read, wrangle, analyse, visualise and generate reports from your data in R.
Topics covered in this workshop include:
Spreadsheets, organising data and first steps with R
Manipulating and analysing data with dplyr
Data visualisation
Summarized experiments and getting started with Bioconductor
This workshop is presented by the Australian BioCommons and Saskia Freytag from WEHI with the assistance of a network of facilitators from the national Bioinformatics Training Cooperative.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Schedule (PDF): A breakdown of the topics and timings for the workshop.
Recommended resources (PDF): A list of resources recommended by trainers and participants.
Q_and_A (PDF): Archive of questions and their answers from the workshop Slack channel.
Materials shared elsewhere:
This workshop follows the tutorial ‘Introduction to data analysis with R and Bioconductor’ which is publicly available.
https://saskiafreytag.github.io/biocommons-r-intro/
This is derived from material produced as part of The Carpentries Incubator project
https://carpentries-incubator.github.io/bioc-intro/
Melissa Burke (melissa@biocommons.org.au)
Freytag, Saskia (orcid: 0000-0002-2185-7068)
Barugahare, Adele (orcid: 0000-0002-8976-0094)
Doyle, Maria
Ansell, Brendan (orcid: 0000-0003-0297-897X)
Varshney, Akriti
Bourke, Caitlin (orcid: 0000-0002-4466-6563)
Conradsen, Cara (orcid: 0000-0001-9797-3412)
Jung, Chol-Hee (orcid: 0000-0002-2992-3162)
Sandoval, Claudia
Chandrananda, Dineika (orcid: 0000-0002-8834-9500)
Zhang, Eden (orcid: 0000-0003-0294-3734)
Rosello, Fernando (orcid: 0000-0003-3885-8777)
Iacono, Giulia (orcid: 0000-0002-1527-0754)
Tarasova, Ilariya (orcid: 0000-0002-0895-9385)
Chung, Jessica (orcid: 0000-0002-0627-0955)
Moffet, Joel
Gustafsson, Johan (orcid: 0000-0002-2977-5032)
Ding, Ke
Feher, Kristen
Perlaza-Jimenez, Laura (orcid: 0000-0002-8511-1134)
Crowe, Mark (orcid: 0000-0002-9514-2487)
Ma, Mengyao
Kandhari, Nitika (orcid: 0000-0002-0261-727X)
Williams, Sarah
Nelson, Tiffanie (orcid: 0000-0002-5341-312X)
Schreiber, Veronika (orcid: 0000-0001-6088-7828)
Pinzon Perez, William
Bioinformatics, Analysis, Statistics, R software, RStudio, Data visualisation
ARDC FAIR Data 101 self-guided
FAIR Data 101 v3.0 is a self-guided course covering the FAIR Data principles.
The FAIR Data 101 virtual course was designed and delivered by the ARDC Skilled Workforce Program twice in 2020 and has now been reworked as a self-guided course.
The course structure was based on 'FAIR Data in the...
Keywords: training material, FAIR data, video, webinar, activities, quiz, FAIR, research data management
ARDC FAIR Data 101 self-guided
https://zenodo.org/records/5094034
https://dresa.org.au/materials/ardc-fair-data-101-self-guided-2d794a84-f0ff-4e11-a39c-fa8ea481e097
FAIR Data 101 v3.0 is a self-guided course covering the FAIR Data principles.
The FAIR Data 101 virtual course was designed and delivered by the ARDC Skilled Workforce Program twice in 2020 and has now been reworked as a self-guided course.
The course structure was based on 'FAIR Data in the Scholarly Communications Lifecycle', run by Natasha Simons at the FORCE11 Scholarly Communications Institute. These training materials are hosted on GitHub.
contact@ardc.edu.au
Stokes, Liz (orcid: 0000-0002-2973-5647)
Liffers, Matthias (orcid: 0000-0002-3639-2080)
Burton, Nichola (orcid: 0000-0003-4470-4846)
Martinez, Paula A. (orcid: 0000-0002-8990-1985)
Simons, Natasha (orcid: 0000-0003-0635-1998)
Russell, Keith (orcid: 0000-0001-5390-2719)
McCafferty, Siobhann (orcid: 0000-0002-2491-0995)
Ferrers, Richard (orcid: 0000-0002-2923-9889)
McEachern, Steve (orcid: 0000-0001-7848-4912)
Barlow, Melanie (orcid: 0000-0002-3956-5784)
Brady, Catherine (orcid: 0000-0002-7919-7592)
Brownlee, Rowan (orcid: 0000-0002-1955-1262)
Honeyman, Tom (orcid: 0000-0001-9448-4023)
Quiroga, Maria del Mar (orcid: 0000-0002-8943-2808)
training material, FAIR data, video, webinar, activities, quiz, FAIR, research data management
MetaSat. An open, collaboratively-developed metadata toolkit to support the future of space exploration.
MetaSat is an open metadata toolkit for describing small satellite (and even large satellite) missions in a uniform and shareable way. Optimised for small satellite missions, MetaSat fills an informatics gap. Although there have been a number of relevant metadata sets, there has been a...
Keywords: Small satellites, metadata, vocabularies, training material
MetaSat. An open, collaboratively-developed metadata toolkit to support the future of space exploration.
https://zenodo.org/records/5832057
https://dresa.org.au/materials/metasat-an-open-collaboratively-developed-metadata-toolkit-to-support-the-future-of-space-exploration-49af7d4d-f0d1-4f95-9fbe-afbd45170a6a
MetaSat is an open metadata toolkit for describing small satellite (and even large satellite) missions in a uniform and shareable way. Optimised for small satellite missions, MetaSat fills an informatics gap. Although a number of relevant metadata sets exist, there has been a longstanding need for a vocabulary that spans these community standards: one that annotates the data and information outputs of these satellite missions, enables search across disparate data repositories, and supports the application of analytical services to retrieved datasets.
A common problem among small satellite teams is finding information about how other small satellites were put together: which parts worked well, which weren't compatible, and what the mission goals and outcomes were. A lot of this information can be found, but it's not usually described in a consistent and searchable way across projects. MetaSat helps by building a uniform language of description which can be embedded into small satellite databases and tools to connect information across projects.
Although a relatively new vocabulary initiative, MetaSat has secured early adoption by SatNOGS, a global network of ground stations that collects, manages and enables access to satellite observations. MetaSat is also partnering with NASA's Small Satellite Reliability Initiative and is in discussion with NASA about implementing the vocabulary in other areas of its information infrastructure.
You can watch the full presentation on YouTube here: https://www.youtube.com/watch?v=uaCOzNL1eh4
contact@ardc.edu.au
Bouquin, Daina (orcid: 0000-0003-2626-3688)
Chivvis, Daniel (orcid: 0000-0001-6656-160X)
Small satellites, metadata, vocabularies, training material
HPC file systems and what users need to consider for appropriate and efficient usage
Three videos on miscellaneous aspects of HPC usage - useful reference for new users of HPC systems.
1 – General overview of different file systems that might be available on HPC. The video goes through shared file systems such as /home and /scratch, local compute node file systems (local...
Keywords: HPC, high performance computer, File systems
Resource type: video, presentation
HPC file systems and what users need to consider for appropriate and efficient usage
https://www.youtube.com/watch?v=cNW7F9V1plA&list=PLjlLx279X4yO62jHF4rd7I9iEfbnz3Ts1
https://dresa.org.au/materials/hpc-file-systems-and-what-users-need-to-consider-for-appropriate-and-efficient-usage
Three videos on miscellaneous aspects of HPC usage - useful reference for new users of HPC systems.
1 – General overview of the different file systems that might be available on HPC. The video goes through shared file systems such as /home and /scratch, local compute node file systems (local scratch or $TMPDIR) and storage file systems. It outlines what users need to consider if they wish to use any of these in their workflows.
2 – Overview of the different directories that might be present on HPC. These could include /home, /scratch, /opt, /lib and /lib64, /sw and others.
3 – Overview of the Message-of-the-day file and the message that is displayed to users every time they log in. This displays info about general help and often current problems or upcoming outages.
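As a rough illustration of the first video's point about node-local scratch, the following Python sketch stages an input file into a scratch directory, does its work there, and copies the result back to shared storage. It is only a sketch under stated assumptions: the helper names `pick_scratch_dir` and `run_in_scratch`, the `job_` prefix and the line-counting "work" are all hypothetical, and whether `$TMPDIR` is set (and where it points) depends on the individual HPC system.

```python
import os
import shutil
import tempfile
from pathlib import Path


def pick_scratch_dir(env=None):
    """Return node-local scratch if $TMPDIR points at a real directory,
    otherwise fall back to the system default temporary directory."""
    env = os.environ if env is None else env
    node_local = env.get("TMPDIR")
    if node_local and Path(node_local).is_dir():
        return Path(node_local)
    return Path(tempfile.gettempdir())


def run_in_scratch(input_file, results_dir, env=None):
    """Stage input into scratch, work there, copy the result to shared storage."""
    # A private sub-directory avoids clashing with other jobs on the same node.
    scratch = Path(tempfile.mkdtemp(prefix="job_", dir=pick_scratch_dir(env)))
    try:
        local_copy = Path(shutil.copy(input_file, scratch))
        # Hypothetical "work": count the lines of the staged input file.
        line_count = len(local_copy.read_text().splitlines())
        output = scratch / "line_count.txt"
        output.write_text(f"{line_count}\n")
        # Copy the result back before the scratch area is cleaned up.
        results_dir = Path(results_dir)
        results_dir.mkdir(parents=True, exist_ok=True)
        return Path(shutil.copy(output, results_dir))
    finally:
        # Node-local scratch is usually temporary, so tidy up explicitly.
        shutil.rmtree(scratch, ignore_errors=True)
```

The design choice worth noting is the `finally` block: results are copied to shared storage first, and the scratch directory is removed even if the work fails, so nothing is left behind on the compute node.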
QCIF Training (training@qcif.edu.au)
Marlies Hankel
HPC, high performance computer, File systems
Basic Linux/Unix commands
A series of eight videos (each between 5 and 10 minutes long) following the content of the Software Carpentry workshop "The Unix Shell".
Sessions 1, 2 and 3 provide instructions on the minimal level of Linux/Unix commands recommended for new...
Keywords: HPC, high performance computer, Unix, Linux, Software Carpentry
Resource type: video, guide
Basic Linux/Unix commands
https://www.youtube.com/playlist?list=PLjlLx279X4yP5GodfbqQTJuJ1S9EJU3GM
https://dresa.org.au/materials/basic-linux-unix-commands
A series of eight videos (each between 5 and 10 minutes long) following the content of the Software Carpentry workshop ["The Unix Shell"](https://swcarpentry.github.io/shell-novice/).
Sessions 1, 2 and 3 provide instructions on the minimal level of Linux/Unix commands recommended for new users of HPC.
1 – An overview of how to find out where a user is in the filesystem, list the files there, and how to get help on Unix commands
2 – How to move around the file system and change into other directories
3 – Explains the difference between an absolute and relative path
4 – Overview of how to create new directories, and to create and edit new files with nano
5 – How to use the vi editor to edit files
6 – Overview of file viewers available
7 – How to copy and move files and directories
8 – How to remove files and directories
Further details and exercises with solutions can be found on the Software Carpentry "The Unix Shell" page (https://swcarpentry.github.io/shell-novice/)
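For readers who later script these steps, the operations covered in the eight sessions have close counterparts in Python's standard library. The sketch below is not part of the videos or the Software Carpentry lesson; the function name `shell_tour` and the file names are hypothetical, and the shell command each line roughly corresponds to is given in the comments.

```python
import os
import shutil
from pathlib import Path


def shell_tour(base):
    """Run Python counterparts of basic Unix commands inside directory `base`."""
    base = Path(base).resolve()
    start = Path.cwd()                                  # pwd
    os.chdir(base)                                      # cd <base>
    try:
        Path("project").mkdir()                         # mkdir project
        notes = Path("project") / "notes.txt"           # a relative path
        notes.write_text("first line\n")                # create a file (nano or vi in the videos)
        absolute = notes.resolve()                      # the absolute path to the same file
        shutil.copy(notes, "project/copy.txt")          # cp notes.txt copy.txt
        shutil.move("project/copy.txt", "project/renamed.txt")  # mv copy.txt renamed.txt
        text = notes.read_text()                        # cat notes.txt
        listing = sorted(p.name for p in Path("project").iterdir())   # ls project
        Path("project/renamed.txt").unlink()            # rm renamed.txt
        after_rm = sorted(p.name for p in Path("project").iterdir())  # ls project
        shutil.rmtree("project")                        # rm -r project
        return {
            "relative_is_absolute": notes.is_absolute(),
            "absolute": absolute,
            "text": text,
            "listing": listing,
            "after_rm": after_rm,
        }
    finally:
        os.chdir(start)                                 # return to the starting directory
```

Calling `shell_tour` on an empty temporary directory leaves that directory empty again afterwards, since the last steps remove everything the function created.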
QCIF Training (training@qcif.edu.au)
Marlies Hankel
HPC, high performance computer, Unix, Linux, Software Carpentry
Transferring files and data
A short video outlining the basics of using FileZilla to establish a secure file transfer protocol (SFTP) connection to an HPC system, then using its drag-and-drop interface to transfer files between the HPC system and a desktop computer.
Keywords: sftp, file transfer, HPC, high performance computer
Resource type: video, guide
Transferring files and data
https://www.youtube.com/watch?v=9ABMxcKqfkQ&list=PLjlLx279X4yP3eTLu0S6nOt0HQ7XRf6WF
https://dresa.org.au/materials/transferring-files-and-data
A short video outlining the basics of using FileZilla to establish a secure file transfer protocol (SFTP) connection to an HPC system, then using its drag-and-drop interface to transfer files between the HPC system and a desktop computer.
QCIF Training (training@qcif.edu.au)
Marlies Hankel
sftp, file transfer, HPC, high performance computer
Connecting to HPC
A series of three short videos introducing how to use PuTTY to connect from a Windows PC to a secure HPC (high performance computing) cluster.
1 - The very basics on how to establish a connection to HPC.
2 - How to add more specific options for the connection to HPC.
3 - How to save the...
Keywords: HPC, high performance computer, ssh
Resource type: video, guide
Connecting to HPC
https://www.youtube.com/playlist?list=PLjlLx279X4yPJBVQuIRhz1CVMfQpTuvZW
https://dresa.org.au/materials/connecting-to-hpc
A series of three short videos introducing how to use PuTTY to connect from a Windows PC to a secure HPC (high performance computing) cluster.
1 - The very basics on how to establish a connection to HPC.
2 - How to add more specific options for the connection to HPC.
3 - How to save the details and options for a connection for future use.
QCIF Training (training@qcif.edu.au)
Marlies Hankel
HPC, high performance computer, ssh