WORKSHOP: Online data analysis for biologists
This record includes training materials associated with the Australian BioCommons workshop ‘Online data analysis for biologists’. This workshop took place on 21 August 2024.
Topic description
Galaxy is a web-based platform that lets you conduct accessible, reproducible, and transparent...
Keywords: Bioinformatics, Data analysis, Galaxy
WORKSHOP: Online data analysis for biologists
https://zenodo.org/records/13948826
https://dresa.org.au/materials/workshop-online-data-analysis-for-biologists
This record includes training materials associated with the Australian BioCommons workshop ‘Online data analysis for biologists’. This workshop took place on 21 August 2024.
Topic description
Galaxy is a web-based platform that lets you conduct accessible, reproducible, and transparent computational biological research. Widely used by researchers worldwide, Galaxy gives you access to thousands of popular tools for analysing and processing biological data. It is well suited to datasets large and small and supports a wide range of analyses, including genome assembly, annotation, epigenetics, metabolomics, metagenomics, proteomics, statistics, transcriptomics, variant analysis and visualisation.
This workshop provides an introduction to using Galaxy and available tools. Using an example dataset, you’ll practice uploading data, choosing and running tools, and viewing the results. We’ll share our top tips for managing your experiments and speeding up your analysis with workflows.
Lead trainer: Dr Gareth Price, Galaxy Australia
Facilitator: Mike Thang, Galaxy Australia / QCIF
Infrastructure provision: Galaxy Australia
Host: Dr Melissa Burke, Australian BioCommons
Training Materials
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event_metadata_Online_data_analysis_for_biologists_210824 (PDF): Information about the event logistics, including description, event URL, learning objectives, prerequisites, technical requirements etc.
Schedule_Online_data_analysis_for_biologists_210824 (PDF): Schedule for the workshop providing a breakdown of topics and timings
Materials shared elsewhere:
This workshop is based on the Galaxy Training Network tutorial ‘Galaxy basics for everyone’: https://training.galaxyproject.org/training-material/topics/introduction/tutorials/galaxy-intro-101-everyone/tutorial.html
A recording of this workshop is available on the Australian BioCommons YouTube Channel: https://www.youtube.com/watch?v=PF39KjOvreM
Melissa Burke (melissa@biocommons.org.au)
Price, Gareth (orcid: 0000-0003-2439-8650)
Thang, Michael
Bioinformatics, Data analysis, Galaxy
Astronomy Data And Computing Services - Upskilling the Australian astronomy community
The Astronomy Data And Computing Services (ADACS) initiative has been working with the Australian astronomy community for just over 3 years now. Our vision is to deliver astronomy-focused training, support and expertise to maximise the scientific return on investments in astronomical data &...
Keywords: astronomy, data skills, eresearch skills, skills, computational skills, training, skills gaps, astronomy-focused training, training material
Astronomy Data And Computing Services - Upskilling the Australian astronomy community
https://zenodo.org/records/4287748
https://dresa.org.au/materials/astronomy-data-and-computing-services-upskilling-the-australian-astronomy-community-57afa0b9-77da-4dc1-ad29-25089f19363d
The Astronomy Data And Computing Services (ADACS) initiative has been working with the Australian astronomy community for just over 3 years now. Our vision is to deliver astronomy-focused training, support and expertise to maximise the scientific return on investments in astronomical data & computing infrastructure.
During these last 3 years, we have delivered dozens of face-to-face, hands-on workshops and created several hours' worth of online tutorial materials. This talk will focus on our journey to deliver this computational skills training to the community, exploring how we chose different delivery pathways and content based on both community input and our professional expertise and understanding of existing skill gaps. Most importantly, we will discuss our plans for the future and how we are working to actively include the community in developing new training material, beyond the usual skills survey.
Come along to this talk if you would like to hear about a national effort to deliver computational skills training and would like to know more about potential new avenues to provide just-in-time training and how to collaborate with ADACS.
contact@ardc.edu.au
Lange, Rebecca (orcid: 0000-0002-9449-4384)
astronomy, data skills, eresearch skills, skills, computational skills, training, skills gaps, astronomy-focused training, training material
WEBINAR: Getting started with R
This record includes training materials associated with the Australian BioCommons webinar ‘Getting started with R’. This webinar took place on 16 August 2021.
Data analysis skills are now central to most biological experiments. While Excel can cover some of your data analysis needs, it is not...
Keywords: R statistical software, R studio, Tidyverse, Bioinformatics, Data analysis
WEBINAR: Getting started with R
https://zenodo.org/records/5214277
https://dresa.org.au/materials/webinar-getting-started-with-r-1c8f2b21-bc4b-4b42-9a5d-d6096a2afbe6
This record includes training materials associated with the Australian BioCommons webinar ‘Getting started with R’. This webinar took place on 16 August 2021.
Data analysis skills are now central to most biological experiments. While Excel can cover some of your data analysis needs, it is not always the best choice, particularly for large and complex datasets.
R is open-source software and a programming language that enables data exploration, statistical analysis, visualisation and more. While it is the tool of choice for data analysis, getting started can be a little daunting for those without a background in statistics.
In this webinar Saskia Freytag, an R user with over a decade of experience and a member of the Bioconductor Community Advisory Board, will walk you through her hints and tips for getting started with R and data analysis. She’ll cover topics like R Studio and why you need it, where to get help, basic data manipulation, visualisations and extending R with libraries. The webinar will be followed by a short Q&A session.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event, including description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Getting started with R - slides (PDF): Slides used in the presentation
Materials shared elsewhere:
A recording of the webinar is available on the Australian BioCommons YouTube Channel:
https://youtu.be/JS7yZw7bnX8
Melissa Burke (melissa@biocommons.org.au)
Freytag, Saskia (orcid: 0000-0002-2185-7068)
R statistical software, R studio, Tidyverse, Bioinformatics, Data analysis
Successful data training stories from NCI
NCI Australia manages a multi-petabyte sized data repository, collocated with its HPC systems and data services, which allows high performance access to many scientific research datasets across many earth science domains.
An important aspect is to provide training materials that proactively...
Keywords: skills, training, eresearch skills, HPC training, domain-specific training, reproducible workflows, training material
Successful data training stories from NCI
https://zenodo.org/records/4287750
https://dresa.org.au/materials/successful-data-training-stories-from-nci-33f110e3-0c06-492e-9cc5-fa0f886ca1b8
NCI Australia manages a multi-petabyte sized data repository, collocated with its HPC systems and data services, which allows high performance access to many scientific research datasets across many earth science domains.
An important aspect is to provide training materials that proactively engage the research community to improve its understanding of the data available, and to share knowledge and best practices in the use of tools and other software. We have developed multiple levels of training modules (introductory, intermediate and advanced) to cater for users with different levels of experience and interest. We have also tailored courses for each scientific domain, so that the use-cases and software will be most relevant to their interests and needs.
In our training, brief lectures are followed by hands-on examples of how to use datasets, built on working examples of well-known tools and software that people can use as a template and modify to fit their needs. For example, we take representative use-cases from scientific activities, from our collaborations and from user support issues, and convert them to Jupyter notebook examples so that people can repeat the workflow and reproduce the results. We also use the training as an opportunity to raise awareness of growing issues in resource management. Examples include familiarity with the FAIR data principles, licensing, citation, data management and trusted digital repositories. This approach to both our online training materials and workshops has been well received by PhD students, early-career researchers and cross-disciplinary users.
contact@ardc.edu.au
Wang, Jingbo
skills, training, eresearch skills, HPC training, domain-specific training, reproducible workflows, training material
Accelerating skills development in Data science and AI at scale
At the Monash Data Science and AI platform, we believe that upskilling our research community and building a workforce with data science skills are key to accelerating the application of data science in research. To achieve this, we create and leverage new and existing training capabilities...
Keywords: AI, machine learning, eresearch skills, training, train the trainer, volunteer instructors, training partnerships, training material
Accelerating skills development in Data science and AI at scale
https://zenodo.org/records/4287746
https://dresa.org.au/materials/accelerating-skills-development-in-data-science-and-ai-at-scale-2d8a65fa-f96e-44ad-a026-cfae3f38d128
At the Monash Data Science and AI platform, we believe that upskilling our research community and building a workforce with data science skills are key to accelerating the application of data science in research. To achieve this, we create and leverage new and existing training capabilities within and outside Monash University. In this talk, we will discuss the principles and purpose of establishing collaborative models to accelerate skills development at scale. We will talk about our approach to identifying gaps in the existing skills and training available in data science, key areas of interest as identified by the research community and various sources of training available in the marketplace. We will provide insights into the collaborations we currently have and intend to develop in the future within the university and also nationally.
The talk will also cover our approach, as outlined below:
• Combined survey of gaps in skills and trainings for Data science and AI
• Provide seats to partners
• Share associate instructors/helpers/volunteers
• Develop combined training materials
• Publish a repository of open source trainings
• Train the trainer activities
• Establish a network of volunteers to deliver trainings at their local regions
Industry plays a significant role in making invaluable training available to the research community, either through self-learning platforms like AWS Machine Learning University or instructor-led courses like the NVIDIA Deep Learning Institute. We will discuss how we leverage our partnerships with industry to bring this training to our research community.
Finally, we will discuss how we map our training to the ARDC skills roadmap and how the ARDC platforms project “Environments to accelerate Machine Learning based Discovery” has enabled collaboration between Monash University and University of Queensland to develop and deliver training together.
contact@ardc.edu.au
Tang, Titus
AI, machine learning, eresearch skills, training, train the trainer, volunteer instructors, training partnerships, training material
Data Fluency: a community of practice supporting a digitally skilled workforce
This presentation showcases the impact of the Monash Data Fluency Community of Practice upon digitally skilled Graduate Research students involved as learners and instructors in the program. The strong focus on building community to complement training has fostered an environment of learning,...
Keywords: skills, training, eresearch skills, data skills, online learning, pedagogy, train the trainer, digitally skilled workforce, training material
Data Fluency: a community of practice supporting a digitally skilled workforce
https://zenodo.org/records/4287752
https://dresa.org.au/materials/data-fluency-a-community-of-practice-supporting-a-digitally-skilled-workforce-b911a1a8-0331-496e-95a6-0015a12acc34
This presentation showcases the impact of the Monash Data Fluency Community of Practice upon digitally skilled Graduate Research students involved as learners and instructors in the program. The strong focus on building community to complement training has fostered an environment of learning, networking and sharing of expertise. Hear what the Graduate Research students have to say about the value of skills training and how it has impacted their research; how the community has enabled them to network with a broad range of researchers and affiliate partner groups they would not ordinarily be in contact with; and how their research journey has been enhanced by working as part of a multi-disciplinary team, as well as by sharpening their teaching skills.
The rapid refocus from face-to-face to online delivery, as a result of the pandemic, highlights the importance of a multi-faceted online approach including workshops, drop-in sessions, Slack chat and online learning resources. As a result of the shift to online, the range of strategic external partner/affiliate groups has extended and demand for workshops and drop-ins has increased. Learn how the instructors have altered their pedagogical approach to engage workshop and drop-in participants; how they have overcome some of the challenges of facilitating in an online environment; and how this is preparing them to become part of a digitally skilled workforce.
contact@ardc.edu.au
Groenewegen, David (orcid: 0000-0003-2523-1676)
skills, training, eresearch skills, data skills, online learning, pedagogy, train the trainer, digitally skilled workforce, training material
ARDC Skills Landscape
The Australian Research Data Commons is driving transformational change in the research data ecosystem, enabling researchers to conduct world class data-intensive research. One interconnected component of this ecosystem is skills development/uplift, which is critical to the Commons and its...
Keywords: skills, data skills, eresearch skills, community, skilled workforce, FAIR, research data management, data stewardship, data governance, data use, data generation, training material
ARDC Skills Landscape
https://zenodo.org/records/4287743
https://dresa.org.au/materials/ardc-skills-landscape-56b224ca-9e30-4771-8615-d028c7be86a6
The Australian Research Data Commons is driving transformational change in the research data ecosystem, enabling researchers to conduct world class data-intensive research. One interconnected component of this ecosystem is skills development/uplift, which is critical to the Commons and its purpose of providing Australian researchers with a competitive advantage through data.
In this presentation, Kathryn Unsworth introduces the ARDC Skills Landscape. The Landscape is a first step in developing a national skills framework to enable a coordinated and cohesive approach to skills development across the Australian eResearch sector. It is also a first step towards helping to analyse current approaches in data training to identify:
- Siloed skills initiatives, and finding ways to build partnerships and improve collaboration
- Skills deficits, and working to address the gaps in data skills
- Areas of skills development for investment by skills stakeholders like universities, research organisations, skills and training service providers, ARDC, etc.
contact@ardc.edu.au
Unsworth, Kathryn (orcid: 0000-0002-5407-9987)
skills, data skills, eresearch skills, community, skilled workforce, FAIR, research data management, data stewardship, data governance, data use, data generation, training material
Exploratory Data Analysis
This is the second of three modules in our exciting new machine learning workshop series by the Sydney Informatics Hub (SIH).
Module 1: https://youtu.be/dMwHFhKWRRI
Module 3:...
Keywords: Data analysis, training material
Exploratory Data Analysis
https://youtu.be/HVAFflj2PS0
https://dresa.org.au/materials/exploratory-data-analysis
This is the second of three modules in our exciting new machine learning workshop series by the Sydney Informatics Hub (SIH).
Module 1: https://youtu.be/dMwHFhKWRRI
Module 3: https://github.com/Sydney-Informatics-Hub/Module3R
The Sydney Informatics Hub is a Core Research Facility at The University of Sydney, enabling excellence in research: https://sydney.edu.au/informatics-hub
sih.training@sydney.edu.au
Zhang, Eden (orcid: 0000-0003-0294-3734)
Mori, Giorgia (orcid: 0000-0003-3469-5632)
Data analysis, training material
National Transfusion Dataset Secure eResearch Platform (SeRP)/SafeHaven Training
A short training video for NTD users on how to access and use the SeRP once data access is granted.
Keywords: research data, Data analysis, research data management
National Transfusion Dataset Secure eResearch Platform (SeRP)/SafeHaven Training
https://www.transfusiondataset.com/training-and-user-guides
https://dresa.org.au/materials/national-transfusion-dataset-secure-eresearch-platform-serp-safehaven-training
A short training video for NTD users on how to access and use the SeRP once data access is granted.
sphpm.ntd@monash.edu
research data, Data analysis, research data management
Introduction to Data Cleaning with OpenRefine
Learn basic data cleaning techniques in this self-paced online workshop using open data from data.qld.gov.au and the open-source tool OpenRefine (openrefine.org). Learn techniques to prepare messy tabular data for computational analysis. Of most relevance to HASS disciplines, working with textual data...
Keywords: data skills, Data analysis
Resource type: tutorial
Introduction to Data Cleaning with OpenRefine
https://griffithunilibrary.github.io/data-cleaning-intro/
https://dresa.org.au/materials/introduction-to-data-cleaning-with-openrefine
Learn basic data cleaning techniques in this self-paced online workshop using open data from data.qld.gov.au and the open-source tool OpenRefine (openrefine.org). Learn techniques to prepare messy tabular data for computational analysis. Most relevant to HASS disciplines that work with textual data in a structured or semi-structured format.
s.stapleton@griffith.edu.au
Sharron Stapleton
data skills, Data analysis
mbr
phd
ecr
researcher
support
professional
VOSON Lab Code Blog
The VOSON Lab Code Blog is a space to share methods, tips, examples and code. Blog posts provide techniques to construct and analyse networks from various APIs and other online data sources, using the VOSON open-source software and other R-based packages.
Keywords: visualisation, Data analysis, data collections, R software, Social network analysis, social media data, Computational Social Science, quantitative, Text Analytics
Resource type: tutorial, other
VOSON Lab Code Blog
https://vosonlab.github.io/
https://dresa.org.au/materials/voson-lab-code-blog
The VOSON Lab Code Blog is a space to share methods, tips, examples and code. Blog posts provide techniques to construct and analyse networks from various APIs and other online data sources, using the VOSON open-source software and other R-based packages.
robert.ackland@anu.edu.au
visualisation, Data analysis, data collections, R software, Social network analysis, social media data, Computational Social Science, quantitative, Text Analytics
researcher
support
phd
masters
Galaxy Training
Galaxy is a hosted web-accessible platform that lets you conduct accessible, reproducible, and transparent computational biological research. It is an international, community-driven effort to make it easy for life scientists to analyse their data for free and without the need for programmatic...
Keywords: Galaxy Australia, Galaxy Project, Bioinformatics, Data analysis
Galaxy Training
https://training.galaxyproject.org/training-material/
https://dresa.org.au/materials/galaxy-training
Galaxy is a hosted web-accessible platform that lets you conduct accessible, reproducible, and transparent computational biological research. It is an international, community-driven effort to make it easy for life scientists to analyse their data for free and without the need for programming skills.
This is a collection of tutorials developed and maintained by the worldwide Galaxy community that show you how to analyse a variety of biological data using Galaxy.
Melissa Burke (melissa@biocommons.org.au)
Galaxy Australia, Galaxy Project, Bioinformatics, Data analysis