WORKSHOP: Online data analysis for biologists
This record includes training materials associated with the Australian BioCommons workshop ‘Online data analysis for biologists’. This workshop took place on 9 September 2021.
Workshop description
Galaxy is an online platform for biological research that allows people to use computational data...
Keywords: Bioinformatics, Analysis, Workflows, Galaxy Australia
WORKSHOP: Online data analysis for biologists
https://zenodo.org/records/5775277
https://dresa.org.au/materials/workshop-online-data-analysis-for-biologists-08d66913-4ce3-4528-bdd6-0b0fcf234982
This record includes training materials associated with the Australian BioCommons workshop ‘Online data analysis for biologists’. This workshop took place on 9 September 2021.
Workshop description
Galaxy is an online platform for biological research that allows people to use computational data analysis tools and workflows without the need for programming experience.
It is an open source, web-based platform for accessible, reproducible, and transparent computational biomedical research. It also captures run information so that workflows can be saved, repeated and shared efficiently via the web.
This interactive beginners workshop will provide an introduction to the Galaxy interface, histories and available tools. The material covered in this workshop is freely available through the Galaxy Training Network.
The workshop will be held via Zoom and involves a combination of presentations by the lead trainer and smaller breakout groups supported by experienced facilitators.
The materials are shared under a Creative Commons 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Schedule (PDF): schedule for the workshop
Online_data_analysis_for_biologists_extraslides (PPTX and PDF): Slides used to introduce the data set and emphasise the importance of workflows. These slides were developed by Ms Grace Hall.
Materials shared elsewhere:
The tutorial used in this workshop is available via the Galaxy Training Network.
Anne Fouilloux, Nadia Goué, Christopher Barnett, Michele Maroni, Olha Nahorna, Dave Clements, Saskia Hiltemann, 2021 Galaxy 101 for everyone (Galaxy Training Materials). https://training.galaxyproject.org/training-material/topics/introduction/tutorials/galaxy-intro-101-everyone/tutorial.html Online; accessed Fri Dec 10 2021
Melissa Burke (melissa@biocommons.org.au)
Hall, Grace (orcid: 0000-0002-5105-8347)
Perreau, Vicky (orcid: 0000-0002-0773-7246)
Morgan, Steven (orcid: 0000-0001-6038-6126)
Bioinformatics, Analysis, Workflows, Galaxy Australia
WEBINAR: Where to go when your bioinformatics outgrows your compute
This record includes training materials associated with the Australian BioCommons webinar ‘Where to go when your bioinformatics outgrows your compute’. This webinar took place on 19 August 2021.
Bioinformatics analyses are often complex, requiring multiple software tools and specialised compute...
Keywords: Computational Biology, Bioinformatics, High performance computing, HPC, Galaxy Australia, Nectar Research Cloud, Pawsey Supercomputing Centre, NCI, NCMAS, Cloud computing
WEBINAR: Where to go when your bioinformatics outgrows your compute
https://zenodo.org/records/5240578
https://dresa.org.au/materials/webinar-where-to-go-when-your-bioinformatics-outgrows-your-compute-7a5a0ff8-8f4f-4fd0-af20-a88d515a6554
This record includes training materials associated with the Australian BioCommons webinar ‘Where to go when your bioinformatics outgrows your compute’. This webinar took place on 19 August 2021.
Bioinformatics analyses are often complex, requiring multiple software tools and specialised compute resources. “I don’t know what compute resources I will need”, “My analysis won’t run and I don’t know why” and "Just getting it to work" are common pain points for researchers. In this webinar, you will learn how to understand the compute requirements for your bioinformatics workflows. You will also hear about ways of accessing compute that suits your needs as an Australian researcher, including Galaxy Australia, cloud and high-performance computing services offered by the Australian Research Data Commons, the National Compute Infrastructure (NCI) and Pawsey. We also describe bioinformatics and computing support services available to Australian researchers.
This webinar was jointly organised with the Sydney Informatics Hub at the University of Sydney.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Where to go when your bioinformatics outgrows your compute - slides (PDF and PPTX): Slides presented during the webinar
Australian research computing resources cheat sheet (PDF): A list of resources and useful links mentioned during the webinar.
Materials shared elsewhere:
A recording of the webinar is available on the Australian BioCommons YouTube Channel:
https://youtu.be/hNTbngSc-W0
Melissa Burke (melissa@biocommons.org.au)
Samaha, Georgina (orcid: 0000-0003-0419-1476)
Chew, Tracy (orcid: 0000-0001-9529-7705)
Sadsad, Rosemarie (orcid: 0000-0003-2488-953X)
Coddington, Paul (orcid: 0000-0003-1336-9686)
Gladman, Simon (orcid: 0000-0002-6100-4385)
Edberg, Roger
Shaikh, Javed
Cytowski, Maciej (orcid: 0000-0002-0007-0979)
Computational Biology, Bioinformatics, High performance computing, HPC, Galaxy Australia, Nectar Research Cloud, Pawsey Supercomputing Centre, NCI, NCMAS, Cloud computing
WEBINAR: Detection of and phasing of hybrid accessions in a target capture dataset
This record includes training materials associated with the Australian BioCommons webinar ‘Detection of and phasing of hybrid accessions in a target capture dataset’. This webinar took place on 10 June 2021.
Hybridisation plays an important role in evolution, leading to the exchange of genes...
Keywords: Phylogenetics, Bioinformatics, Phylogeny, Genomics, Target capture sequencing
WEBINAR: Detection of and phasing of hybrid accessions in a target capture dataset
https://zenodo.org/records/5105013
https://dresa.org.au/materials/webinar-detection-of-and-phasing-of-hybrid-accessions-in-a-target-capture-dataset-51cc7740-0da1-45f1-95de-f1a47f676053
This record includes training materials associated with the Australian BioCommons webinar ‘Detection of and phasing of hybrid accessions in a target capture dataset’. This webinar took place on 10 June 2021.
Hybridisation plays an important role in evolution, leading to the exchange of genes between species and, in some cases, generate new lineages. The use of molecular methods has revealed the frequency and importance of reticulation events is higher than previously thought and this insight continues with the ongoing development of phylogenomic methods that allow novel insights into the role and extent of hybridisation. Hybrids notoriously provide challenges for the reconstruction of evolutionary relationships, as they contain conflicting genetic information from their divergent parental lineages. However, this also provides the opportunity to gain insights into the origin of hybrids (including autopolyploids).
This webinar explores some of the challenges and opportunities that occur when hybrids are included in a target capture sequence dataset. In particular, it describes the impact of hybrid accessions on sequence assembly and phylogenetic analysis and further explores how the information of the conflicting phylogenetic signal can be used to detect and resolve hybrid accessions. The webinar showcases a novel bioinformatic workflow, HybPhaser, that can be used to detect and phase hybrids in target capture datasets and will provide the theoretical background and concepts behind the workflow.
This webinar is part of a series of webinars and workshops developed by the Genomics for Australian Plants (GAP) Initiative that focuses on the analysis of target capture sequence data. In addition to two public webinars, the GAP bioinformatics working group is offering training workshops in the use of newly developed and existing scripts in an integrated workflow to participants in the 2021 virtual Australasian Systematic Botany Society Conference.
The materials are shared under a Creative Commons 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Nauheimer_hybphaser_slides (PDF): Slides presented during the webinar
Materials shared elsewhere:
A recording of the webinar is available on the Australian BioCommons YouTube Channel: https://youtu.be/japXwTAhA5U
Melissa Burke (melissa@biocommons.org.au)
Nauheimer, Lars (orcid: 0000-0002-2847-0966)
Phylogenetics, Bioinformatics, Phylogeny, Genomics, Target capture sequencing
WEBINAR: Conflict in multi-gene datasets: why it happens and what to do about it - deep coalescence, paralogy and reticulation
This record includes training materials associated with the Australian BioCommons webinar ‘Conflict in multi-gene datasets: why it happens and what to do about it - deep coalescence, paralogy and reticulation’. This webinar took place on 20 May 2021.
Multi-gene datasets used in phylogenetic...
Keywords: Phylogenetics, Bioinformatics, Phylogeny, Genomics, Target capture sequencing
WEBINAR: Conflict in multi-gene datasets: why it happens and what to do about it - deep coalescence, paralogy and reticulation
https://zenodo.org/records/5104998
https://dresa.org.au/materials/webinar-conflict-in-multi-gene-datasets-why-it-happens-and-what-to-do-about-it-deep-coalescence-paralogy-and-reticulation-a6743550-b904-45e1-9635-4e481ee8f739
This record includes training materials associated with the Australian BioCommons webinar ‘Conflict in multi-gene datasets: why it happens and what to do about it - deep coalescence, paralogy and reticulation’. This webinar took place on 20 May 2021.
Multi-gene datasets used in phylogenetic analyses, such as those produced by the sequence capture or target enrichment used in the Genomics for Australian Plants: Australian Angiosperm Tree of Life project, often show discordance between individual gene trees and between gene and species trees. This webinar explores three different forms of discordance: deep coalescence, paralogy, and reticulation. In each case, it considers underlying biological processes, how discordance presents in the data, and what bioinformatic or phylogenetic approaches and tools are available to address these challenges. It covers Yang and Smith paralogy resolution and general information on options for phylogenetic analysis.
This webinar is part of a series of webinars and workshops developed by the Genomics for Australian Plants (GAP) Initiative that focused on the analysis of target capture sequence data. In addition to two public webinars, the GAP bioinformatics working group is offering training workshops in the use of newly developed and existing scripts in an integrated workflow to participants in the 2021 virtual Australasian Systematic Botany Society Conference.
The materials are shared under a Creative Commons 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Schmidt-Lebuhn - paralogy lineage sorting reticulation - slides (PDF): Slides presented during the webinar
Materials shared elsewhere:
A recording of the webinar is available on the Australian BioCommons YouTube Channel: https://youtu.be/1bw81q898z8
Melissa Burke (melissa@biocommons.org.au)
Schmidt-Lebuhn, Alexander (orcid: 0000-0002-7402-8941)
Phylogenetics, Bioinformatics, Phylogeny, Genomics, Target capture sequencing
WEBINAR: AlphaFold: what's in it for me?
This record includes training materials associated with the Australian BioCommons webinar ‘WEBINAR: AlphaFold: what’s in it for me?’. This webinar took place on 18 April 2023.
Event description
AlphaFold has taken the scientific world by storm with the ability to accurately predict the...
Keywords: Bioinformatics, Machine Learning, Structural Biology, Proteins, Drug discovery, AlphaFold, AI, Artificial Intelligence, Deep learning
WEBINAR: AlphaFold: what's in it for me?
https://zenodo.org/records/7865494
https://dresa.org.au/materials/webinar-alphafold-what-s-in-it-for-me-4d1ea222-4240-4b68-b9ae-7769ac664ee0
This record includes training materials associated with the Australian BioCommons webinar ‘WEBINAR: AlphaFold: what’s in it for me?’. This webinar took place on 18 April 2023.
Event description
AlphaFold has taken the scientific world by storm with the ability to accurately predict the structure of any protein in minutes using artificial intelligence (AI). From drug discovery to enzymes that degrade plastics, this promises to speed up and fundamentally change the way that protein structures are used in biological research.
Beyond the hype, what does this mean for structural biology as a field (and as a career)?
Dr Craig Morton, Drug Discovery Lead at the CSIRO, is an early adopter of AlphaFold and has decades of expertise in protein structure / function, protein modelling, protein – ligand interactions and computational small molecule drug discovery, with particular interest in anti-infective agents for the treatment of bacterial and viral diseases.
Craig joins this webinar to share his perspective on the implications of AlphaFold for science and structural biology. He will give an overview of how AlphaFold works, ways to access AlphaFold, and some examples of how it can be used for protein structure/function analysis.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Materials shared elsewhere:
A recording of this webinar is available on the Australian BioCommons YouTube Channel:
https://youtu.be/4ytn2_AiH8s
Melissa Burke (melissa@biocommons.org.au)
Morton, Craig (orcid: 0000-0001-5452-5193)
Bioinformatics, Machine Learning, Structural Biology, Proteins, Drug discovery, AlphaFold, AI, Artificial Intelligence, Deep learning
WEBINAR: Here's one we prepared earlier: (re)creating bioinformatics methods and workflows with Galaxy Australia
This record includes training materials associated with the Australian BioCommons webinar ‘Here’s one we prepared earlier: (re)creating bioinformatics methods and workflows with Galaxy Australia’. This webinar took place on 26 October 2022.
Event description
Have you discovered a brilliant...
Keywords: Bioinformatics, Workflows, FAIR, Galaxy Australia
WEBINAR: Here's one we prepared earlier: (re)creating bioinformatics methods and workflows with Galaxy Australia
https://zenodo.org/records/7251310
https://dresa.org.au/materials/webinar-here-s-one-we-prepared-earlier-re-creating-bioinformatics-methods-and-workflows-with-galaxy-australia-134a8bf5-3801-421f-a454-e0f9020f4871
This record includes training materials associated with the Australian BioCommons webinar ‘Here’s one we prepared earlier: (re)creating bioinformatics methods and workflows with Galaxy Australia’. This webinar took place on 26 October 2022.
Event description
Have you discovered a brilliant bioinformatics workflow but you’re not quite sure how to use it? In this webinar we will introduce the power of Galaxy for construction and (re)use of reproducible workflows, whether building workflows from scratch, recreating them from published descriptions and/or extracting from Galaxy histories.
Using an established bioinformatics method, we’ll show you how to:
Use the workflows creator in Galaxy Australia
Build a workflow based on a published method
Annotate workflows so that you (and others) can understand them
Make workflows finable and citable (important and very easy to do!)
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
GalaxyWorkflows_Slides (PDF): A PDF copy of the slides presented during the webinar.
Materials shared elsewhere:
A recording of this webinar is available on the Australian BioCommons YouTube Channel:
https://youtu.be/IMkl6p7hkho
Melissa Burke (melissa@biocommons.org.au)
Price, Gareth (orcid: 0000-0003-2439-8650)
Gustafsson, Johan (orcid: 0000-0002-2977-5032)
Bioinformatics, Workflows, FAIR, Galaxy Australia
WEBINAR: Conservation genomics in the age of extinction
This record includes training materials associated with the Australian BioCommons webinar ‘Conservation genomics in the age of extinction’. This webinar took place on 8 March 2022.
Event description
Biodiversity is crashing and millions of plant and animal species are at the edge of...
Keywords: Conservation genomics, Genomics, Bioinformatics, Sequencing, Threatened Species Initiative, Galaxy Australia
WEBINAR: Conservation genomics in the age of extinction
https://zenodo.org/records/6350785
https://dresa.org.au/materials/webinar-conservation-genomics-in-the-age-of-extinction-c7718a53-68ee-4c69-adf4-cd0550710d3f
This record includes training materials associated with the Australian BioCommons webinar ‘Conservation genomics in the age of extinction’. This webinar took place on 8 March 2022.
Event description
Biodiversity is crashing and millions of plant and animal species are at the edge of extinction. Understanding the genetic diversity of these species is an important tool for conservation biology but obtaining high quality genomes for threatened species is not always straightforward.
In this webinar Dr Carolyn Hogg speaks about the work she has been doing with the Threatened Species Initiative to build genomic resources to understand and protect Australia’s threatened species. Using examples such as the Kroombit Tinker Frog and the Greater Bilby, Carolyn describes some of the complexities and challenges of generating genomes from short reads and HiFi reads for critically endangered species. She outlines the technologies and resources being used and how these are bridging the gap between genomicists, bioinformaticians and conservation experts to help save Australian species.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Materials shared elsewhere:
A recording of this webinar is available on the Australian BioCommons YouTube Channel:
https://youtu.be/Bl7CaiGQ91s
Melissa Burke (melissa@biocommons.org.au)
Hogg, Carolyn (orcid: 0000-0002-6328-398X)
Conservation genomics, Genomics, Bioinformatics, Sequencing, Threatened Species Initiative, Galaxy Australia
WORKSHOP: Hybrid de novo genome assembly
This record includes training materials associated with the Australian BioCommons workshop ‘Hybrid de novo genome assembly’. This workshop took place on 7 October 2021.
Workshop description
It’s now easier than ever to assemble new reference genomes thanks to hybrid genome assembly approaches...
Keywords: Galaxy Australia, Bioinformatics, Analysis, Workflows, Genomics, Genome assembly, De novo assembly
WORKSHOP: Hybrid de novo genome assembly
https://zenodo.org/records/5781781
https://dresa.org.au/materials/workshop-hybrid-de-novo-genome-assembly-714004ba-0348-47c8-a68f-038a1f8ccfb1
This record includes training materials associated with the Australian BioCommons workshop ‘Hybrid de novo genome assembly’. This workshop took place on 7 October 2021.
Workshop description
It’s now easier than ever to assemble new reference genomes thanks to hybrid genome assembly approaches which enable research on organisms for which reference genomes were not previously available. These approaches combine the strengths of short (Illumina) and long (PacBio or Nanopore) read technologies, resulting in improved assembly quality.
In this workshop we will learn how to create and assess genome assemblies from Illumina and Nanopore reads using data from a Bacillus Subtilis strain. We will demonstrate two hybrid-assembly methods using the tools Flye, Pilon, and Unicycler to perform assembly and subsequent error correction. You will learn how to visualise input read sets and the assemblies produced at each stage and assess the quality of the final assembly.
All analyses will be performed using Galaxy Australia, an online platform for biological research that allows people to use computational data analysis tools and workflows without the need for programming experience.
This workshop is presented by the Australian BioCommons and Melbourne Bioinformatics with the assistance of a network of facilitators from the national Bioinformatics Training Cooperative.
Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event.
Files and materials included in this record:
Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc.
Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file.
Schedule (PDF): A breakdown of the topics and timings for the workshop
Materials shared elsewhere:
This workshop follows the tutorial ‘Hybrid genome assembly - Nanopore and Illumina’ developed by Melbourne Bioinformatics.
https://www.melbournebioinformatics.org.au/tutorials/tutorials/hybrid_assembly/nanopore_assembly/
Melissa Burke (melissa@biocommons.org.au)
Hall, Grace (orcid: 0000-0002-5105-8347)
Morgan, Steven (orcid: 0000-0001-6038-6126)
Makunin, Igor
Galaxy Australia, Bioinformatics, Analysis, Workflows, Genomics, Genome assembly, De novo assembly
Accelerating skills development in Data science and AI at scale
At the Monash Data Science and AI platform, we believe that upskilling our research community and building a workforce with data science skills are key to accelerating the application of data science in research. To achieve this, we create and leverage new and existing training capabilities...
Keywords: AI, machine learning, eresearch skills, training, train the trainer, volunteer instructors, training partnerships, training material
Accelerating skills development in Data science and AI at scale
https://zenodo.org/records/4287746
https://dresa.org.au/materials/accelerating-skills-development-in-data-science-and-ai-at-scale-2d8a65fa-f96e-44ad-a026-cfae3f38d128
At the Monash Data Science and AI platform, we believe that upskilling our research community and building a workforce with data science skills are key to accelerating the application of data science in research. To achieve this, we create and leverage new and existing training capabilities within and outside Monash University. In this talk, we will discuss the principles and purpose of establishing collaborative models to accelerate skills development at scale. We will talk about our approach to identifying gaps in the existing skills and training available in data science, key areas of interest as identified by the research community and various sources of training available in the marketplace. We will provide insights into the collaborations we currently have and intend to develop in the future within the university and also nationally.
The talk will also cover our approach as outlined below
• Combined survey of gaps in skills and trainings for Data science and AI
• Provide seats to partners
• Share associate instructors/helpers/volunteers
• Develop combined training materials
• Publish a repository of open source trainings
• Train the trainer activities
• Establish a network of volunteers to deliver trainings at their local regions
Industry plays a significant role in making some invaluable training available to the research community either through self learning platforms like AWS Machine Learning University or Instructor led courses like NVIDIA Deep Learning Institute. We will discuss how we leverage our partnerships with Industry to bring these trainings to our research community.
Finally, we will discuss how we map our training to the ARDC skills roadmap and how the ARDC platforms project “Environments to accelerate Machine Learning based Discovery” has enabled collaboration between Monash University and University of Queensland to develop and deliver training together.
contact@ardc.edu.au
Tang, Titus
AI, machine learning, eresearch skills, training, train the trainer, volunteer instructors, training partnerships, training material
Monash University - University of Queensland training partnership in Data science and AI
We describe the peer network exchange for training that has been recently created via an ARDC funded partnership between Monash University and Universities of Queensland under the umbrella of the Queensland Cyber Infrastructure Foundation (QCIF). As part of a training program in machine learning,...
Keywords: data skills, training partnerships, data science, AI, training material
Monash University - University of Queensland training partnership in Data science and AI
https://zenodo.org/records/4287864
https://dresa.org.au/materials/monash-university-university-of-queensland-training-partnership-in-data-science-and-ai-8082bf73-d20f-4214-ad8c-95123e25a36c
We describe the peer network exchange for training that has been recently created via an ARDC funded partnership between Monash University and Universities of Queensland under the umbrella of the Queensland Cyber Infrastructure Foundation (QCIF). As part of a training program in machine learning, visualisation, and computing tools, we have established a series of over 20 workshops over the year where either Monash or QCIF hosts the event for some 20-40 of their researchers and students, while some 5 places are offered to participants from the other institution. In the longer term we aim to share material developed at one institution and have trainers present it at the other. In this talk we will describe the many benefits we have found to this approach including access to a wider range of expertise in several rapidly developing fields, upskilling of trainers, faster identification of emerging training needs, and peer learning for trainers.
contact@ardc.edu.au
Tang, Titus
data skills, training partnerships, data science, AI, training material
Galaxy Training
Galaxy is a hosted web-accessible platform that lets you conduct accessible, reproducible, and transparent computational biological research. It is an international, community driven effort to make it easy for life scientists to analyse their data for free and without the need for programmatic...
Keywords: Galaxy Australia, Galaxy Project, Bioinformatics, Data analysis
Galaxy Training
https://training.galaxyproject.org/training-material/
https://dresa.org.au/materials/galaxy-training
Galaxy is a hosted web-accessible platform that lets you conduct accessible, reproducible, and transparent computational biological research. It is an international, community driven effort to make it easy for life scientists to analyse their data for free and without the need for programmatic skills.
This is a collection of tutorials developed and maintained by the worldwide Galaxy community that show you how to analyse a variety of biological data using Galaxy.
Melissa (melissa@biocommons.org.au)
Galaxy Australia, Galaxy Project, Bioinformatics, Data analysis