Introduction to Data Cleaning with OpenRefine
Learn basic data cleaning techniques in this self-paced online workshop using open data from data.qld.gov.au and open source tool OpenRefine openrefine.org. Learn techniques to prepare messy tabular data for comupational analysis. Of most relevance to HASS disciplines, working with textual data in a structured or semi-structured format.
Licence: Creative Commons Attribution 4.0 lower case
Contact: s.stapleton@griffith.edu.au;
Keywords: data skills, Data analysis
Additional information
Target audience: MBR student, PhD student, Post-doc / Fellow, Academic, Professional (research-related), Professional (other)
Resource type: Tutorial
Status: Active
Learning objectives:
Learn basic data cleaning techniques in this self-paced online workshop such as:
Exploring tabular data through facets and filters
Implementing ‘tidy data’ principles
Cleaning, organising and preparing data for analysis
Extracting and using a script to automate wrangling on similar data
Download the software and dataset, do activities and watch videos to guide you through the lessons. Give yourself around 2 1/2 hours to complete the workshop.
Contributors: Sharron Stapleton