OpenRefine

This beginner-level hands-on workshop is designed to introduce the basics of working with OpenRefine to clean up, transform and enhance ‘messy’ datasets. At the conclusion of the workshop you will understand what the OpenRefine software does and how to use it to clean and transform data for your own projects.

Prerequisites

To complete this lesson you will need a web browser and OpenRefine.

Schedule

Setup Download files required for the lesson
00:00 1. Introduction to OpenRefine What is OpenRefine? What can it do?
00:15 2. Importing data into OpenRefine How do I get data into OpenRefine?
00:30 3. Layout of OpenRefine, Rows vs Records How is data organised in OpenRefine?
How do I access options to amend data in OpenRefine?
What is the difference between Rows and Records in OpenRefine?
How do I work with single cells that contain multiple values in a list?
00:45 4. Faceting and filtering What is a facet in OpenRefine?
What is a filter in OpenRefine?
How can I use filters and facets to explore data OpenRefine?
How can I easily correct common data issues in my data with OpenRefine?
01:05 5. Transformations - Undo and Redo How do the Undo and Redo features work?
01:10 6. Clustering What is Clustering in OpenRefine and when would you use it?
How does clustering work in OpenRefine?
01:30 7. Working with columns and sorting How do I move, rename or remove columns in OpenRefine?
How do I sort data in OpenRefine?
01:40 8. Introduction to Transformations How do I use transformations to edit my data?
What are the kind of transformations OpenRefine supports?
01:50 9. Writing Transformations What is GREL?
Where do I write GREL expressions in the OpenRefine interface?
How do I write a valid GREL expression?
02:05 10. Transforming Strings, Numbers, Dates and Booleans How do I use transformations to programmatically edit my data?
How do I transform the various data types?
02:25 11. Transformations - Handling Arrays How do I use Arrays in data transformation?
02:45 12. Transformations - Exporting How do I export transformed data?
02:50 13. Advanced OpenRefine functions How do I fetch data from an Application Programming Interface (API) to be used in OpenRefine?
How do I reconcile my data by comparing it to authoritative datasets
How do I install extensions for OpenRefine
03:20 Finish

The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.