Virtual Library Carpentry: OpenRefine Session 1

-

Location: Zoom Meeting

Events Cds Workshop 1Events Cds Workshop 1

Overview

This session introduces working with data in OpenRefine. You will learn what the OpenRefine software does and how to use it to work with data files. OpenRefine can be used to standardize and clean data in a file. OpenRefine is useful where you have data in a simple delimited format such as a spreadsheet, a comma separated values file (csv), or a tab delimited file (tsv), but with internal inconsistencies either in data formats, where data appears, or in the terminology used.

Objectives

  • Use OpenRefine to Get an overview of a data set
  • Resolve inconsistencies in a data set, for example standardizing date formatting
  • Split cells which contain multiple bits of data so that each piece of data is in its own cell
  • Match local data up to other data sets
  • Enhance a data set with data from other sources
  • Answer questions about the content of a data set using facets
  • Use facets and filters to work with a subset of data
  • Correct data problems through a facet
  • Use clustering to identify and fix replace varying forms of the same data with a single consistent value

Connect via Zoom

Please register for this event.
A URL to join the Zoom session will be emailed to you one hour before the workshop start time. To ensure access to the workshop from the virtual waiting room, your Zoom name should match the information provided when registering for the workshop. Please email cds@nd.edu with any questions or issues.

Prerequisites

You will need to install OpenRefine and download the data file doaj-article-sample.csv to follow the lesson in this session.

OpenRefine does not support Internet Explorer or Edge. For this session please use Firefox, Chrome or Safari instead. See Setup for more information. If you’d like help to address difficulties you encounter with setup, please contact cds@nd.edu.