OpenRefine

Description

OpenRefine is a free, open-source tool for working with messy data: cleaning it, transforming it from one format into another, and extending it with web services and external data.

Each logged-in user on this workspace gets their own instance of OpenRefine.

Login is done via SRAM.

Creation

Create a storage volume

Optionally, create a storage volume before creating the workspace.

See the Getting started page for more info about how and why to create a storage volume.

Create a workspace

In the Research Cloud portal, click the ‘Create a new workspace’ button and follow the steps in the wizard.

See the workspace creation manual page for more guidance.

Access

This catalog item uses SURF Research Access Management (SRAM) authentication.

To request access to catalog items, you must have the ‘Developer’ role in your SRAM collaboration (CO). This is the default if the CO was created for you. If you were invited later, you may need to add yourself (or ask your CO-admin to add you) to the src_co_developer group.

Follow the steps mentioned here to request access.

Data transfer options

See our data transfer manuals.

Usage

After you log in, OpenRefine is accessible through your web browser.

Getting started

  1. Click Create Project to upload your data file (supported formats include CSV, Excel, JSON, XML, and more)
  2. Preview and configure import options
  3. Start cleaning and transforming your data
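
If you have no data at hand and want to try the steps above, a small practice file can be generated locally. The following is a minimal sketch, not part of the workspace itself: the file name `messy_sample.csv`, the column names, and the row values are all made up for illustration. It writes a CSV in which the same city appears with inconsistent capitalisation and stray whitespace, which is the kind of mess OpenRefine is meant to clean.

```python
import csv
import io

# Hypothetical sample data: the same city spelled inconsistently,
# with stray leading/trailing whitespace and one missing value.
ROWS = [
    ("id", "city", "visitors"),
    ("1", "Amsterdam", "120"),
    ("2", " amsterdam", "95"),
    ("3", "AMSTERDAM ", "101"),
    ("4", "Utrecht", "80"),
    ("5", "utrecht", ""),
]


def build_sample_csv(rows=ROWS):
    """Return the given rows as CSV text with Unix line endings."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerows(rows)
    return buffer.getvalue()


def write_sample_csv(path="messy_sample.csv"):
    """Write the sample CSV to `path` (an assumed file name) and return the path."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        handle.write(build_sample_csv())
    return path


if __name__ == "__main__":
    print(write_sample_csv())
```

Upload the resulting file through Create Project. In the city column, OpenRefine's whitespace trimming (under Edit cells, Common transforms) and its Cluster and edit feature are reasonable first things to try; see the OpenRefine documentation for the exact menu locations in your version.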

For detailed guides on using OpenRefine’s features, see the OpenRefine documentation.

Tips

Sudo rights

To grant sudo rights, add collaboration members to the src_co_admin group in SRAM.

Members with sudo rights can install system-level packages and perform administrative tasks on the workspace.