OpenRefine

Description

OpenRefine is a powerful, free, open-source tool for working with messy data: cleaning it, transforming it from one format into another, and extending it with web services and external data.

Each logged-in user on this workspace gets their own instance of OpenRefine.

Users log in via SRAM (SURF Research Access Management).

Creation

Create a storage volume

If desired, create a storage volume before creating the workspace.

See the Getting started page for more info about how and why to create a storage volume.

Create a workspace

In the Research Cloud portal, click the ‘Create a new workspace’ button and follow the steps in the wizard.

See the workspace creation manual page for more guidance.

Note: Requesting Access to the Catalog Item

Most Utrecht University users will already have access to this catalog item. If you cannot find this catalog item when creating a workspace:

  1. You need to request access to the Utrecht University Catalog Item Collection.
  2. Follow these steps to request access.
  3. Once approved, you’ll be able to create the workspace.

Access

Accessing the Workspace using the yellow Access button

  1. Open your SURF Research Cloud portal.
  2. After you log in, you will see your dashboard. Find your workspace and make sure it is running, then click the yellow “Access” button, which opens the workspace in a new page.
  3. You can then log in with your institutional credentials (Solis ID for Utrecht University users) to access the workspace.

Since you are already signed in to SURF Research Cloud, you may not need to authenticate again.

Data transfer options

See our data transfer manuals.

Usage

After logging in, OpenRefine will be accessible through your web browser.

Getting Started

  1. Click Create Project and upload your data file (OpenRefine supports CSV, Excel, JSON, XML, and more)
  2. Preview and configure import options
  3. Start cleaning and transforming your data
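
When configuring import options for a separator-based file, it can help to know the column separator and header row beforehand. The following is a minimal sketch of such a check, run on your own machine with Python's standard library before uploading. It is not part of OpenRefine or the workspace; the function name `describe_csv` and the sample data are hypothetical and only serve as an illustration.

```python
import csv
import io


def describe_csv(text, sample_size=4096):
    """Guess the delimiter and header row of CSV text (illustrative helper)."""
    sample = text[:sample_size]
    # csv.Sniffer guesses the dialect from a sample of the text;
    # it raises csv.Error when it cannot decide on a delimiter.
    dialect = csv.Sniffer().sniff(sample, delimiters=",;\t|")
    rows = list(csv.reader(io.StringIO(text), dialect))
    header = rows[0] if rows else []
    return {
        "delimiter": dialect.delimiter,
        "columns": header,
        "data_rows": max(len(rows) - 1, 0),
    }


# Hypothetical sample data, for illustration only.
example = "name;city;year\nAlice;Utrecht;2021\nBob;Amsterdam;2022\n"
summary = describe_csv(example)
print(summary)
```

For a real file, read its contents into a string first (for example with `open(path, encoding="utf-8").read()`, assuming the file is UTF-8 encoded) and compare the reported delimiter and columns with what the OpenRefine import preview shows.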

For detailed guides on using OpenRefine’s features, see the OpenRefine documentation.

Tips

Sudo rights

To grant sudo rights, add the collaboration members to the src_co_admin group in SRAM.

Members with sudo rights can install system-level packages and perform administrative tasks on the workspace.