OpenRefine
Description
OpenRefine is a powerful free, open source tool for working with messy data: cleaning it, transforming it from one format into another, and extending it with web services and external data.
Each logged-in user on this workspace gets their own instance of OpenRefine.
Login is done via SRAM.
Creation
Create a storage volume
If desired, first create a storage volume before creating the workspace.
See the Getting started page for more info about how and why to create a storage volume.
Create a workspace
In the Research Cloud portal click the ‘Create a new workspace’ button and follow the steps in the wizzard.
See the workspace creation manual page for more guidance.
Access
This catalog item uses SURF Research Access Management (SRAM) authentication.
In order to be able to request access to Catalog Items, you must be marked as having the ‘Developer’ role in your Collaboration in SRAM. This is the case by default if the CO was created for you. However, if you were invited later, it may be necessary to add yourself (or ask your CO-admin to add you) to the src_co_developer group.
Follow the steps mentioned here to request access.
Data transfer options
See our data transfer manuals.
Usage
After logging in, OpenRefine will be accessible through your web browser.
Getting Started
- Click
Create Projectto upload your data file (supports CSV, Excel, JSON, XML, and more) - Preview and configure import options
- Start cleaning and transforming your data
For detailed guides on using OpenRefine’s features, see the OpenRefine documentation.
Tips
Sudo rights
Sudo rights can be obtained by adding collaboration members to the src_co_admin group in SRAM.
Members with sudo rights can install system-level packages and perform administrative tasks on the workspace.