Through the column drop down menus, OpenRefine provides options for common transformations to facilitate data cleaning such as:
Facets
Facet options group together identical cells across rows for a particular column, and indicates the number of rows within each group. Facets can be used to:
Access Facet options by clicking on the down arrow next to each column title.
Cluster
Cluster options attempt to group together different cell values that may be alternative representations of the same thing, for example, FFP, Firm Fixed Price and firmed fixed price.
Reorder and/or Remove Columns
Splitting Cells into Columns
Combining Cell Values/ Concatenate
value + cells[‘Column’].value
where "value" is the current column, and 'column' is the name of the column whose values you would like to combine. Note, nothing will happen if the column name is not exact.
You can also add additional syntax using a plus sign, such as,
value + "-" + cells[‘Column’].value
to add, in this example, a dash between the content being combined.
OpenRefine also can support splitting rows into columns, and splitting cells into columns or rows using the dropdown menu for the column.