Week 5: OpenRefine

Think about your group’s own dataset. How might you manipulate the data to be more useful in answering your research questions? What OpenRefine operations will you need to perform in order to do so? What would you like to be able to do to your data that you’re not sure how to do?

The task of looking at large amounts of data is a bit daunting, especially since our data has a lot of empty values cluttering it up. OpenRefine seems super useful for simplifying and cleaning up the data so we can analyze and visualize it correctly. We’d probably start by trimming stray whitespace and removing empty columns.
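In OpenRefine itself, trimming is available from a column's menu under Edit cells > Common transforms. To make the idea concrete outside the tool, here is a rough Python sketch of the same two cleanup steps. The function name `clean_rows`, the column names, and the sample values are all made up for illustration, and the sketch assumes each row is a dictionary whose cells are strings or missing.

```python
def clean_rows(rows):
    """Trim whitespace in every cell, then drop columns that are empty in all rows."""
    # Strip leading/trailing whitespace; treat missing (None) cells as empty strings.
    trimmed = [
        {col: (val or "").strip() for col, val in row.items()}
        for row in rows
    ]
    # Keep a column only if at least one row has a non-empty value in it.
    columns = list(trimmed[0]) if trimmed else []
    keep = [col for col in columns if any(row.get(col) for row in trimmed)]
    return [{col: row.get(col, "") for col in keep} for row in trimmed]


# Hypothetical placeholder rows, not taken from our actual dataset.
sample = [
    {"name": "  Person A ", "county": "County X", "notes": ""},
    {"name": "Person B", "county": " County Y", "notes": "   "},
]
cleaned = clean_rows(sample)
```

Here the `notes` column disappears because it holds only blanks, and the padded names and counties come back trimmed.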

When I created a facet on some columns of my data, OpenRefine had trouble because the facet was too large (this happened with the column of names). I’d have to become a bit more familiar with facets and facet options to understand when to apply this feature.
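A text facet is, roughly, a count of how often each distinct value appears in a column, which is why a column of mostly unique names produces such a long list. The Python sketch below imitates that counting under my own assumptions: `text_facet`, its `limit` parameter, and the sample rows are hypothetical and are not part of OpenRefine.

```python
from collections import Counter


def text_facet(rows, column, limit=None):
    """Count distinct values in one column, most frequent first."""
    counts = Counter(row.get(column, "") for row in rows)
    # limit=None returns every distinct value; a number keeps only the top ones.
    return counts.most_common(limit)


# Hypothetical placeholder rows, not taken from our actual dataset.
sample = [
    {"name": "Person A", "county": "County X"},
    {"name": "Person B", "county": "County X"},
    {"name": "Person C", "county": "County Y"},
]
facet = text_facet(sample, "county")
name_facet = text_facet(sample, "name")
```

Faceting on `county` gives two choices, while faceting on `name` gives one choice per row. That suggests facets are most helpful on columns with a small set of repeated values.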

I want to be able to look at the data on county, crime, age, and economic class for each accused witch side by side, so that I can investigate our hypothesis that many innocent women were killed not necessarily for being witches, but for some other motive, such as money or land.
