OpenRefine Exercise

OpenRefine would be a very useful tool for sorting and analyzing the Contemporary Art dataset, especially because the dataset is so large (4870 rows of data). I would use its merging tool to group the art pieces by medium. For example, I would likely merge all oil paintings together, including those classified as “oil on canvas,” “oil on gelatin,” and “oil on board.” In addition, many of the artists’ death dates are unrecorded. I am assuming this means those artists are still alive, although some dates may simply be missing from the records, so it would be useful to fill in the blank dates of death with “still alive.” Finally, this dataset has an abundance of columns, many of which I think are irrelevant to the analysis. I think it would be useful to delete certain columns, like “creation_date_earliest.”
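
To make these three cleanup steps concrete, here is a rough sketch of the same logic in plain Python rather than OpenRefine itself. The sample rows and the column names "title", "medium" and "artist_death_date" are my own assumptions for illustration; only "creation_date_earliest" is a column name I know is in the dataset. The merged label "oil painting" is also just a placeholder.

```python
# Hypothetical sample rows standing in for the real dataset.
# Column names other than "creation_date_earliest" are assumed.
rows = [
    {"title": "A", "medium": "oil on canvas", "artist_death_date": "",
     "creation_date_earliest": "1990"},
    {"title": "B", "medium": "Oil on board", "artist_death_date": "2004",
     "creation_date_earliest": "1975"},
    {"title": "C", "medium": "bronze", "artist_death_date": "  ",
     "creation_date_earliest": "1988"},
]


def clean_row(row):
    """Return a cleaned copy of one row without modifying the original."""
    cleaned = dict(row)

    # Step 1: merge every "oil on ..." variant into a single label.
    if cleaned["medium"].strip().lower().startswith("oil on "):
        cleaned["medium"] = "oil painting"

    # Step 2: fill blank death dates (assumes blank means still alive).
    if not cleaned["artist_death_date"].strip():
        cleaned["artist_death_date"] = "still alive"

    # Step 3: drop a column that is not needed for the analysis.
    cleaned.pop("creation_date_earliest", None)

    return cleaned


cleaned_rows = [clean_row(row) for row in rows]
```

In OpenRefine these would be done through the interface instead (a text facet or clustering on the medium column, a transform on blank cells, and removing the column), but the sketch shows what each step changes in the data.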

One comment

  1. I’m working with a more creative dataset as well, and something you could perhaps explore is which mediums are used most frequently. To do so, you can use the facet function to group pieces of the same medium, then sort by count to see which ones are most popular.
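
The idea of faceting by medium and sorting by count can be sketched in Python as well. This is only an illustration of what a text facet sorted by count reports; the list of medium values below is made up, not taken from the dataset.

```python
from collections import Counter

# Hypothetical medium values, one per art piece.
mediums = [
    "oil on canvas", "acrylic on canvas", "oil on canvas", "bronze",
    "oil on board", "oil on canvas", "bronze",
]

# Count how many pieces share each medium, most frequent first.
medium_counts = Counter(mediums).most_common()

for medium, count in medium_counts:
    print(f"{medium}: {count}")
```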
