{"id":1875,"date":"2017-10-30T19:05:42","date_gmt":"2017-10-31T02:05:42","guid":{"rendered":"http:\/\/miriamposner.com\/classes\/dh101f17\/?p=1875"},"modified":"2017-10-31T12:56:57","modified_gmt":"2017-10-31T19:56:57","slug":"blog-post-4-the-power-of-openrefine","status":"publish","type":"post","link":"http:\/\/miriamposner.com\/classes\/dh101f17\/2017\/10\/30\/blog-post-4-the-power-of-openrefine\/","title":{"rendered":"Blog Post 4: OpenRefine is Pretty DOPE"},"content":{"rendered":"<h2><strong>Working with OpenRefine<\/strong><\/h2>\n<p>This week we had an opportunity\u00a0to test out the OpenRefine software and learn about the various methods to cleaning up and rearranging data. We started by creating &#8220;facets&#8221; which allowed us to break down an individual column into another dataset. Through this facet, we were able to remove extra white spacing, merge and re-cluster overlapping data, create and separate data into separate columns, and add characters to certain records.<\/p>\n<figure id=\"attachment_1876\" aria-describedby=\"caption-attachment-1876\" style=\"width: 500px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-1876\" src=\"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-content\/uploads\/sites\/7\/2017\/10\/OpenRefine-shabang-300x250.png\" alt=\"\" width=\"500\" height=\"350\" \/><figcaption id=\"caption-attachment-1876\" class=\"wp-caption-text\">Did I break it?! It legit did this for 25 minutes \u00af\\_(\u30c4)_\/\u00af<\/figcaption><\/figure>\n<h2>Our Data: Carnegie Museum of Contemporary Art<\/h2>\n<p>Group 15 is responsible for the Carnegie Museum of Contemporary Art dataset. I think our group will find it most helpful to separate some of our data in the &#8220;medium&#8221; section into multiple medium columns. Some pieces of art have multiple mediums, such as &#8220;watercolor\u00a0and pencil on paper&#8221; versus &#8220;watercolor on paper.&#8221; This is also true for the classification of different pieces, as some are classified into more than one type of art. Using the &#8220;merge and re-cluster&#8221; functions will also be especially helpful for this.<\/p>\n<p>Currently, we&#8217;re not able to load many of the images of the artwork, so it would be helpful to see if there was a way to manipulate the data to either view the actual artwork or to be able to analyze information about the art without actually seeing it.\u00a0For the most part, however, our data is fairly clean already. We are currently on a mission to fill in missing holes within the data, but OpenRefine will really help us to better sort through and understand what we have.<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Working with OpenRefine This week we had an opportunity\u00a0to test out the OpenRefine software and learn about the various methods<\/p>\n","protected":false},"author":139,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_eb_attr":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-1875","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts\/1875","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/users\/139"}],"replies":[{"embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/comments?post=1875"}],"version-history":[{"count":0,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts\/1875\/revisions"}],"wp:attachment":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/media?parent=1875"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/categories?post=1875"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/tags?post=1875"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}