{"id":2016,"date":"2017-11-01T11:32:33","date_gmt":"2017-11-01T18:32:33","guid":{"rendered":"http:\/\/miriamposner.com\/classes\/dh101f17\/?p=2016"},"modified":"2017-11-01T11:32:33","modified_gmt":"2017-11-01T18:32:33","slug":"blog-4-openrefine-2","status":"publish","type":"post","link":"http:\/\/miriamposner.com\/classes\/dh101f17\/2017\/11\/01\/blog-4-openrefine-2\/","title":{"rendered":"Blog 4: OpenRefine"},"content":{"rendered":"<p><span style=\"color: #000000;font-family: Calibri\">For my groups dataset on 19<sup>th<\/sup> century children\u2019s book publishers, there can be a lot of information that can be removed. The dataset can definitely benefit from cleaning. With our datasets, we have the ability to manipulate the data in many ways. The way I see it most beneficial is to focus our data on only certain authors from the United States. In addition to this, we can also focus on only covering the data about publishers as opposed to all the extra types of roles there are in the dataset.<\/span><\/p>\n<p><span style=\"color: #000000;font-family: Calibri\">Some tools on OpenRefine my group can use to perform this is the facet and cluster tool. The main problem of this dataset is that there is too much information. This is all information that our group can hopefully clean up using OpenRefine operations. We can clean up blank cells and repetitions in our data. <\/span><\/p>\n<p><span style=\"color: #000000;font-family: Calibri\">There are a few things I would like to do to my data that I\u2019m not sure how to do. First, I would like to merge data from two different datasets into one to make is simpler and more effective. My group has multiple datasets and some of them are connected to each other through certain types of information. Another thing I would like to do with my dataset is eliminate data repetitions of authors moving places of residence. For some, it is just moving down the street. It would be much more beneficial to just narrow it down and include only the major movements of an author.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>For my groups dataset on 19th century children\u2019s book publishers, there can be a lot of information that can be<\/p>\n","protected":false},"author":167,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_eb_attr":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-2016","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts\/2016","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/users\/167"}],"replies":[{"embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/comments?post=2016"}],"version-history":[{"count":0,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts\/2016\/revisions"}],"wp:attachment":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/media?parent=2016"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/categories?post=2016"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/tags?post=2016"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}