{"id":1931,"date":"2017-10-31T21:31:41","date_gmt":"2017-11-01T04:31:41","guid":{"rendered":"http:\/\/miriamposner.com\/classes\/dh101f17\/?p=1931"},"modified":"2017-10-31T21:31:41","modified_gmt":"2017-11-01T04:31:41","slug":"openrefine-photo-dataset","status":"publish","type":"post","link":"http:\/\/miriamposner.com\/classes\/dh101f17\/2017\/10\/31\/openrefine-photo-dataset\/","title":{"rendered":"OpenRefine: Photo Dataset"},"content":{"rendered":"<p>Our issue with our dataset has been the vastness of the collection. It\u2019s broad subject matter, although they are all photographs, come from different periods and are by different artists from around the world. So, having software that can group and breakdown this vast data will be helpful, further, necessary. One of our questions is what location do these photos come from, or what country the artist comes from, and we have been considering sorting the photos by this type. We have formatting issues with the excel data and can reformat (lowercase, uppercase) using edit tools. Additionally, we can explore how the photographs are connected to each other by artist, origin, and content.<\/p>\n<p>For example, I would first create a text facet for artist. I would then clean up the data by going to Cluster. I would merge terms which are intended to be the same, thereby creating an accurate list. Additionally, I would be able to change the case of the entire column, which would be great for other categories like title or location. Further, which categories like date, getting rid of extra white space would clarify things.<\/p>\n<p>One thing that I would really like to know how to do it edit the format of dates. I have several versions of date formats within the dataset, and thus makes it impossible to organize\/visualize. Especially dates which have a specific month and date, and those who have a date range (ex: 1995-1997). I wonder how I could utilize OpenRefine to manipulate it and reorganize it. This way, my team and I will be able to see the relationships between photos based on date.<\/p>\n<p>As I note, I wonder if we could separate date ranges by comma and be able to split by multi-value: would this help for sorting the date at all?<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Our issue with our dataset has been the vastness of the collection. It\u2019s broad subject matter, although they are all<\/p>\n","protected":false},"author":155,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_eb_attr":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-1931","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts\/1931","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/users\/155"}],"replies":[{"embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/comments?post=1931"}],"version-history":[{"count":0,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts\/1931\/revisions"}],"wp:attachment":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/media?parent=1931"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/categories?post=1931"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/tags?post=1931"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}