{"id":1891,"date":"2017-10-31T13:01:30","date_gmt":"2017-10-31T20:01:30","guid":{"rendered":"http:\/\/miriamposner.com\/classes\/dh101f17\/?p=1891"},"modified":"2017-10-31T13:01:30","modified_gmt":"2017-10-31T20:01:30","slug":"using-openrefine-rebecca-tan","status":"publish","type":"post","link":"http:\/\/miriamposner.com\/classes\/dh101f17\/2017\/10\/31\/using-openrefine-rebecca-tan\/","title":{"rendered":"Using OpenRefine &#8211; Rebecca Tan"},"content":{"rendered":"<p>Our group received a dataset on what was in people\u2019s homes in 1700s Pennsylvania. Our research questions focused mainly on the issue of gender, although we also had a question on which items were considered luxury items or more costly.<\/p>\n<p>In answering our research question on which items were considered more luxury items, I could use the sort function to sort cell values (under the column Real Penn) as numbers, with largest first. This will then allow me to see the kind of items considered most monetarily valuable, as well as the categories they may fall into. For example, \u201cbonded\u201d items tended to be much more valuable because they referred to slaves, while random items like a bed and bolster had a value of 0 Real Penn.<\/p>\n<p>Another one of our research questions was whether women owned greater quantities of domestic items and why. To start with, we could determine all the instances of women in the dataset. This could be started by creating a text facet for the first name column, and then only including those names which sound indubitably female (e.g. Anne, Betty). However, determining the instances of women is more easily done using Excel pivot tables. In Excel, I can create a new column with both first and last name, and then create a pivot table to better understand the number of individuals in the dataset and how many records each person has. In the original dataset, I can create a new column that assigns gender to these records. However, sometimes in the pivot table, names may appear twice due to white space. I can use OpenRefine to trim the leading and trailing whitespace. This can be done by going to \u201cEdit cells,\u201d choosing \u201cCommon transforms\u201d and then clicking \u201cTrim leading and trailing whitespace.\u201d<\/p>\n<p>I would then want to sum up the amount of domestic items owned per individual in the dataset. This could be done by summing up the values of columns such as sewing and laundry per individual. Unfortunately, however, I am not sure how to do a sum of values for each column in OpenRefine, and then combine the sums of all the values for each column relating to domestic goods. I also need to determine the traits that make up a domestic item, so that I can pull out certain items from the content types and create a new dataset.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Our group received a dataset on what was in people\u2019s homes in 1700s Pennsylvania. Our research questions focused mainly on<\/p>\n","protected":false},"author":148,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_eb_attr":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-1891","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts\/1891","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/users\/148"}],"replies":[{"embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/comments?post=1891"}],"version-history":[{"count":0,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/posts\/1891\/revisions"}],"wp:attachment":[{"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/media?parent=1891"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/categories?post=1891"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/miriamposner.com\/classes\/dh101f17\/wp-json\/wp\/v2\/tags?post=1891"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}