{"id":1110,"date":"2016-10-23T17:26:42","date_gmt":"2016-10-24T00:26:42","guid":{"rendered":"http:\/\/miriamposner.com\/classes\/dh101f16\/?p=1110"},"modified":"2016-10-24T15:19:12","modified_gmt":"2016-10-24T22:19:12","slug":"blog-post-4","status":"publish","type":"post","link":"https:\/\/miriamposner.com\/classes\/dh101f16\/2016\/10\/23\/blog-post-4\/","title":{"rendered":"Blog Post #4:"},"content":{"rendered":"<p><span style=\"font-weight: 400\">For this week\u2019s blog post, I chose to do my data visualization on the topic of my final project- the characters of the DC comics. The dataset provides information on the identities, physical description, and appearances of each other characters. For my visualization, I utilized Google Fusion Table to represent the identities (public vs secret) of the characters, comparing between bad and good characters. Figure 1 represents the identity of the bad\u00a0characters and Figure 2 displays the identity of the good\u00a0characters. Immediately, two key things stood out to me from an initial glimpse.<\/span><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1111\" src=\"http:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/Screen-Shot-2016-10-23-at-3.14.39-PM-300x117.png\" alt=\"screen-shot-2016-10-23-at-3-14-39-pm\" width=\"382\" height=\"149\" srcset=\"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/Screen-Shot-2016-10-23-at-3.14.39-PM-300x117.png 300w, https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/Screen-Shot-2016-10-23-at-3.14.39-PM-768x299.png 768w, https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/Screen-Shot-2016-10-23-at-3.14.39-PM-1024x398.png 1024w, https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/Screen-Shot-2016-10-23-at-3.14.39-PM.png 1132w\" sizes=\"auto, (max-width: 382px) 85vw, 382px\" \/><\/p>\n<p>Figure 1: Identities for Bad\u00a0Characters<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1112\" src=\"http:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/Screen-Shot-2016-10-23-at-3.10.07-PM-300x130.png\" alt=\"screen-shot-2016-10-23-at-3-10-07-pm\" width=\"383\" height=\"166\" srcset=\"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/Screen-Shot-2016-10-23-at-3.10.07-PM-300x130.png 300w, https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/Screen-Shot-2016-10-23-at-3.10.07-PM-768x333.png 768w, https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/Screen-Shot-2016-10-23-at-3.10.07-PM-1024x444.png 1024w, https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/Screen-Shot-2016-10-23-at-3.10.07-PM.png 1168w\" sizes=\"auto, (max-width: 383px) 85vw, 383px\" \/><\/p>\n<p>Figure 2: Identities for Good\u00a0Characters<\/p>\n<p>First, I noticed the additional blank bar, which indicated the number with no values. I did not realize how many of the characters on the dataset had missing information and how much it will affect my analysis. Our group has not cleaned our data yet, but now, I realize the crucial decisions and judgments we will have to make on nearly 600 missing identities. This information severely hampers the narrative we would like to tell, such as how more good characters have public identities as opposed to the bad ones.<\/p>\n<p>Second, as I looked to distinguish between bad and good characters, I instantly looked at the pattens by length. As Nathan You, explains in his article, visual cues are one of the key components of data visualizations and is used to make comparisons. He goes on to explain, length is most commonly used in the context of bar charts and the longer the bar, the greater the value. Additionally, he chooses to display an example of a misleading bar graph, where the axis does not start at zero. This exact misconception occurred with me. For the second figure, I immediately deduced the number of public\u00a0identities to be double the number of secret\u00a0identities because the bar length looks double in length. It took me a few attempts to figure out that this was because the axis started at 500 rather than 0.<\/p>\n<p><span style=\"font-weight: 400\">In conclusion, this data visualization made me realize the extent of missing information we have and the rigorous process of data cleaning I must undergo. Additionally, I also realized how misleading some bar graphs may be because my brain immediately deduced a pattern by length, without looking at the numbers first. Graphs can thus be very useful, but also misleading if not careful. \u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>For this week\u2019s blog post, I chose to do my data visualization on the topic of my final project- the characters of the DC comics. The dataset provides information on &hellip; <a href=\"https:\/\/miriamposner.com\/classes\/dh101f16\/2016\/10\/23\/blog-post-4\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Blog Post #4:&#8221;<\/span><\/a><\/p>\n","protected":false},"author":41,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_eb_attr":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-1110","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/posts\/1110","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/users\/41"}],"replies":[{"embeddable":true,"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/comments?post=1110"}],"version-history":[{"count":0,"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/posts\/1110\/revisions"}],"wp:attachment":[{"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/media?parent=1110"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/categories?post=1110"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/tags?post=1110"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}