{"id":1045,"date":"2016-10-22T16:36:13","date_gmt":"2016-10-22T23:36:13","guid":{"rendered":"http:\/\/miriamposner.com\/classes\/dh101f16\/?p=1045"},"modified":"2016-10-30T17:22:00","modified_gmt":"2016-10-31T00:22:00","slug":"week-4-blog-post-data-visualization","status":"publish","type":"post","link":"https:\/\/miriamposner.com\/classes\/dh101f16\/2016\/10\/22\/week-4-blog-post-data-visualization\/","title":{"rendered":"Week 4 Blog Post- Data Visualization"},"content":{"rendered":"<p>For this assignment, I was interested in the Diamond Prices Database. This database included prices of cut diamonds, along with data on color, clarity, and ratings agency. It was taken from the <i>Journal of Statistics Education<\/i> online data archive. It includes data from\u00a0308 round-cut diamonds, taken from a newspaper ad. It had a column for ID number, color, clarity, rater, and price of the diamond.\u00a0I had to manipulate the data-set itself in order to make it presentable in a visual way.<\/p>\n<p>The first thing I did when I opened the data-set was remove the column for Identification Numbers because this was just numbers 1-308 numbering the diamonds in order. It was useless to me. Then, I deleted the column called Rater, which showed which one of\u00a0three independent rating agencies rated the specific diamond. I was not interested in who rated the diamond, so this information was useless to me.<\/p>\n<p>The next thing I did to the data-set was change the Color column from alphabetic data to numeric data. Color refers to the\u00a0degree of color purity in the diamond. In the legend of the data, it said that the color of the diamond was rated on an alphabetic scale from D-I, where D represents the top color purity grade, lesser than D is E, then F, then G, then H, then I. I though that numbers from 1-6 would do the exact same job of representing the color purity of the diamond, and would be easier to present visually. I changed all the D&#8217;s to 1, then the E&#8217;s to 2, then the F&#8217;s to 3 then the G&#8217;s to 4 then the H&#8217;s to 5 and then the I&#8217;s to 6. In my opinion, using an interval scale from 1-6 to rate color with 1 being the best and 6 being the worst color is much more clear and simple than using letters of the alphabet starting with D to represent color, so that is why I made this change in the data-set.<\/p>\n<p>Finally, I copy and pasted all this new data into RAW. I chose to use a scatter-plot to analyze\u00a0and present the data.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1047\" src=\"http:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/DH-300x187.png\" alt=\"dh\" width=\"477\" height=\"298\" srcset=\"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/DH-300x187.png 300w, https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/DH-768x480.png 768w, https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/DH-1024x639.png 1024w, https:\/\/miriamposner.com\/classes\/dh101f16\/wp-content\/uploads\/sites\/5\/2016\/10\/DH.png 1105w\" sizes=\"auto, (max-width: 477px) 85vw, 477px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>The X-Axis of the scatter-plot corresponds to the weight of the diamond, in carats. The Y-Axis of the scatter-plot corresponds to the price of the diamond in Singapore dollars. The size of the radius of the data points corresponds to the color, where the smallest data points have a color rating of 1, which means that they are the best color. In other words, the smaller the radius of the data point, the better the color and the bigger the radius of the data point is, the worse the color is. \u00a0The color of the data- points correspond to their clarity\u00a0(presence or absence of minute flaws). In the data-set, IF means internally flawless. Below IF, the second best clarity is VVS1, which means very very slightly imperfect, then VVS2, then VS1, which means very slightly imperfect, and finally the worst clarity is VS2. \u00a0I created a blue color scheme to portray clarity. The brightest blue represents the best clarity (the IF), and the second brightest blue represents VVS1, then the third brightest blue represents VVS2, and so on until the worst clarity is associated with the lightest blue color of the visualization.<\/p>\n<p>I love my visualization and I am very proud to have created it. I think that it is the best visualization for this type of data, because the most interesting component of the data is the weight of the diamond vs the price. This visualization shows me that generally, as the weight in Carats goes up, the price of the diamond goes up. This is interesting and it shows me that weight is really the biggest determining factor of price. Weight matters much more than color and clarity when determining price, because the size and color of the data points (corresponding to color and clarity, respectively) fluctuates\u00a0over the entire graph. However, there seems to be a strong positive linear relationship between price and weight of the diamonds, as seen in the X and Y axis.<\/p>\n<p>Another very interesting thing that the visualization\u00a0shows me that I never noticed in the data was the fact that the diamonds with the best clarity as generally the smallest diamonds. I can see this because the brightest blue points are clustered near the bottom left of the graph, which shows that they are the smallest and cheapest diamonds. It seems that clarity decreases generally as size increases. This makes sense because the bigger a diamond is, the more space there is for imperfection.<\/p>\n<p>Another interesting\u00a0thing that I noticed in the visualization was that all the outliers (the points that do not strongly adhere to the positive linear relationship between weight and price) are all tiny data points, which mean that they are the best color. This shows me that diamonds with exceptional color can be sold for more than they are worth from weight alone. So even though weight heavily determines the price of a diamond, it appears that diamonds with amazing color have the ability to be sold for more than their weight is worth.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>For this assignment, I was interested in the Diamond Prices Database. This database included prices of cut diamonds, along with data on color, clarity, and ratings agency. It was taken &hellip; <a href=\"https:\/\/miriamposner.com\/classes\/dh101f16\/2016\/10\/22\/week-4-blog-post-data-visualization\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Week 4 Blog Post- Data Visualization&#8221;<\/span><\/a><\/p>\n","protected":false},"author":40,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_eb_attr":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-1045","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/posts\/1045","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/users\/40"}],"replies":[{"embeddable":true,"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/comments?post=1045"}],"version-history":[{"count":0,"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/posts\/1045\/revisions"}],"wp:attachment":[{"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/media?parent=1045"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/categories?post=1045"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/miriamposner.com\/classes\/dh101f16\/wp-json\/wp\/v2\/tags?post=1045"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}