OpenRefine – Introduction to Digital Humanities

My group was assigned the dataset of graphic novels and through this we have discovered that most of the authors happen to be white males and produced in the United States. This OpenRefine would help our group clean up the names of the authors clustering by first and last name to get a better picture of just how many authors there truly are. We would use the white trim out for all the names getting rid of any differences within the spacing and because gender is a big factor in our study we would need to efficiently sort out the males and females in order to be able to identify a number in the difference between male and female graphic novel authors.

Another way this will help our group since we are focusing on the countries in which graphic novels come from would be the numbers that are presented in each country. By using the count system and applying it to the country of origins column we can easily see the difference between the numbers of these novels produced in each country.

Leave a Reply Cancel reply