Creation of ISS Spacewalks - Makeover Monday
Preparing the data set
1. Examine the data set for completenessI sorted each column to find missing data. The Spacewalkers column had some "n/a" fields so I filled in these values manually.
2. Add extra information to data set
I wanted to add the nationalities of the Spacewalkers and countries of the spacesuits to the data set.
For the nationalities, I used a vlookup to append the data based on the name. However, I did have to be careful about different name spellings used in the data set compared to Wikipedia.
As for the spacesuits used in a spacewalk - I entered this manually based on the footnote symbols in Wikipedia. American spacesuits and Russian spacesuits were used depending on where the mission took place.
3. Final check
I recreated the original visualization as a final check. Through this process, I discovered that some dates were mixed up in the data set. For example, Expedition 18 was listed as Dec 10, 2009 but it should be March 10, 2009 according to NASA.
If you'd like to explore the cleaned data set, I've made it available on Google Sheets. The spacewalk detail and cleaned tabs are most relevant.
Creating the visualization
Design
My design was inspired by this visualization. I liked that they combined a summary bar chart but also displayed each spacewalk individually. However, I thought presenting the data in a circular format was hard to read - although it looks very pretty and fits the space theme!
Thus, I kept the stacked bar chart as a summary and added more information in the scatter chart below.
Colour choices Thus, I kept the stacked bar chart as a summary and added more information in the scatter chart below.
The red and blue combination in the original visualization isn't the worst color combination and they both stand out well against black. However, they're very bright colours and it can be confusing because both Russian and American flags contain both colours.
After many iterations, I eventually chose to use white and beige as they represent the colours of the American and Russian spacesuits. I also had to change the background colour to gray so that they could still be seen.
Data displayed
I chose to exclude 2019 because the year wasn't finished yet and the data would be incomplete. Also, by doing that, I can have a year label at both the starting and ending year in the visualization (as I chose to only label every other year).
I used R to concatenate both of the spacewalker's names into a single label. Within my tooltip, I used the spacewalkers' names and mission. I didn't end up using the nationality of the spacewalker in my visualization as Data Studio currently has a limit of three dimensions for a scatter plot.
Overall, I'm happy with how my visualization turned out. I also appreciated the data preparation exercise I was able to get with this data set.
I chose to exclude 2019 because the year wasn't finished yet and the data would be incomplete. Also, by doing that, I can have a year label at both the starting and ending year in the visualization (as I chose to only label every other year).
I used R to concatenate both of the spacewalker's names into a single label. Within my tooltip, I used the spacewalkers' names and mission. I didn't end up using the nationality of the spacewalker in my visualization as Data Studio currently has a limit of three dimensions for a scatter plot.
------
Overall, I'm happy with how my visualization turned out. I also appreciated the data preparation exercise I was able to get with this data set.
Comments
Post a Comment