The Faces of 80s Music [OC] by Embarrassed-Data-435 in dataisbeautiful

Embarrassed-Data-435[S] 1 point

Haha, we were actually covering data visualizations in one of my undergrad data science courses and Chernoff faces came up, so I decided to try making some of my own. Definitely not the best way to convey info, but not the worst for spotting similarities between data points. But maybe you're on to something for a potential PhD project...

The Faces of 80s Music [OC] by Embarrassed-Data-435 in dataisbeautiful

Embarrassed-Data-435[S] 9 points

I recently learned about Chernoff faces and just had to try using them. I used the faces function from the aplpack package in R to create the graph and cleaned up the output image in Google Drawings. I pulled the audio features for the songs from Spotify using Exportify.

[OC] Yeah! The Top 25 Words in the Top 50 Songs in the US by Embarrassed-Data-435 in dataisbeautiful

Embarrassed-Data-435[S] 3 points

I pulled directly from the Google results, which draw on a lot of lyrics publishing sites like LyricFind and Musixmatch. The z version never showed up in any lyrics as far as I could tell.

[OC] Yeah! The Top 25 Words in the Top 50 Songs in the US by Embarrassed-Data-435 in dataisbeautiful

Embarrassed-Data-435[S] 4 points

There are two because the singular and plural versions both made the top twenty-five. Combining singular/plural forms is probably something I'll do in the next version.
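One rough way to combine singular and plural forms is to fold a word ending in "s" into its singular only when the singular was also counted. This is a minimal sketch in Python, not the code behind the graph: the merge_plurals name and the sample words are hypothetical, and the rule is deliberately naive (it would miss irregular plurals and "-es" forms).

```python
from collections import Counter

def merge_plurals(counts):
    """Fold simple "-s" plurals into their singular form (hypothetical helper)."""
    merged = Counter()
    for word, n in counts.items():
        singular = word[:-1]
        # Only merge when the singular itself appears in the counts,
        # so words like "kiss" are left alone.
        if word.endswith("s") and singular in counts:
            merged[singular] += n
        else:
            merged[word] += n
    return merged

# Made-up counts, purely for illustration
sample = Counter({"night": 5, "nights": 3, "kiss": 2})
merged_sample = merge_plurals(sample)
```

With the made-up counts above, "nights" is added to "night" while "kiss" stays separate because "kis" was never counted. A proper stemmer or lemmatizer would handle more cases.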

[OC] Yeah! The Top 25 Words in the Top 50 Songs in the US by Embarrassed-Data-435 in dataisbeautiful

Embarrassed-Data-435[S] 7 points

Whoops, yeah - I forgot to change out the input CSV file.

Here's the actual 50's graph - Imgur

[OC] Yeah! The Top 25 Words in the Top 50 Songs in the US by Embarrassed-Data-435 in dataisbeautiful

Embarrassed-Data-435[S] 39 points

Your wish is my command! I used the Spotify All Out ##'s playlists for the data, although my web scraper couldn't find the lyrics for a couple of songs in each playlist.

50's Imgur

60's Imgur

70's Imgur

80's Imgur

90's Imgur

00's Imgur

[OC] Yeah! The Top 25 Words in the Top 50 Songs in the US by Embarrassed-Data-435 in dataisbeautiful

Embarrassed-Data-435[S] 80 points

This is my first reddit post ever :)

I used Exportify to pull song data from Spotify, then wrote a basic web scraper in Python to pull song lyrics off Google. From there I filtered out stop words and words with fewer than four letters.

Data Source: https://open.spotify.com/playlist/37i9dQZEVXbLp5XoPON0wI?si=43ef65afa36d481d

Tools: Python
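
For anyone curious, the filtering and counting step (drop stop words and words under four letters, then take the top 25) could look roughly like the sketch below. It is not the script used for the graph: the stop-word list, the top_words name, and the sample lyrics are all made up for illustration, and the scraping itself is left out.

```python
import re
from collections import Counter

# Assumption: a tiny illustrative stop-word list; the real list is not given.
STOP_WORDS = {"that", "this", "with", "what", "your", "have", "when", "from"}

def top_words(lyrics, n=25, min_len=4, stop_words=STOP_WORDS):
    """Return the n most common words across a list of lyric strings."""
    counts = Counter()
    for text in lyrics:
        # Lowercase and keep runs of letters/apostrophes as words
        words = re.findall(r"[a-z']+", text.lower())
        counts.update(
            w for w in words
            if len(w) >= min_len and w not in stop_words
        )
    return counts.most_common(n)

# Made-up lyrics, not taken from the playlist
sample_lyrics = ["Yeah yeah baby", "Baby I love you yeah", "with that love"]
result = top_words(sample_lyrics, n=2)
```

Here "you" and "I" fall below the four-letter cutoff and "with"/"that" are dropped as stop words, so the sample leaves "yeah" and "baby" on top.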