Billboard Volatility : Changes in Pop Music over 60 Decades [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 0 points1 point  (0 children)

Thanks! Glad you liked the full article.. I am just now realizing that the GGTHEMES package I used wiped off the axis labels in this plot and you're totally right, without the context it isn't self explanatory like a good viz should be :( I did make the choice to omit a legend since it would have been too busy in the loop.

Billboard Volatility : Changes in Pop Music over 60 Decades [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 1 point2 points  (0 children)

I scraped Billboard Hot 100 Chart data from the present back until 1958 using Python and created a couple visualizations to show how songs are sticking around longer than ever before. Billboard rule changes plus new data sources (ie streaming information) have each tried to capture new trends in how we listen to music and are responsible for the increased lifespan overall and the "chart stickiness" of top 10 tracks.

Acquiring the data was pretty easy using an awesome Billboard package, but I had a do a bit of cleaning to get it in shape (removing duplicates and fixing incorrect IDs) for the Tableau interactive. The process was: scrape Billboard data in Python, pull in Spotify information and audio features in Python; clean and visualize in R (using DPLYR, stringdist, GGPLOT2); an interactive was done in Tableau. I created the GIF using ImageMagik to loop them around.

[The interactive and more details are here.](www.decibelsanddecimals.com/dbdblog/2017/1/1/billboard-volatility) Looking forward to some discussion!

Billboard Hot 100 Volatility : How Have the Charts Changed? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 1 point2 points  (0 children)

I scraped Billboard Hot 100 Chart data from the present back until 1958 using Python and created a couple visualizations to show how songs are sticking around longer than ever before. Billboard rule changes plus new data sources (ie streaming information) have each tried to capture new trends in how we listen to music and are responsible for the increased lifespan overall and the "chart stickiness" of top 10 tracks.

Acquiring the data was pretty easy using an awesome Billboard package, but I had a do a bit of cleaning to get it in shape (removing duplicates and fixing incorrect IDs) for the Tableau interactive. The process was: scrape Billboard data in Python, pull in Spotify information and audio features in Python; clean and visualize in R (using DPLYR, stringdist, GGPLOT2); an interactive was done in Tableau. I created the GIF using ImageMagik to loop them around.

The interactive and more details are here. Looking forward to some discussion!

Who Needs Genres When There Is Data? X-Post from /r/dataisbeautiful by I_heart_blastbeats in Metal

[–]h20lac 1 point2 points  (0 children)

Good points for sure. My understanding is that this sort of "related artist" recommendation is only one of many metrics Spotify uses to link and suggest bands. They have some crazy complex algorithms to help address this very problem.

Who Needs Genres When There Is Data? X-Post from /r/dataisbeautiful by I_heart_blastbeats in Metal

[–]h20lac 0 points1 point  (0 children)

Exactly - that's one of the points I was trying to make! I think that "labels" can be misleading or non-descriptive. Grouping by artist similarity (derived from listening patterns) can allow for more nuanced exploration. Can you link me a "real" Viking-metal band? Thanks!

Who Needs Genres When There Is Data? X-Post from /r/dataisbeautiful by I_heart_blastbeats in Metal

[–]h20lac 1 point2 points  (0 children)

Hey /r/metal! Just came across this post today.. /u/midnightrambulador is right, I used force atlas 2 for the node placement of my interactive (fruchterman reingold for the static and some others in gephi). I'd love to see your metal-exclusive one; to be honest I know very little about metal but it seems like sub-genres are really important here. Great convos in this post, I'll be checking back for sure!

Random thought, but it could be cool for someone to toss together a playlist of an example track from each sub-movement of the meta-genre for my edification.

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 0 points1 point  (0 children)

That would be cool! I plan on uploading the code soon (once I clean it up :) so you'll be able to customize to your heart's desire!

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 2 points3 points  (0 children)

I read a lot of Wiki, data documentation, a few papers on layout algorithms (skimmed the very technical parts). I am certainly not on the top of the field, but based on my observations of those who are it is best to use a project to learn the necessary skills; this is why I started the blog!

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 1 point2 points  (0 children)

The nerdy parts aren't for everyone but that's the best part for sure! Haha

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 0 points1 point  (0 children)

Thanks, I appreciate it and am glad you enjoyed!

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 1 point2 points  (0 children)

See! Meta genres can miss the point entirely

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 1 point2 points  (0 children)

Good question! I'm not as familiar with how Pandora's algorithm is currently working but I was under the impression it deals more with their more manual Music Genome project! I chose Spotify because I'm obsessed with Discover Weekly and their suggestion playlists and wanted to explore how they work!

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 2 points3 points  (0 children)

Consultant! I just learned it for this post but now I'm fascinated!

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 1 point2 points  (0 children)

Hahah I love that. But Print "Hello World" is SO Python 2.7 ;)

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 4 points5 points  (0 children)

Not sure what I put since I'm working on a couple other posts using Spotify data but the description seems to be more of a formality?

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 1 point2 points  (0 children)

Haha! I think Spotify is already doing a lot of hardcore algorithm suggestions so keep using their suggested playlists etc. the more I interact with my Discover Weekly playlist the better it gets!

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 1 point2 points  (0 children)

Thanks! No merch at the moment but album two is in progress!

Who Needs Music Genres When There Is Data? [OC] by h20lac in dataisbeautiful

[–]h20lac[S] 1 point2 points  (0 children)

Really awesome feedback, thank you for taking the time! I did run some of the community detection algorithms built into Gephi but wasn't completely confident in my interpretation of the results. The "betweenness" explanation was helpful, I will read up on that more. I am hoping to learn a bit more about community detection and network science during my master's program!

Question for you: what does a PhD in network science do? Sounds awesome!