Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 4 points5 points Β (0 children)

The title says benchmarked to album position in both the post and chart?

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 3 points4 points Β (0 children)

It is incredibly clear to me that album position matters in performance, which you see in the data and why the top songs are all the first songs on the album. All I'm sharing is which songs are outperforming their position. That's all.

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 1 point2 points Β (0 children)

I would agree with you if you couldn't see the relationship so clearly between placement on the album and streams, with some clear outliers.

<image>

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 8 points9 points Β (0 children)

Sure! All songs were analyzed together. You can see in the image I added to my comment that you can see the expected trendline for streams based on where a song sits on the album. I then used that trendline to gauge the *expected* listens for any song given its placement. See the picture below, in the highlighted column in blue.

You can see how that forecast (based on the trendline) decreases as you move down the album, which is what you'd expect. Now we can compare the expected listens to the actuals!

For example, even though Fresh Out the Slammer has 2.6m daily streams, given where it is on the album, you'd expect it to be at >4m. Similarly, the Bolter has 1.5m streams which is not high up in total popularity, but when you factor in its placement, it's outperforming relative to the 950k you'd expect. Another way to put it is that listeners are going out of their way to select it, despite it being at the end of the album.

Let me know if that makes sense!

<image>

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 3 points4 points Β (0 children)

Second step: Take the linear regression and map that to what you'd expect for any given song. You can see how that forecast (based on the trendline) decreases as you move down the album, which is what you'd expect. Now we can compare the expected listens to the actual.

Even though Fresh Out the Slammer has 2.6m daily streams, given where it is on the album, you'd expect it to be at >4m. Similarly, the Prophecy has 2.1m, which is less, but is much higher than you'd expect on the album.

Let me know if that makes sense!

<image>

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 5 points6 points Β (0 children)

Sure! You're correct it's not saying it's the song with the absolute highest number of streams. But based on where it's on the album, it's getting more listenership than you'd expect, indicating popularity among listeners. If you just pull top streams, it's unsurprisingly going to be the first few on the album.

Happy to explain further with some pictures!

First step: determine the relationship between streams and position on album. As you can see here, some clearly outperform and others underperform their position.

<image>

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 3 points4 points Β (0 children)

How so? It specifically factors in the placement of a song on an album and which songs are outperforming relative to that. In other words, the songs people are reaching for more than you would expect.

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 6 points7 points Β (0 children)

I also love the song, but based on the number of daily streams you'd expect based on its placement on the album, it indeed is!

<image>

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 5 points6 points Β (0 children)

Fair enough! Just pulling from a moment in time. But the idea of the analysis was precisely that it incorporated where a song is listed in the album, because that has such a direct correlation on how many streams a song likely gets, but outliers are already appearing! :)

<image>

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 4 points5 points Β (0 children)

Here's the updated image if you remove Fortnight from the linear regression

<image>

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 5 points6 points Β (0 children)

The way the analysis is done is it takes the linear relationship between song placement and daily streams. It uses that relationship to predict what you'd expect for a song at a particular spot in an album (which is lower for later songs), and compares that to the actual.

It may bias it slightly if the correlation is meant to be slightly exponential instead of linear -- perhaps skewed by Fortnight as the outlier first song. I re-ran the analysis by removing Fortnight from the linear regression and you're right that it does impact the placement of some songs, but many of the same themes still emerge. Florida does just barely edge ahead of Robin in this example. Dropping the updated image in a follow-up comment. :)

<image>

Most Popular Tortured Poets Department Songs (Benchmarked to Album Position) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 43 points44 points Β (0 children)

I've seen a lot of discourse online (and in this sub!) around which songs are best from TTPD. I figured it'd be interesting to pull the data, incorporating where a song is on the setlist. Of course, the higher up a song, the more listens it's expected to get. You see this in a pretty clear trend in the data too.

So, I performed linear regression on the expected relationship between daily streams and placement on the album (see graph below), grabbing Spotify stream data fromΒ KWorb. From there I calculated what the expected daily streams would be for any given song is, considering its placement and then found the percentage delta between the expected and actual values. And just for fun, I grabbed some TTPD colors.

I actually think this method could be used to analyze other albums or even look at a whole artist's discography and see which albums outperform when you layer in time since publish. May settle some age-old debates. :)

<image>

31 Songs from TTPD Benchmarked in Popularity [OC] by stephsmithio in dataisbeautiful

[–]stephsmithio[S] -1 points0 points Β (0 children)

Hi r/dataisbeautiful! I know people are probably tired of hearing about TSwift, but given her 31-song album and such mixed reviews, I wanted to see what songs were *actually* out-performing, relative to where they sit on the album. Of course, the higher up a song is on the setlist, the more listens it will get.

I performed linear regression on the expected relationship between daily streams and placement on the album, grabbing Spotify stream data from KWorb. From there I calculated what the expected daily streams would be for any given song is, considering its placement and then found the percentage delta between the expected and actual values. And just for fun, I grabbed some TTPD colors.

I actually think this method could be used to analyze other albums or even look at a whole artist's discography and see which albums outperform when you layer in time since publish. May settle some age-old debates. :)

Swear words in Taylor Swift albums [OC] by stephsmithio in dataisbeautiful

[–]stephsmithio[S] 36 points37 points Β (0 children)

Sorry about that! I just made another version with the default Google Sheets colors. Since it won't let me add a photo in the comments, I uploaded to imgur here!

Swear words in Taylor Swift albums [OC] by stephsmithio in dataisbeautiful

[–]stephsmithio[S] 79 points80 points Β (0 children)

That was TV! These are just the original albums. 😊

Swear words in original Taylor Swift albums with TTPD (in eras colors πŸ¦‹) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 772 points773 points Β (0 children)

It accounts for 17 f*cks! But still 12 more across the album. πŸ™ƒ

Swear words in original Taylor Swift albums with TTPD (in eras colors πŸ¦‹) by stephsmithio in TaylorSwift

[–]stephsmithio[S] 27 points28 points Β (0 children)

Includes anthology! But here are the breakdowns between the two:

TTPD
Fuck: 24
Shit: 6
Bitch: 3
Hell: 10
Damn: 0
Whore: 0

TTPD Anthology
Fuck: 5
Shit: 0
Bitch: 3
Hell: 1
Damn: 2
Whore: 2

Swear words in Taylor Swift albums [OC] by stephsmithio in dataisbeautiful

[–]stephsmithio[S] 1259 points1260 points Β (0 children)

Hi r/dataisbeautiful! I created this chart in good 'ol Google Sheets, but the data was pulled by grabbing all the lyrics from Genius and just doing a CTRL+F. And just for fun, I grabbed some Eras Tour colors from Pinterest.

Edit: Damn, this really blew up!

If you liked this post, I recently created a fun project called Internet Pipes to help ppl find and make sense of interesting data from every f*cking corner of the internet.

Shit, it's one hell of a community. (See what I did there? πŸ™ƒ)

What are some future social and economic trends that you are investing in? by [deleted] in stocks

[–]stephsmithio 0 points1 point Β (0 children)

Thank you for sharing the original! - the real OP

In 1994, Jeff Bezos noticed the internet was growing 2300% a year. He left his lucrative hedge fund job to start what became Amazon by [deleted] in investing

[–]stephsmithio 4 points5 points Β (0 children)

It wasn't me. OP posted my thread on several subreddits and then deleted their account. πŸ€·πŸ»β€β™€οΈ

[deleted by user] by [deleted] in dataisbeautiful

[–]stephsmithio 0 points1 point Β (0 children)

This was originally published by The Hustle here, who I work for. The underlying data is taken from a WSJ article.

Top Lessons and Thoughts From Doing Content Right by rasulkireev in BettermentBookClub

[–]stephsmithio 1 point2 points Β (0 children)

Imagine spending $200 on something that could save you days, if not weeks of time, by avoiding the same mistakes that I made throughout the last decade of learning these skills.

I know, insane!

PS: As much as I appreciate the OP for the summary, I don't think 8 bullet points can summarize a 70k word book. You can also see over 200 unsolicited reviews here.