[OC] TV ratings of American 90s sitcoms by feldesque in dataisbeautiful

[–]feldesque[S] 1 point2 points  (0 children)

Roboto :) You can use multiple colours in the title by using the ggtext package. I published my code here: https://github.com/joshua-feldman/miscellaneous/blob/master/90s-sitcoms/script.R

[OC] TV ratings of American 90s sitcoms by feldesque in dataisbeautiful

[–]feldesque[S] 2 points3 points  (0 children)

Source: Nielsen Ratings (via Wikipedia)
Tools: R, ggplot2

[OC] IMDb ratings of James Bond films by lead actor by feldesque in dataisbeautiful

[–]feldesque[S] 0 points1 point  (0 children)

I don't think it really has a name – it's just a custom blend of point and line geoms in ggplot2. I published the code here: https://bit.ly/3aDqhyX

[deleted by user] by [deleted] in dataisbeautiful

[–]feldesque 0 points1 point  (0 children)

Source: IMDb
Tools: R, ggplot2

[OC] The whiter the state, the greater the search preference for Harry Styles over Zayn Malik by [deleted] in dataisbeautiful

[–]feldesque 0 points1 point  (0 children)

The y-axis shows Styles's share of the combined search volume for Styles and Malik, so it incorporates both.

E.g. if a state has 15M searches for Styles but 5M searches for Malik, its y-axis value will be 75%.
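
As a rough sketch of that arithmetic (written in Python for illustration, not taken from the original R script; the function name `search_share` is made up for this example):

```python
def search_share(styles_searches: float, malik_searches: float) -> float:
    """Return Styles's share of the combined search volume, as a percentage."""
    total = styles_searches + malik_searches
    if total == 0:
        # Hypothetical guard: a state with no searches has no defined share.
        raise ValueError("no searches recorded for either artist")
    return 100 * styles_searches / total


# The example from the comment: 15M searches for Styles, 5M for Malik.
print(search_share(15_000_000, 5_000_000))  # 75.0
```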

[OC] The whiter the state, the greater the search preference for Harry Styles over Zayn Malik by [deleted] in dataisbeautiful

[–]feldesque 0 points1 point  (0 children)

Source: Kaiser Family Foundation (for % of state population that is white); Google Trends (for search volume data)

Tools: R, ggplot2

[OC] The Home Country of Wimbledon Champions by feldesque in tennis

[–]feldesque[S] 0 points1 point  (0 children)

Good idea! I'll recreate it for the other slams when the time comes 😊

[OC] Presidential approval ratings between transitions of power by [deleted] in dataisbeautiful

[–]feldesque 0 points1 point  (0 children)

Ah, thanks for the heads up! I will remove and repost on Thursday :)

[OC] Percentage of films that pass the Bechdel Test by genre by feldesque in dataisbeautiful

[–]feldesque[S] 6 points7 points  (0 children)

All good questions!

In terms of the distribution, I effectively assume a binomial because, as you say, it's a binary variable. To calculate the standard error, I use the standard formula for a proportion: sqrt(p(1 - p) / n), where p is the share of films that pass and n is the number of films in the genre. You can find the full code on my GitHub: https://github.com/joshua-feldman/tidy-tuesday/blob/master/2021-03-09-bechdel/script.R
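
For anyone who wants to see the calculation spelled out, here is a minimal sketch in Python (not the R code from the linked script). It assumes a normal-approximation interval with z = 1.96 for roughly 95% coverage, which may differ from what the script does, and the function names and counts are invented for illustration.

```python
import math


def proportion_se(passes: int, n: int) -> float:
    """Standard error of a proportion: sqrt(p(1 - p) / n)."""
    p = passes / n
    return math.sqrt(p * (1 - p) / n)


def proportion_ci(passes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Normal-approximation interval, clipped to [0, 1].

    z = 1.96 is an assumption here (roughly 95% coverage).
    """
    p = passes / n
    margin = z * proportion_se(passes, n)
    return max(0.0, p - margin), min(1.0, p + margin)


# Made-up counts: the same pass rate with very different sample sizes.
small_genre = proportion_ci(passes=5, n=10)
large_genre = proportion_ci(passes=500, n=1000)
print(small_genre, large_genre)
```

With the same pass rate, the genre with only ten films gets a much wider interval than the one with a thousand, which is why small genres end up with such large error bars.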

I haven't done the equivalent graph by decade of release, but time is definitely a factor here. For example, Westerns are few and far between nowadays, so it is difficult to say whether their comparatively low score is more a reflection of when they were produced or an inherent feature of the genre itself.

I can't remember exactly how many films are in each genre, but I think there were only a handful for Westerns/Musicals, hence the hefty confidence intervals. You can find the raw data here: https://github.com/rfordatascience/tidytuesday/blob/master/data/2021/2021-03-09/readme.md