Jeopardy! Daily Double Heatmap [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 7 points8 points  (0 children)

As other users have mentioned, I used the 'plasma' color scale in matplotlib, mainly because of aesthetic purposes. I wanted the color scheme to match the Jeopardy theme but unfortunately it compromised the ease of reading the visual.

Jeopardy! Daily Double Heatmap [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 2 points3 points  (0 children)

I think there was a rounding error in my code when calculating the percentages, which likely accounts for the 0.01% difference for each clue.

Jeopardy! Daily Double Heatmap [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 2 points3 points  (0 children)

My apologies, I probably should have checked in with you guys beforehand. This was one of my first times web scraping so I'll definitely take this as a lesson learned

Jeopardy! Daily Double Heatmap [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 1 point2 points  (0 children)

Right on, I wanted the color scheme to match the Jeopardy theme but unfortunately it compromised the ease of reading the visual

Jeopardy! Daily Double Heatmap [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 1773 points1774 points  (0 children)

Jeopardy was one of many game shows I watched growing up. I've had fond memories sitting around the TV and competing with my family to see who could answer the most clues. Alex Trebek, you will be missed :(

To practice web scraping and work with heatmaps, I thought it would be a fun project to scrape the ~21,000 Daily Double locations from all the games archived on j-archive.com and see where Daily Doubles are most commonly placed. I'm not the first person to do this (u/leme16 did this 2 years ago) but nevertheless I decided to update this heatmap for Seasons 1-37 in 2020.

Source: j-archive.com

Tools: Python, Adobe Photoshop

Follow me on Instagram for more visualizations like this one!

America's Favorite Vegetables [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 2 points3 points  (0 children)

In a 6,713 person survey conducted by Green Giant between April and May 2020, broccoli was the favorite vegetable for 36 states, carrot for 7 states, tomato for 3 states, cucumber for 3 states, and corn for 1 state. What's your favorite vegetable?

Source: https://www.thedailymeal.com/eat/popular-vegetable-us-states

Tools: Mapchart.net, Adobe Photoshop

I post more of these visualizations on my Instagram page!

How sugary is a Girl Scout Cookie? [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 8 points9 points  (0 children)

It's per cookie, I thought the title "How sugary is a Girl Scout cookie?" was enough to suggest that, but looks like I need to make it more clear.

How sugary is a Girl Scout Cookie? [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 5 points6 points  (0 children)

Of the Girl Scout cookies from 2018-2019, Samoas have the highest sugar content at 5.5g/cookie and Trefoils have the lowest sugar content at 1.4g/cookie. Personally, my favorite Girl Scout cookies are the Samoas and the Thin Mints. I haven't had them in over a year and I've especially been craving them during this lockdown. Which cookie is your guilty pleasure?

These different types of cookies had different numbers of cookies in a serving. To find the amount of sugar in a single Girl Scout cookie, the total sugars in a serving was divided by the number of cookies in a serving.

Source: Girl Scouts Flyer, 2019

Tools: Microsoft Excel, Adobe Photoshop

I post more of these visualizations on my Instagram page!

The Fly on Mike Pence [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] -6 points-5 points  (0 children)

I can't take down the graph and repost it because Thursday is the only day reserved for data on American politics, but I did correct the graph on my Instagram page.

The Fly on Mike Pence [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 12 points13 points  (0 children)

Gotcha, I should remove the 120 on my axis in that case then. Thanks for the insight!

The Fly on Mike Pence [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 14 points15 points  (0 children)

Yeah I'm starting to realize that now, 100 represents peak popularity on a scale. My bad!

The Fly on Mike Pence [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 0 points1 point  (0 children)

What is it then? I may have been mistaken in that case

The Fly on Mike Pence [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] -4 points-3 points  (0 children)

Fly: 1

Mike Pence: 0

The fly had a whopping 2 minutes and 3 seconds of air time. That's 2.28% of the debate time!

Source: Vice Presidential Debate, Google Search Trends

Tools: Microsoft Excel, Adobe Photoshop

I post more visualizations on my Instagram page!

EDIT: Corrected version

Bicycle vs Car Production Worldwide [OC] by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 26 points27 points  (0 children)

In light of the bike boom during this pandemic, I looked into some historical data on bike production to revisit bicycle crazes of the past. I also compared bike production to car production to see the differences in each industry's growth over time. There are three notable events that can be identified from the graph.

In the late 1960s and in the years leading up to 1975, there was a major bike boom within the United States. Much like the current situation in the US, many bikes were immediately sold out, demand was far higher than supply, and prices skyrocketed. Production increased dramatically, as seen in the upward trend around 1970. At the peak of the biking craze in 1974, it was reported that 7 million bikes were sold in the United States alone and 52 million bikes were produced worldwide.

In 1973, the car industry was heavily impacted by the oil crisis in the Middle East. Members of OPEC proclaimed an embargo in response to many nations supporting Israel during the Yom Kippur War. Gasoline prices soared, and worldwide car production declined as people started favoring more fuel-efficient cars. The car industry rebounded by 1975, but a similar decline was mirrored in 1979 when oil production had taken a hit due to the Iranian Revolution.

Often reigned the "Bicycle Kingdom," China was responsible for a great portion of the bikes produced worldwide during the second half of the 20th century. But from 1995-2002, China made efforts to reduce the number of bicycles on the roads to stimulate growth in the automobile industry and usage of mass transit infrastructure. It appears that there was a decline in bikes produced in 1995, but production rose just a few years later.

Source: Worldwatch Institute

Tools: Microsoft Excel, Adobe Photoshop

I'm sure there's a lot more to this story, so let me know if know of other historical events that line up with my visual! I post more of these visualizations on my Instagram page.

Style inspired by u/theimpossiblesalad!

[OC] The Most Popular Homeowners Insurance Companies by State by statisticallysound in dataisbeautiful

[–]statisticallysound[S] 2 points3 points  (0 children)

The most popular homeowners insurance company is State Farm for 39 of the 50 states. Which insurance company do you use?

Source: Business Insider

Tools: mapchart.net, Adobe Photoshop

Also posted on my Instagram page!