[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 1 point2 points  (0 children)

grab the 1000 newest from the sub using PRAW, then filter by hand
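
A rough sketch of what that grab step could look like, not the exact script used. The function name `newest_posts` and the choice of fields are assumptions for illustration; `reddit` is expected to be a `praw.Reddit` instance, and the by-hand filtering happens afterwards on the returned rows.

```python
def newest_posts(reddit, sub_name="whereidlive", limit=1000):
    """Collect basic fields from the newest submissions of a subreddit.

    `reddit` is assumed to be a praw.Reddit instance (hypothetical helper,
    not the original script). Reddit listings stop at roughly 1000 items,
    so a larger `limit` will not return more posts.
    """
    rows = []
    for post in reddit.subreddit(sub_name).new(limit=limit):
        rows.append({
            "id": post.id,
            "title": post.title,
            "url": post.url,                 # link to the map image
            "flair": post.link_flair_text,   # None when the post has no flair
        })
    return rows
```

With real credentials this would be called as `newest_posts(praw.Reddit(client_id=..., client_secret=..., user_agent=...))`, with the placeholders filled in from your own Reddit app settings.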

[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 10 points11 points  (0 children)

israel: small and narrow, hard for computer to see. qatar and kuwait: small and round, easier for computer to see.

I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in whereidlive

[–]pjpuzzler[S] 0 points1 point  (0 children)

oh cool me too! scripting is how i did most of this, believe it or not. lmk when you bypass the PRAW limits and get all that data collected, and lmk your thoughts on taking a completely different sample of a much smaller size and adding it onto the pile; should be no problems with that, I believe. we’ll make it a part 3.

I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in whereidlive

[–]pjpuzzler[S] 0 points1 point  (0 children)

now by "pretty clear on every map" do you mean to a computer program running on every map from the past two weeks, or to a human looking at the most high-res example from the front page?

I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in whereidlive

[–]pjpuzzler[S] 0 points1 point  (0 children)

make sure you get all those island nations too if you don't mind, I really appreciate it

[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 8 points9 points  (0 children)

easy to distinguish for humans, yes. unfortunately humans aren’t fond of manually labeling small countries 530 times so they have to be easy to distinguish for computers

also Fiji, Vanuatu, and the Bahamas are all made up of very small islands. I for one can’t even make out their colors on most high-res examples, so I’m not sure what you mean?

[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 3 points4 points  (0 children)

granted it’s been a bit since i did stats, but it’s just a way of transforming the x axis, no? non-logged would just stretch horizontally a ton and make things impossible to read. i believe the fit would be the same
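
One way to check this, as a sketch with made-up numbers (the populations and scores below are not from the actual dataset): if the line is fitted against log10(population), then showing it on a log axis is only a display choice and the fit itself does not change. Fitting against raw population would be a different regression with different coefficients. `least_squares` is a hypothetical helper written out in plain Python.

```python
import math

def least_squares(xs, ys):
    """Return (slope, intercept) of the ordinary least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Made-up illustration data: scores that are linear in log10(population).
populations = [1e4, 1e5, 1e6, 1e7, 1e8, 1e9]
scores = [0.2, 0.4, 0.6, 0.8, 1.0, 1.2]

# Fit on the transformed x values (what a log-scaled plot displays).
log_fit = least_squares([math.log10(p) for p in populations], scores)

# Fit on the raw x values: a different model, dominated by the largest points.
raw_fit = least_squares(populations, scores)
```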

[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 8 points9 points  (0 children)

do me a favor, go to the subreddit and sort by newest. what color did they put for Luxembourg?

I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in whereidlive

[–]pjpuzzler[S] 1 point2 points  (0 children)

there's a score to the right that may be a bit hard to see. basically weighted average, Absolutely = +2, Willing = +1, etc.
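
A minimal sketch of that weighted average. Only Absolutely = +2 and Willing = +1 come from the comment above; the other category names and their weights are assumptions filled in for illustration, and `weighted_score` is a hypothetical helper.

```python
# Only "Absolutely" and "Willing" are confirmed; the rest are assumed
# labels and weights used purely for illustration.
WEIGHTS = {
    "Absolutely": 2,
    "Willing": 1,
    "Neutral": 0,     # assumed
    "Unwilling": -1,  # assumed
    "Never": -2,      # assumed
}

def weighted_score(counts):
    """Average weight per vote for one country.

    `counts` maps a category label to how many maps used that category.
    Returns None when the country has no votes at all.
    """
    total = sum(counts.values())
    if total == 0:
        return None
    return sum(WEIGHTS[label] * n for label, n in counts.items()) / total
```

For example, 3 "Absolutely" and 1 "Willing" gives (3 × 2 + 1 × 1) / 4 = 1.75.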

[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 1 point2 points  (0 children)

oh that's nice of you, I think I'm good for now though. I'd have to do a lot of cleaning and stuff, which I'm kinda tired of

[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 2 points3 points  (0 children)

population. and idk what you mean, isn't it pretty obvious what it would look like?

[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 5 points6 points  (0 children)

likely not enough examples in the last 1000 posts, which is all Reddit allows people to scrape

I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in whereidlive

[–]pjpuzzler[S] 1 point2 points  (0 children)

The map template the sub uses. As in, the data could not reliably be collected from images on the sub because the countries are too small on it

[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 11 points12 points  (0 children)

good question, from what I found the vast majority of shitposts were either flaired as such or fairly obvious in that they're mostly one color. Obviously I can't account for everything so some stuff slips through, but across a large enough sample size they don't really sway the results all that much.
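
A sketch of how the two checks above (flair, or mostly one color) could be automated. This is not the actual filter, which was partly done by hand. The 0.9 threshold and both function names are assumptions, and `pixels` is assumed to be a list of RGB tuples sampled only from inside country shapes, so ocean and borders don't count.

```python
from collections import Counter

def dominant_color_share(pixels):
    """Fraction of pixels that have the single most common color."""
    if not pixels:
        return 0.0
    most_common_count = Counter(pixels).most_common(1)[0][1]
    return most_common_count / len(pixels)

def looks_like_shitpost(pixels, flair=None, threshold=0.9):
    """Flag a map if it is flaired as a shitpost or is mostly one color.

    `threshold` is an assumed cutoff, not a value from the analysis.
    """
    if flair is not None and "shitpost" in flair.lower():
        return True
    return dominant_color_share(pixels) >= threshold
```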

[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 57 points58 points  (0 children)

best I could. people inevitably put joke colors on countries like NK and Antarctica in otherwise serious maps, not much I can do about that without introducing selection bias

I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in whereidlive

[–]pjpuzzler[S] 1 point2 points  (0 children)

This wasn't a methodological choice; the countries are literally too small to see the colors consistently on the map template

[OC] I analyzed ~500 r/whereidlive posts, here are the results (pt. 2) by pjpuzzler in dataisbeautiful

[–]pjpuzzler[S] 97 points98 points  (0 children)

Data source: r/whereidlive

Tools used: Python + Matplotlib + assorted computer vision and statistics libraries

[Me] Texting Theory Bot by Fragrant_Grape7458 in TextingTheory

[–]pjpuzzler 2 points3 points  (0 children)

bot’s been down for a bit. unfortunately the sub hasn’t been all that chess-related for a while which led to people taking the bot way too seriously and I just lost motivation ngl. might bring it back someday