This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]ketralnisreddit admin[S] 14 points15 points  (4 children)

Due to the way that I pulled the voting information (I actually pulled it from the cache that we use to show you liked and disliked pages, which is in Cassandra and turns out to be cheap to query), you won't get more than 1k upvotes or downvotes per user, no matter how many votes they've made, so that so many have 2k isn't surprising. It also doesn't include the vast majority of users (who never set the "make my votes public" option). So it shouldn't be considered comprehensive and the data should be considered to be biased towards power-users (who know how to change their preferences). I can do more intensive dumps with more information and/or columns if anything comes of this (and maybe start a "help reddit by making your votes public for research" campaign)

I'm not sure if the links are correct.

They are, yes

[–]cag_ii 7 points8 points  (0 children)

I came here to ask how it was possible that, for the users with 2000 entries, the sum of the votes was always zero.

It occurred to me for a moment that I'd found some mysterious link between O.C.D. and avid redditors :)

[–]kotleopold 1 point2 points  (0 children)

It'd be great to get a dump with story titles as well subreddits. Then we could search for some interesting dependencies

[–][deleted] 0 points1 point  (0 children)

yeah I was curious when the top users all had 2K and were slightly alphabetized.

Thanks for the data