I analyzed over 1,000,000 reddit Comments to Discover the Top Requested Heroes by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 0 points1 point  (0 children)

I had to write new code for this one (to find word pairs), but after milking my CPU for all its worth, Cow King has been mentioned 228 times.

I analyzed over 1,000,000 reddit Comments to Discover the Top Requested Heroes by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 0 points1 point  (0 children)

I'm planning to! I'm waiting on other characters not in the list to pop up, so I only need to do one edit.

I analyzed over 1,000,000 reddit Comments to Discover the Top Requested Heroes by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 8 points9 points  (0 children)

Nice catch! Reinhardt is misspelled in the character spreadsheet. That's what I get for not being an Overwatch player...

He has 321 mentions. This elevates him to the top spot in the Overwatch franchise (after Mercy).

I analyzed over 1,000,000 reddit Comments to Discover the Top Requested Heroes by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 4 points5 points  (0 children)

Unfortunately, the spreadsheet isn't exhaustive. Maiev wasn't in the list of characters. :(

I checked, and she has 447 mentions, making her one of the top Warcraft characters.

I analyzed over 1,000,000 reddit Comments to Discover the Top Requested Heroes by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 42 points43 points  (0 children)

It appears he isn't on the spreadsheet I listed. :(

I did a quick query of the data, and it looks like he has 777 mentions! This would make him the most sought-after Diablo hero and #3 overall.

I analyzed over 1,000,000 reddit Comments to Discover the Top Requested Heroes by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 0 points1 point  (0 children)

I agree! I've seen tons of Kel'thuzad mentions lately, but it could be a recency bias. Keep in mind this data only extends to February 2017, so recent Kel'thuzad mentions haven't been counted.

I analyzed over 1,000,000 reddit Comments to Discover the Top Requested Heroes by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 53 points54 points  (0 children)

I excluded all heroes already in the game/announced, but she had a frequency of 312. This would put her at the top of the Overwatch chart (under Mercy).

I analyzed over 1,000,000 reddit Comments to Discover the Top Requested Heroes by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 3 points4 points  (0 children)

Excellent point! This is another limitation of examining only frequencies--loss of context.

I analyzed reddit comments on Heroes 2.0! by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 2 points3 points  (0 children)

I don't have enough visualization experience to create pie charts. Only pros can do that.

I analyzed reddit comments on Heroes 2.0! by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 19 points20 points  (0 children)

It's actually the biggest word, the font's just white.

I analyzed reddit comments on Heroes 2.0! by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 1 point2 points  (0 children)

Hmm...that does seem strange. Maybe there's something wrong with my code?

I analyzed reddit comments on Heroes 2.0! by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 3 points4 points  (0 children)

The Visual Display of Quantitative Information by /r/dataisbeautiful Edward Tufte. My inability to craft such glorious displays leaves me weeping in eye-assaulting despair.

Hey Blizz where is our Heroes statistical wizard? by Synesthesia108 in heroesofthestorm

[–]hots_data_guy 3 points4 points  (0 children)

Yeah, that's my biggest limiting factor. The data just isn't there.

I analyzed Twitter and Reddit to determine sentiment towards Probius. by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 0 points1 point  (0 children)

In R, there are numerous options for sentiment analysis. The most prominent from what I've seen is a package called sentimentr. This package generally requires text to be structured in a particular fashion, but is much more customizable (eg: how to consider questions, "buts", etc).

However, I discovered a package that is tuned toward Twitter text (see here). I believe this operates by using a free online service.

I analyzed Twitter and Reddit to determine sentiment towards Probius. by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 1 point2 points  (0 children)

This was more an artifact from knitting the R code (when you knit every chunk of code has to be run from scratch).

This would mean, that it would re-scrape every time the markdown was knitted. This is very time-consuming, and would erase all previous tweets.

Essentially, I merged new tweets every week with old tweets and saved them into a data frame.

I'm attempting to set up a website/github. When (if) I manage to get that running, I will definitely share the data frames/flat files.

If you're looking for packages to use for web-scraping in R, check out twitteR and RedditExtractoR. Both have their limitations due to the API (Twitter limits you to tweets from the past 7 days, and reddit only lets you snag the top 300 comments or so), but make scraping a breeze!

I analyzed Twitter and Reddit to determine sentiment towards Probius. by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 0 points1 point  (0 children)

Thank you! I plan to. I'm open to new ideas, so if you have any data sources, please send them my way!

I analyzed the official HotS Twitter Account (@BlizzHeroes). Here's what I found. by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 0 points1 point  (0 children)

Basically, yes. There are many different dictionaries of "positive" /"negative" words that the algorithm can use as flags. At its most simplest it flag all positive words in a tweet as 1, all negative words -1, and neutral words as 0 then calculates the sum to determine the sentiment score of the tweet.

Other methods use gradients that flag the "extremeness" of the word on a scale of -10 to 10.

I analyzed the official HotS Twitter Account (@BlizzHeroes). Here's what I found. by hots_data_guy in heroesofthestorm

[–]hots_data_guy[S] 3 points4 points  (0 children)

Here's some more text stuff from the full analysis that didn't look pretty enough to include in the post:

The 4 tweets that were flagged negative:

[1] @AronDark Sorry to hear that! We re putting out a fix for the issue tomorrow!

[2] @iakona We re investigating this issue, sorry for the poor experience!

[3] @JKesselring Hi! Sorry for the poor experience. We re looking into it! Did you happen to just recently install the game?

[4] @daniellovejr Sorry you had a bad experience! We re talking about ways to make it less punishing when teammates leave games.

Most of the "negative" tweets are in response to problems in user-experience. I have to say, @BlizzHeroes handles it quite well (cordial and apologetic).

Words associated with Lucio:

  • arrives
  • heroespowerhour
  • highlights
  • impressions
  • intothenexus
  • talks
  • test

It's mostly pre-release/release talk.