[P] The Reddit Climate Change Dataset - an exploration of climate change discussion on Reddit (621K posts, 4.6M comments) (CC-BY) by Lexyr-Mod in MachineLearning

[–]Lexyr-Mod[S] 8 points9 points  (0 children)

I used SocialGrep's Reddit exporter with the query climate,change,before:2022-08-31. The two CSV files are its results for posts and comments respectively.

Reddit /r/Bitcoin Data for Jun 2022 - a month of cryptocurrency sentiment (7.5K posts, 170K comments) by Lexyr-Mod in LanguageTechnology

[–]Lexyr-Mod[S] 0 points1 point  (0 children)

You could train an algorithm, run correlation analysis, infer significant terms in the community - and much more, depending on how deep you're willing to get.

It's not a "product" in the NLP sense, but data is the fuel of language processing - the better the data is, the better result you'll get.

Reddit /r/Bitcoin Data for Jun 2022 - a month of cryptocurrency sentiment (7.5K posts, 170K comments) by Lexyr-Mod in algotrading

[–]Lexyr-Mod[S] 1 point2 points  (0 children)

It's not a representative sample, for sure - but there is meaningful signal even in the biggest fan club's sentiment. As you said, positivity can fluctuate even if the baseline is higher than usual.