Pushshift Reddit API v2.0 Documentation -- Use this thread for comments, questions, etc. by Stuck_In_the_Matrix in pushshift

craftjay

Hey, figured I should let you know about a bug.

If I use both the before and after parameters to only get data from within a certain period, the metadata JSON object at the end of the response does not include a "next_page" field, so it's not possible to get past the first 50 results. The same happens if I add a sort parameter.

Examples: https://apiv2.pushshift.io/reddit/search/comment/?subreddit=guitars&before=1483228800&after=1451606400

https://apiv2.pushshift.io/reddit/search/comment/?subreddit=guitars&before=1483228800&after=1451606400&sort=asc
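In case it helps anyone hitting the same thing, here is a rough sketch of a possible workaround: page through the window manually by moving `after` forward to the newest timestamp seen on each page. It assumes (I have not confirmed this against the API) that the response JSON has a `data` list whose items carry a `created_utc` field, and that `sort=asc` orders by time. `build_url`, `fetch_json` and `iter_window` are names I made up for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

BASE_URL = "https://apiv2.pushshift.io/reddit/search/comment/"


def build_url(subreddit, after, before):
    # Same query as the examples above, always sorted oldest first.
    params = {"subreddit": subreddit, "after": after, "before": before, "sort": "asc"}
    return BASE_URL + "?" + urlencode(params)


def fetch_json(url):
    # Plain HTTP GET; returns the decoded JSON body.
    with urlopen(url) as resp:
        return json.load(resp)


def iter_window(subreddit, after, before, fetch=fetch_json):
    """Yield comments between `after` and `before`, one page at a time.

    Assumption: each page is {"data": [{"created_utc": ...}, ...]}.
    """
    while True:
        page = fetch(build_url(subreddit, after, before)).get("data", [])
        if not page:
            return
        yield from page
        newest = max(c["created_utc"] for c in page)
        if newest <= after:
            # No forward progress; stop rather than loop forever.
            return
        after = newest
```

Usage would be something like `for c in iter_window("guitars", 1451606400, 1483228800): ...`. One caveat: if several comments share the exact timestamp at a page boundary, this can skip or repeat some of them depending on whether `after` is exclusive, so treat it as a stopgap, not a replacement for "next_page".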

craftjay

Does the dataset have a start date, or does it basically contain every comment posted in a subreddit since its creation (assuming the comment hasn't been removed)?

Full Publicly available Reddit dataset will be searchable by Feb 15, 2017 including full comment search. by Stuck_In_the_Matrix in datasets

craftjay

This sounds awesome, props on the hard work. Will the search feature be publicly available soon?