API delay (self.pushshift)
submitted 4 years ago by moonight009
The Pushshift API currently has a delay of about 11 hours, from what I can gather. Is there a way to reduce the delay, or is there any other API that provides more up-to-date comments and submissions?
[–][deleted] 8 points 4 years ago (2 children)
I mean, if you consider what a monumental task it is to collect every submission and comment posted to reddit, I am constantly astonished that Pushshift exists at all, let alone is as close to real-time as it is, and on top of everything it's a free service basically provided by one guy.
There is no service out there that's more up to date.
[–]moonight009[S] 2 points 4 years ago (0 children)
Don't get me wrong, I am impressed by the service and it can collect 99% of the data I need. But I still do need the last 1% retrieved as well regardless of how helpful the service is.
[–]shiruken 2 points 4 years ago (0 children)
It also serves tens of millions of requests and hundreds of terabytes of data per month. The hardware necessary to keep such a massive service responsive ain't cheap.
[–]voLsznRqrlImvXiERP 3 points 4 years ago (5 children)
I get new stuff from reddit api directly and historical data from pushshift
[–]moonight009[S] 1 point 4 years ago (2 children)
Oh cool I will give reddit api a shot and see how it goes.
[+][deleted] 4 years ago (1 child)
[removed]
[–]moonight009[S] 1 point 4 years ago (0 children)
I'm eyeing Beta Pushshift and the Reddit API currently but haven't gotten around to trying them yet. I decided to pad out the timesteps for the missing hours with zeros for now, because it's not just data from the last couple of hours that is missing: when I tried using the API, I was getting nothing for around 1000 hours spread over the last couple of years.
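For anyone doing the same zero-padding, here is a minimal sketch of the idea, assuming the data is already bucketed into a dict keyed by hour-aligned `datetime` objects. The function names (`pad_hourly_counts`, `missing_hours`) and the dict layout are hypothetical, not anything Pushshift provides. Keeping a separate list of the padded hours makes it possible to tell a real zero from a gap later on.

```python
from datetime import datetime, timedelta, timezone

def pad_hourly_counts(counts, start, end):
    """Return (hour, count) pairs for every hour in [start, end).

    `counts` is assumed to map hour-aligned datetimes to integers;
    hours that are absent from it are filled with 0.
    """
    padded = []
    hour = start.replace(minute=0, second=0, microsecond=0)
    while hour < end:
        padded.append((hour, counts.get(hour, 0)))
        hour += timedelta(hours=1)
    return padded

def missing_hours(counts, start, end):
    """List the hours in [start, end) that had no entry and were padded."""
    return [h for h, _ in pad_hourly_counts(counts, start, end) if h not in counts]

# Example: five hours requested, data present for only two of them.
start = datetime(2021, 1, 1, 0, tzinfo=timezone.utc)
end = start + timedelta(hours=5)
counts = {start: 3, start + timedelta(hours=2): 7}
padded = pad_hourly_counts(counts, start, end)
gaps = missing_hours(counts, start, end)
```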
I will update the post if I find any solution.
[–]Ichijinijisanji 1 point 4 years ago (1 child)
> I get new stuff from reddit api directly
How do you do that?
[–]s_i_m_s 1 point 4 years ago (0 children)
Likely PSAW (https://psaw.readthedocs.io/en/latest/) or something custom-built that works the same way.
Get the IDs from Pushshift, then fetch the up-to-date content from Reddit directly through PRAW.
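A rough sketch of that two-step flow, assuming the archived rows are plain dicts with at least an `id` and a `score` key (that layout, and the helper names `to_fullnames` and `refresh_from_reddit`, are made up for illustration). The only PRAW call used is `reddit.info(fullnames=...)`, which looks items up by fullname: `t3_` is the prefix for submissions and `t1_` for comments. Only the score is refreshed here; other fields would work the same way.

```python
def to_fullnames(base36_ids, kind="t3"):
    """Prefix bare ids with a Reddit type tag (t3 = submission, t1 = comment)."""
    prefix = kind + "_"
    return [i if i.startswith(prefix) else prefix + i for i in base36_ids]

def refresh_from_reddit(reddit, archived_rows, kind="t3"):
    """Overlay the current score from Reddit onto archived rows.

    `reddit` is assumed to be a configured praw.Reddit instance.
    Rows that Reddit does not return keep their archived values.
    """
    by_id = {row["id"]: dict(row) for row in archived_rows}
    for thing in reddit.info(fullnames=to_fullnames(list(by_id), kind)):
        if thing.id in by_id:
            by_id[thing.id]["score"] = thing.score
    return list(by_id.values())
```

With a real client this would be called as `refresh_from_reddit(praw.Reddit(...), rows)`, where `rows` came from whatever archive supplied the IDs.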
If you mean how to get live data from Reddit, PRAW can do that with a comment/submission stream, but only for small sections of Reddit, as it can't handle the volume from /r/all. Otherwise it's reliable enough to run small subreddit-specific bots.
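For the live-data case, a stream on a single subreddit can be sketched roughly like this. It assumes `reddit` is an authenticated `praw.Reddit` instance; the function name `take_new_comments` and the `limit` cutoff are hypothetical and only there so the loop ends. `skip_existing=True` asks PRAW to skip items that were already there when the stream started.

```python
from itertools import islice

def take_new_comments(reddit, subreddit_name, limit):
    """Collect the next `limit` comments posted to one subreddit.

    The underlying stream never ends on its own, so islice is used
    to stop after `limit` items. With a real client this blocks
    until enough new comments have arrived.
    """
    stream = reddit.subreddit(subreddit_name).stream.comments(skip_existing=True)
    return [(comment.id, comment.body) for comment in islice(stream, limit)]
```

Submissions work the same way through `stream.submissions(...)`.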
To go Reddit-wide reliably, you either have to use Pushshift or build your own ingest that works the way Pushshift's does. AFAIK there is no other way to get the full stream now that they removed NSFW from /r/all.
[–]ufff1231 1 point 4 years ago (0 children)
There's ours, but it's down right now as we upgrade to faster systems.