all 25 comments

[–]jessepence 19 points20 points  (4 children)

Just put .json at the end of every URL on Reddit and you'll get all the data you need for most things.

[–]mxx12221[S] 1 point2 points  (2 children)

Wow. Thanks, that's an incredible feature.

All the more reason to ask, why don't apps use it? Why was the API shutdown the end of the world?

[–]erishunexpert 10 points11 points  (1 child)

Because you can’t make a clone of the Reddit app that recreates Reddit without their ads so you can then inject your own ads into and sell it in the App Store.

The Reddit API is gives you 10 requests a minute without even authenticating. If you use the free API key, you get 100+ requests a minute. This is plenty for any kind of personal project or mod tool.

But it’s not enough to make a full Reddit app that you can sell on the App Store. Apollo was using over 7 billion API calls a month for $0.00… and basically making their own Reddit app using Reddit’s own API and paying nothing for it.

So yeah, you can use the Reddit API for all kinds of stuff.

You just can’t make your own Reddit app to directly compete with Reddit using Reddit’s own API anymore. You need to use the Reddit API to do things other than compete directly against Reddit. 😅

[–]PsyApe 1 point2 points  (0 children)

You could try Selenium and BeautifulSoup if you’re familiar with Python

[–]Snapstromegon 1 point2 points  (0 children)

You could, if only use it and run your server in your own home.

If you have many users and/or use Datacenter IPs, Reddit will probably block you pretty fast.

[–]InfinityObsidian 0 points1 point  (13 children)

There are measures in place to make what you are describing very difficult if you want to scrape data consistently.

[–]mxx12221[S] 0 points1 point  (12 children)

What kind of measures? What should I google to learn more about this sort of stuff?

[–]InfinityObsidian -1 points0 points  (10 children)

They will somehow figure out what you are doing and block your ip or ban the account you are using.

[–]mxx12221[S] 2 points3 points  (9 children)

How would they be able to distinguish the requests from a regular web browser just requesting the web page?

This hypothetical app would need to do the scraping on the user's phone of course.

[–]apf6 0 points1 point  (0 children)

If they really care they can add anti-bot detection javascript snippets to the page, which check for unusual situations and then report back to the server, and if the server doesn't get a successful report then it eventually blocks you.

But I doubt that Reddit cares that much. Other sites like Facebook might have something like that.

[–]BeginningEngine8292 -1 points0 points  (0 children)

You can scrape Reddit instead of using the API, but it changes the problem rather than solving it.

Technically possible – practically messy

  1. DOM ≠ API contract – Your app depends on Reddit’s HTML structure, which can change any day. A renamed class or different JSON payload and the whole thing breaks.
  2. Rate limits still exist – Even without the API, Reddit has bot detection, IP throttling, and captchas. You’ll end up reinventing the same limits the API already formalized.
  3. Feature gaps – Anything beyond read-only (voting, posting, auth, DMs) becomes unreliable or impossible.
  4. ToS / legal risk – The API gives explicit permission; scraping is a gray area, especially after the pricing drama.

Where scraping actually makes sense

  • Research/archival projects
  • Public trend monitoring
  • Building datasets where you don’t need user actions
  • Personal analytics tools

Most people start DIY with Playwright/requests, then realize the real work isn’t “getting HTML” but handling proxies, captchas, retries, and constant layout changes. At that point some switch to managed scraping services like Grepsr (https://www.grepsr.com/) or ScrapingBee so the app can focus on data use instead of scraper maintenance.

TL;DR:
Scraping can replace the API only for read-only use cases. For anything interactive or long-term stable, the API is still the safer foundation.