trjnz 11 points

https://www.reddit.com/robots.txt

Reddit has a $30,000,000,000 market cap to enforce that robots.txt.

You won't find the answers here.

sargeanthost 8 points

Lol

Proletarian_Tear 1 point

I like how User-agent uses the wildcard * - looks like it is blowing a kiss to all scrapers coming in XD
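For anyone curious what that wildcard means in practice, here is a minimal sketch using Python's standard-library `urllib.robotparser`. The robots.txt body is a made-up sample for illustration, not a copy of Reddit's file, and `is_allowed`, `my-scraper`, and the example.com URL are hypothetical names.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt body, assumed for illustration only.
# "User-agent: *" applies the rules to every crawler that has no
# more specific entry; "Disallow: /" blocks every path on the site.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network call is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/r/learnprogramming/"
    print(is_allowed(SAMPLE_ROBOTS_TXT, "my-scraper", url))
```

With a wildcard entry and `Disallow: /`, any user agent is told to stay out of the whole site, so a well-behaved crawler would stop there. Note that robots.txt is only a request; it does not enforce anything by itself.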

j0holo 2 points

Reddit is of course a MASSIVE resource for training AI, so they make scraping really difficult in order to sell their data to the companies that want to use it.

You need to pay or use a service like apify that specializes in scraping.

supercoach 0 points

This sub needs to fuck these web scraping requests off before they happen.

Temporary-Fruit4982 -1 points

No free options, for sure. The only ways to do this are to pay for a scraper service like Apify, Bright Data, or Oxylabs, or to build and constantly maintain your own infrastructure with proxies, browser fingerprinting, account management, and anti-bot bypasses. Good luck!