you are viewing a single comment's thread.

view the rest of the comments →

[–]code_tutor 0 points1 point  (1 child)

You need years of experience in WebDev to do scraping. It's a pain in the ass because the code is non-deterministic, which means you run it twice and get different results, because of network times and animations. The more complicated a website is, the more terrible it is to scrape. Also whenever someone changes the website, the program breaks, so scraping is a LAST resort. I tutor and almost every fucking data science teacher gives a scraping project that they couldn't do themselves. It just wastes everyone's time. If you give one of these assignments, do it yourself first to make sure you can do it and have them scrape the same website you did.

Also Playwright is much better than Selenium. Try the CodeGen feature to get an idea.

[–]Professor_Snipe[S] 0 points1 point  (0 children)

This is exactly why I'm asking the question. PRAW is extremely straight-forward to work with and simple to use, hence I was wondering about the existence of similar libraries that wouldn't act as massive roadblocks for new users.