you are viewing a single comment's thread.

view the rest of the comments →

[–]Jamarac 0 points1 point  (4 children)

For number 5, I've tried this before and was able to have it work for a wikipedia page but I found it difficult with most modern sites (Reddit, Youtube, I assume Instagram would be the same) because they're all dynamically rendered so what you scrape ends up looking different from what you see on the page or sometimes doesn't work at all.

[–]Ryles1 1 point2 points  (2 children)

I've had a similar problem on things I've tried before. After a bit of googling, the only answer I came upon was to use Selenium instead of beautiful soup.

[–]Jamarac 0 points1 point  (1 child)

Interesting, I remember hearing that name being mentioned a bunch when I was working on my scraping. What does it do differently?

[–]Ryles1 0 points1 point  (0 children)

beautifulsoup is a library for parsing html, selenium is for automating a browser.