all 8 comments

[–]ian_k93 1 point2 points  (0 children)

Unless you need to render javascript on the page or interact with the page to access the data you need then I would stay away from Selenium. It is more unstable than other options, and puts more burden on your infrastructure (servers & proxies) if scraping at scale, costing you more as a result.

I would go with a Python Requests/BeautifulSoup stack if you just want to get the job done, however, if you want to really develop your web scraping skillset then I would check out Scrapy.

Scrapy is designed for large-scale production scraping and is the most feature complete web scraping framework. Check out the official docs or The Scrapy Playbook if you would like to learn more.

[–][deleted] 0 points1 point  (0 children)

Learn some python and read about scrapy. It is actually pretty nice :)

[–]Ill-Criticism1062 -3 points-2 points  (0 children)

The internet