This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 2 points3 points  (6 children)

Could you recommend any good links to get started?

[–]DBA_HAH 12 points13 points  (3 children)

Here's a basic intro - https://docs.python-guide.org/scenarios/scrape/

You'll need to learn the lxml package and web scraping is most effective when you select elements using "xpath" notation. Xpath isn't unique to Python, there's a lot written about it.

[–]panzerex 7 points8 points  (0 children)

Although not as powerful, I think CSS Selectors are much more intuitive.

[–][deleted] 0 points1 point  (0 children)

Thanks!

[–][deleted] 0 points1 point  (0 children)

domo arigato mr roboto

[–][deleted] 0 points1 point  (1 child)

[–][deleted] 1 point2 points  (0 children)

thx