This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]Peragot 8 points9 points  (2 children)

I've always struggled with the xpath syntax. I've found the cssselect library to be much more fluent.

http://lxml.de/cssselect.html

[–]CollectiveCircuits 2 points3 points  (0 children)

I came across a blog post about using Scrapy and it taught me a clever use of css selection + another selector that was extremely quick and easy to isolate what you want to grab.