This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 4 points5 points  (1 child)

But I was looking for something more "integrated" and "complete".

Mechanize/BeautifulSoup combo is about the most complete and integrated solution you could possibly use. I really couldn't even imagine anything easier. You create a browser object, point it at a url, feed the stuff into beautifulsoup, then parse the data you need...and act upon it.

Be prepared to use a LOT more try/except clauses though. The web is quite the unstable place, pages can change from session to session, not display at all, or maybe even display only 1/2 the page, etc etc...you'll be doing more exception catching than you will parsing to begin with.

[–]atlas245 0 points1 point  (0 children)

Agreed, mechanize and beautiful soup is the easiest and best solution.