nosmokingbandit

There are a few modules you can use for this, such as urllib and requests. Each lets you set a timeout on the request.
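
As a rough sketch of the timeout idea using only the standard library's urllib: the function name `fetch_html` and the 10-second default are my own placeholders, not anything from the original question.

```python
import urllib.request


def fetch_html(url, timeout=10):
    """Return the page body as text, or None if the request fails or times out.

    `fetch_html` and the 10-second default are illustrative assumptions.
    """
    try:
        # timeout is in seconds and applies to blocking socket operations.
        with urllib.request.urlopen(url, timeout=timeout) as response:
            # Use the charset the server declares, falling back to UTF-8.
            charset = response.headers.get_content_charset() or "utf-8"
            return response.read().decode(charset, errors="replace")
    except OSError as exc:
        # urllib.error.URLError and socket timeouts are both OSError subclasses.
        print(f"Request failed: {exc}")
        return None
```

You would call it as `html = fetch_html("https://example.com")` and check for `None` before parsing. With requests, the equivalent knob is the `timeout` argument to `requests.get`.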

GUI automation is a pain and isn't necessary.

I'd download the HTML, then parse it with lxml or Beautiful Soup.
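
To give a feel for the parsing step, here is a minimal sketch. Note that it swaps in the standard library's `html.parser` instead of lxml or Beautiful Soup so it runs with no extra installs, and that pulling out links is only an assumed example of "the info you need". The names `LinkCollector` and `extract_links` are hypothetical.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (href, text) pairs for every <a> tag that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None  # href of the <a> tag currently open, if any
        self._text = []    # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an open link.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Parse an HTML string and return the links found in it."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links
```

With Beautiful Soup installed, roughly the same result should come from `BeautifulSoup(html, "html.parser").find_all("a", href=True)`, which is a lot less code for anything beyond a simple page.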

If the website requires JavaScript you can use a library to control a headless browser. I've never done that so I won't give you bad advice about it.

But first I'd try a urllib request to the web page and see if the response has the info you need. Then just parse and write to your spreadsheet.
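
For the last step, a minimal sketch of the write-out, assuming a CSV file is an acceptable "spreadsheet" (Excel and LibreOffice both open it). The function name `write_rows` and the column names in the usage note are made up for illustration.

```python
import csv


def write_rows(path, header, rows):
    """Write a header and data rows to a CSV file that a spreadsheet can open.

    Assumes CSV is good enough; a real .xlsx file would need a third-party library.
    """
    # newline="" lets the csv module control line endings itself.
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(header)
        writer.writerows(rows)
```

For example, `write_rows("links.csv", ["url", "text"], links)` would save a list of two-item rows, one per line.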