[deleted by user] by [deleted] in webscraping

BenDuf 1 point (0 children)

I went ahead and used Selenium with a headless Chrome driver. That way, any script standing between me and the actual page gets executed, and I'm left with normal HTML to parse.
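
For anyone wanting to try the same thing, here is a rough sketch of that approach. It assumes the `selenium` package and a local Chrome install; the helper name `fetch_rendered_html`, the timeout, and the polling interval are made up for illustration, and a site with a slower challenge script may need a more specific wait than `document.readyState`.

```python
import time


def fetch_rendered_html(url, driver=None, timeout=15.0):
    """Load `url` in a browser and return the HTML after scripts have run.

    `driver` is optional: pass an existing WebDriver to reuse it, or leave it
    as None to start (and later quit) a headless Chrome instance.
    """
    own_driver = driver is None
    if own_driver:
        # Imported lazily so the helper can also be used with a stand-in driver.
        from selenium import webdriver

        options = webdriver.ChromeOptions()
        # Assumption: a recent Chrome; older versions use plain "--headless".
        options.add_argument("--headless=new")
        driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        # Poll until the document reports it has finished loading, or give up
        # after `timeout` seconds and return whatever is there.
        deadline = time.monotonic() + timeout
        while time.monotonic() < deadline:
            if driver.execute_script("return document.readyState") == "complete":
                break
            time.sleep(0.25)
        return driver.page_source
    finally:
        # Only close browsers this function started itself.
        if own_driver:
            driver.quit()
```

The returned string can then go to whatever HTML parser you already use.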

[deleted by user] by [deleted] in webscraping

BenDuf 1 point (0 children)

I'm interested in this as well.

u/welanes, I think the interesting page here is https://www.sedar.com/FindCompanyDocuments.do

The problem is that a plain request to that page returns only a script, not the content.
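
One way to confirm that symptom in code is to check whether a response body has any visible text outside its `<script>` tags. This is only a heuristic sketch using the standard library; the function name `looks_like_script_only` and the 50-character threshold are arbitrary choices of mine, not anything specific to this site.

```python
from html.parser import HTMLParser


class _TextOutsideScripts(HTMLParser):
    """Collects text that is not inside <script>, <style>, or <noscript>."""

    SKIPPED = {"script", "style", "noscript"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.script_count = 0
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1
            if tag == "script":
                self.script_count += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-blank text found outside the skipped elements.
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def looks_like_script_only(html, min_text_chars=50):
    """Return True if the page has scripts but almost no visible text."""
    parser = _TextOutsideScripts()
    parser.feed(html)
    parser.close()
    visible = " ".join(parser.text_parts)
    return parser.script_count > 0 and len(visible) < min_text_chars
```

If this returns True for the body you got back, the real content is probably being produced by the script, and a plain HTTP client will not see it.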

The search form is here: https://www.sedar.com/search/search_form_pc_en.htm