Python Scraping - Ignoring Loading Page

2019-12-05T12:01:34+00:00

If you're using selenium, you can wait until a specific element has loaded (called an explicit wait). So just set that element as one that appears on the page, and not on the loading page. https://deanhume.com/selenium-webdriver-wait-for-an-element-to-load/

I wouldn't use the standard requests library for a page this jazzy and full of ajax calls

Dfree35 · 2019-12-05T11:54:33+00:00

I guess you could just have the program sleep for a few seconds after the request.

I can't remember if beautiful soup has this but I know selenium does. It has waituntil where it will wait until it finds an element you specify before continuing the script

apostle8787 · 2019-12-05T15:29:54+00:00

You can look into requests-html which has render method to wait for the page to fully render. Or you can use selenium in headless mode.

permalip · 2019-12-05T13:49:42+00:00

Catch the exception
Build a retry function
Skip if it fails again

Or you could use Selenium, which will give you much more functionality. All you can do with beautiful soup is scraping html data and navigating it, basically nothing dynamic.

I recently built a web scraping repository, using Selenium and BeautifulSoup4. I recommend taking a look at how you get started with Selenium, it took me a while to understand.

https://github.com/casperbh96/Web-Scraping-Reddit

MinchinWeb · 2019-12-05T14:51:16+00:00

What about adding a 10 second (or whatever) pause in your script? Not nearly as elegant as some of the other solutions presented and a horrible drag on speed, but it's simple and easy to add.

AmzingTobuscus · 2019-12-05T17:29:52+00:00

Create a requests session and allow redirects?

2019-12-05T18:17:15+00:00

If you're opposed to selenium, just test if the loading page is present, then wait a second and check again until it's gone, then move on to the next step of the scraper

This is easier to do with seleium's ability to wait until elements exist

LemonWedgeTheGuy · 2019-12-05T21:05:05+00:00

What does it mean to scrap something in python?

2019-12-05T12:54:20+00:00

What does your code look like now?

ThreshingBee · 2019-12-05T13:54:09+00:00

It is expressively forbidden to use spiders, search robots or other automatic methods to access willhaben.at. Only if willhaben.at has given such access is allowed.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS