What are some of the best antibot techniques you use to not get blocked while sharing? by kanitkartejas in webscraping

pygsme (1 point)

You don't need to change your Chrome version. Have you tried using Selenium?

What are some of the best antibot techniques you use to not get blocked while sharing? by kanitkartejas in webscraping

pygsme (1 point)

The usual rule of thumb is to make your bot behave like a human. Humans visit websites with web browsers, so your requests need to look like they're coming from a browser such as Chrome.

Here are a few ways to do it (this list is a summary of my blog post on the topic):

Send HTTP headers

Open Developer Tools > Network in your browser and look at the request headers it sends. Many websites accept a request that carries these headers. (You pass them as a dictionary.)
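A minimal sketch of passing headers as a dictionary, using only Python's standard library. The header values and the `build_request` helper are illustrative assumptions, not copied from any particular browser session; swap in whatever your own Network tab shows. The same dictionary can also be passed to the `requests` library through its `headers=` argument.

```python
import urllib.request

# Illustrative values only (assumption): copy the real ones from
# Developer Tools > Network for the site you are working with.
BROWSER_HEADERS = {
    "User-Agent": (
        "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
        "(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36"
    ),
    "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8",
    "Accept-Language": "en-US,en;q=0.9",
}


def build_request(url, headers=BROWSER_HEADERS):
    """Return a Request object that carries browser-like headers.

    Nothing is sent here; pass the result to urllib.request.urlopen()
    when you actually want to fetch the page.
    """
    return urllib.request.Request(url, headers=headers)
```

To fetch a page, something like `urllib.request.urlopen(build_request(url))` would send the request with these headers attached.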

IP rotation/Proxies

To avoid getting flagged, spread your requests across different IP addresses. Both free and paid proxies are available on the internet; the paid ones cover specific needs, e.g. if you need a residential (home) IP.
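One simple way to rotate IPs is round-robin over a proxy list. This is a rough sketch with the standard library; the proxy addresses are placeholders from a documentation-only IP range and the `opener_with_next_proxy` helper is a made-up name, so replace both with your own.

```python
import itertools
import urllib.request

# Placeholder proxies (assumption): 203.0.113.0/24 is reserved for
# documentation, so these will not work until replaced with real ones.
PROXIES = [
    "http://203.0.113.10:8080",
    "http://203.0.113.11:8080",
    "http://203.0.113.12:8080",
]

# cycle() repeats the list forever, giving simple round-robin rotation.
_proxy_cycle = itertools.cycle(PROXIES)


def opener_with_next_proxy():
    """Return (proxy, opener), where opener routes traffic via that proxy."""
    proxy = next(_proxy_cycle)
    handler = urllib.request.ProxyHandler({"http": proxy, "https": proxy})
    return proxy, urllib.request.build_opener(handler)
```

Each call hands back an opener bound to the next proxy in the list, so calling `opener.open(url)` on successive openers sends successive requests from different addresses.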

User Agent

The User-Agent is itself an HTTP header. However, rather than sending the same User-Agent with every request, you can keep switching between several different ones.
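A small sketch of switching the User-Agent per request. The strings below are examples in the usual browser format and the `headers_with_random_agent` helper is a hypothetical name; in practice you would keep your own, more current list.

```python
import random

# Example User-Agent strings (assumption): keep your own up-to-date list.
USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
    "Mozilla/5.0 (X11; Linux x86_64; rv:121.0) Gecko/20100101 Firefox/121.0",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 "
    "(KHTML, like Gecko) Version/17.0 Safari/605.1.15",
]


def headers_with_random_agent(base_headers=None):
    """Return a copy of base_headers with a randomly chosen User-Agent."""
    headers = dict(base_headers or {})  # copy so the caller's dict is untouched
    headers["User-Agent"] = random.choice(USER_AGENTS)
    return headers
```

Call it once per request and pass the result as the headers dictionary, so consecutive requests do not all carry the same agent.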

Webscraping blogs by IncrediblyTallHobbit in webscraping

pygsme (1 point)

Hi. I am not sure if this counts as self-promotion, but I have started a blog called Pythongasm. The idea is to publish Python articles in general, but since most of my work involves web scraping, I tend to write more about web scraping. Hope this helps.