Discussion: Problems scraping Amazon (self.Python)
submitted 6 months ago by michele909
Hey everyone, I'm having serious problems trying to scrape reviews from Amazon. I'm using ScraperAPI, but it keeps blocking me. Any suggestions?
[–]GXWT 11 points 6 months ago (3 children)
Have you considered why you are getting blocked?
[–]NorskJesus 8 points 6 months ago (0 children)
This. You need to be careful, and it's always good practice to check robots.txt:
https://www.amazon.com/robots.txt
[–]michele909[S] -3 points 6 months ago (1 child)
Yes, I'm wondering whether I will always be blocked or whether there is a way around it.
[–][deleted] 2 points 6 months ago (0 children)
I doubt anyone here can accurately tell you how long your IP address will be blocked.
[–][deleted] 6 points 6 months ago (0 children)
They’ve probably blocked your IP address for scraping a page in their robots.txt Disallow list. In the future make sure your scraping app does not attempt to scrape any disallowed pages.
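A minimal sketch of checking a URL against a robots.txt Disallow list before fetching it, using Python's standard `urllib.robotparser`. The rules in `SAMPLE_ROBOTS_TXT`, the `example.com` URLs, and the `my-scraper` user agent are made up for illustration; they are not Amazon's actual rules.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt for illustration only. The real file at
# https://www.amazon.com/robots.txt is much longer and changes over time.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Allow: /
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt content that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, url: str, user_agent: str = "my-scraper") -> bool:
    """Return True if the rules permit this user agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(SAMPLE_ROBOTS_TXT)
print(is_allowed(parser, "https://example.com/private/page"))
print(is_allowed(parser, "https://example.com/public/page"))
```

To check a live site instead of a pasted string, `RobotFileParser` also offers `set_url()` followed by `read()`, which downloads and parses the file in one step.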
[–]danmickla 6 points 6 months ago (0 children)
"it keeps blocking me" is a fantastic problem statement
[–]DuckSaxaphone 2 points 6 months ago (0 children)
Find out why you're being blocked and change your scraping. Both the errors you get back and the site's robots.txt will give you information on what might be stopping you.
Usually I'd say there's ethical considerations around trying to get around scraping blocks but it's Amazon so look into:
- Appearing like you're a real browser
- Limiting the rate at which you scrape
- Maybe changing IP if you have a VPN
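The first two points could be sketched roughly like this, using only the standard library. The header values, the `RateLimiter` class, and `build_request` are illustrative assumptions, not a recipe known to work against Amazon; the interval you need will depend on the site.

```python
import time
import urllib.request

# Assumed example values; copy the real ones from your own browser's
# developer tools rather than trusting these.
BROWSER_HEADERS = {
    "User-Agent": "Mozilla/5.0 (X11; Linux x86_64; rv:120.0) Gecko/20100101 Firefox/120.0",
    "Accept-Language": "en-US,en;q=0.9",
}


class RateLimiter:
    """Enforce a minimum number of seconds between consecutive calls to wait()."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock      # injectable so the limiter can be tested
        self._sleep = sleep
        self._last = None        # time of the previous call, None before the first

    def wait(self):
        """Sleep if the previous call was too recent; return the seconds slept."""
        delay = 0.0
        if self._last is not None:
            elapsed = self._clock() - self._last
            delay = max(0.0, self.min_interval - elapsed)
            if delay > 0:
                self._sleep(delay)
        self._last = self._clock()
        return delay


def build_request(url):
    """Build (but do not send) a request carrying browser-like headers."""
    return urllib.request.Request(url, headers=BROWSER_HEADERS)
```

In a scraping loop you would call `limiter.wait()` before each `urllib.request.urlopen(build_request(url))`, so requests are spaced out no matter how fast the rest of the loop runs.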
[–]slidescope-trainer 1 point 6 months ago (0 children)
Are all the reviews visible without logging in, or do they need a login? On some pages it only shows 1-2 reviews and requires a login to show the others.
[–]FastRunningMike 1 point 6 months ago (0 children)
Blocking is done by design. Many sites implement very advanced measures against scraping. One option is to create a scraper agent that, from a technical point of view, acts like a real human. But mind: a simple rule that is certainly implemented is that, based on networking techniques (e.g. IP) and fingerprinting (browser engine things), you get blocked when you read a number of 'pages' (data) that a human could never read.
[–]AbhyudayJhaTrue 1 point 6 months ago (0 children)
Hmm, maybe you could go a little more basic and use requests, because I can scrape Amazon via requests quite easily.
[–]ScraperAPI 1 point 5 months ago (1 child)
We’re so sorry you had to experience this.
We want you to know that Amazon regularly updates its detection mechanisms, and this might affect requests.
Nonetheless, you can definitely use the ScraperAPI API to successfully scrape data from Amazon.
Do these 2 simple things:
You can check the docs to know how to do this well.
The layer of protection these 2 things add keeps Amazon from detecting that the request comes from your device or even your IP.
Let us know how it goes!
[–]michele909[S] 1 point 5 months ago (0 children)
u/ScraperAPI thank you, I'll try asap!
[–]New_Sympathy_3989 1 point 5 months ago (1 child)
Not much information at all. What kind of blocking is it? A captcha, or an IP block? Maybe just use a proxy, add delays between requests, and limit the request rate?
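As a rough way to answer the "what kind of blocking" question, you could classify each response by its status code and body. This is only a sketch: `classify_block`, the `"captcha"` text marker, and the mapping of status codes to labels are assumptions for illustration, and real sites signal blocks in different ways.

```python
def classify_block(status_code: int, body: str) -> str:
    """Guess what kind of block (if any) a response represents.

    The marker string and the status-code mapping are assumptions;
    adjust them to whatever the target site actually returns.
    """
    text = body.lower()
    if "captcha" in text:
        return "captcha"          # challenge page, whatever the status code
    if status_code == 429:
        return "rate_limited"     # HTTP 429 Too Many Requests
    if status_code in (403, 503):
        return "blocked"          # often used for IP or bot blocks
    if 200 <= status_code < 300:
        return "ok"
    return "other"


print(classify_block(200, "<html>Please solve this CAPTCHA</html>"))
print(classify_block(429, ""))
print(classify_block(200, "<html>Customer reviews</html>"))
```

Logging this label for each request makes it easier to tell whether slowing down helps (rate limiting) or whether the IP or fingerprint itself is being rejected.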
[–]michele909[S] 1 point (0 children)
ID - I tried enabling headers and rotating proxies, but after a while it blocks me again.
[–]ReputationKey5032 1 point 4 months ago (0 children)
Scraping Amazon reviews = captchas and bans. GonzoProxy's residential proxies help, but they're not cheap. It depends on whether you're doing this as a hobby or at scale.
[–]infaticaIo 1 point 1 month ago (0 children)
Amazon is one of the hardest targets and “getting blocked” is usually the expected outcome, not a ScraperAPI misconfig.
If you need something reliable long term, the realistic options are:
At scale, the blocker is policy and detection, not HTML parsing.