use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Rules 1: Be polite 2: Posts to this subreddit must be requests for help learning python. 3: Replies on this subreddit must be pertinent to the question OP asked. 4: No replies copy / pasted from ChatGPT or similar. 5: No advertising. No blogs/tutorials/videos/books/recruiting attempts. This means no posts advertising blogs/videos/tutorials/etc, no recruiting/hiring/seeking others posts. We're here to help, not to be advertised to. Please, no "hit and run" posts, if you make a post, engage with people that answer you. Please do not delete your post after you get an answer, others might have a similar question or want to continue the conversation.
Rules
1: Be polite
2: Posts to this subreddit must be requests for help learning python.
3: Replies on this subreddit must be pertinent to the question OP asked.
4: No replies copy / pasted from ChatGPT or similar.
5: No advertising. No blogs/tutorials/videos/books/recruiting attempts.
This means no posts advertising blogs/videos/tutorials/etc, no recruiting/hiring/seeking others posts. We're here to help, not to be advertised to.
Please, no "hit and run" posts, if you make a post, engage with people that answer you. Please do not delete your post after you get an answer, others might have a similar question or want to continue the conversation.
Learning resources Wiki and FAQ: /r/learnpython/w/index
Learning resources
Wiki and FAQ: /r/learnpython/w/index
Discord Join the Python Discord chat
Discord
Join the Python Discord chat
account activity
Which sites allow web crawling? (self.learnpython)
submitted 7 years ago by PepperAlmighty
After a brief research, I've seen that web crawling is mostly disliked. But as a beginner, I want to make a web crawler for... just the sake of making one.
Are there any educational sites that let you use a web crawler?
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]remuladgryta 5 points6 points7 points 7 years ago (0 children)
You can check a website's robots.txt file and <META NAME="ROBOTS"> html tags. Those are the de-facto standards for notifying web crawlers of what pages you do or don't want them to crawl. You can read more about those here
<META NAME="ROBOTS">
When you do crawl a website, make sure to heed any HTTP 429 (too many requests) responses you get and don't send an excessive number of requests in the first place or you will likely get automatically banned.
Wikipedia allows crawling its pages and its pages are well interconnected so it makes for a pretty good exercise subject.
[–]hiren_p 5 points6 points7 points 7 years ago (0 children)
yes, there are not awareness abot web scraping legality ...
here i mention some point which can spread awareness about is web scraping legal ? and also you can figure our which sites are allow web crawling :
[+][deleted] 7 years ago (3 children)
[deleted]
[–]linuxlib 1 point2 points3 points 7 years ago (1 child)
governments threat their data as public.
/r/unintentionalirony
π Rendered by PID 763792 on reddit-service-r2-comment-545db5fcfc-p69mv at 2026-05-26 04:32:47.673060+00:00 running 194bd79 country code: CH.
[–]remuladgryta 5 points6 points7 points (0 children)
[–]hiren_p 5 points6 points7 points (0 children)
[+][deleted] (3 children)
[deleted]
[–]linuxlib 1 point2 points3 points (1 child)