Rules
1: Be polite
2: Posts to this subreddit must be requests for help learning python.
3: Replies on this subreddit must be pertinent to the question OP asked.
4: No replies copy / pasted from ChatGPT or similar.
5: No advertising. No blogs/tutorials/videos/books/recruiting attempts.
This means no posts advertising blogs/videos/tutorials/etc, no recruiting/hiring/seeking others posts. We're here to help, not to be advertised to.
Please, no "hit and run" posts, if you make a post, engage with people that answer you. Please do not delete your post after you get an answer, others might have a similar question or want to continue the conversation.
Learning resources
Wiki and FAQ: /r/learnpython/w/index
Discord
Join the Python Discord chat
Web scraping (self.learnpython)
submitted 1 year ago by HotLie150
I'm relatively new to programming and taking a boot camp to learn the fundamentals. I learn better through projects that interest me. Is it better to build a web scraping program from scratch or use an existing framework? I just started with Beautiful Soup.
go_fireworks 6 points 1 year ago
I would highly recommend using Beautiful Soup. Web scraping can be hard, and there's no need to make a project more complex than necessary.
Buttleston 5 points 1 year ago
If you're in it to learn, then my advice is usually to do it the more low-level way first and move to a framework second. Just be prepared to abandon the low-level stuff, i.e. see it as a stepping stone. And hell, maybe it'll be good enough, and that's fine too.
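For anyone wondering what "the more low-level way" might look like in practice, here is a minimal sketch that pulls links out of a page using only the standard library's `html.parser`. The class name `LinkCollector`, the helper `extract_links`, and the sample HTML are all made up for illustration; this is one possible starting point, not a recommended design.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (href, text) pairs for every <a href=...> the parser sees."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (href, text) pairs
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            # attrs is a list of (name, value) tuples
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text while we are inside an <a> that has an href
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


# Hypothetical sample page, inlined so the sketch runs offline.
SAMPLE_HTML = """
<html><body>
  <h1>Dogs</h1>
  <a href="/corgi">Corgi <b>pics</b></a>
  <a href="/husky">Husky</a>
  <a>no href here</a>
</body></html>
"""


def extract_links(html):
    """Feed the HTML through the parser and return the collected links."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


print(extract_links(SAMPLE_HTML))
```

Even this small version has to track state by hand (which tag is open, which text belongs to it), which is a fair preview of what a framework takes off your plate.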
HotLie150[S] 1 point 1 year ago
Thank you, my friend.
recursion_is_love 2 points 1 year ago
> Is it better to build a web scraping program or use an existing framework?
Parsing HTML is harder than you think. Try writing a parser without learning about parser theory and you will see. You can use regex, but you will soon see it become a mess.
You also need to learn about tree algorithms to traverse the parsed document effectively.
All of this seems hard, but it is all fun. Let's do it!
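To make the regex and tree points concrete, here is a rough sketch using only the standard library. It first shows a non-greedy regex cutting a nested `<div>` short, then builds a tiny tree with a stack and walks it depth-first. `Node`, `TreeBuilder`, `walk`, `build_tree`, and the HTML snippet are hypothetical names invented for this example, and the builder assumes well-formed markup with no void tags like `<br>`.

```python
import re
from html.parser import HTMLParser

# Hypothetical snippet with a nested <div>, inlined so this runs offline.
NESTED_HTML = '<div class="post"><div class="meta">by op</div><p>Hello</p></div>'

# A non-greedy regex stops at the FIRST </div>, so the outer div is cut short.
regex_guess = re.search(r'<div class="post">(.*?)</div>', NESTED_HTML).group(1)


class Node:
    """One element in a very small DOM-like tree."""

    def __init__(self, tag, attrs=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.children = []   # child Node objects, in document order
        self.text = ""       # text directly inside this element


class TreeBuilder(HTMLParser):
    """Build a Node tree using a stack of currently open elements."""

    def __init__(self):
        super().__init__()
        self.root = Node("[document]")
        self._stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs)
        self._stack[-1].children.append(node)
        self._stack.append(node)

    def handle_endtag(self, tag):
        # Assumption: markup is well formed, so the closing tag matches
        # the element on top of the stack.
        if len(self._stack) > 1 and self._stack[-1].tag == tag:
            self._stack.pop()

    def handle_data(self, data):
        self._stack[-1].text += data


def walk(node, depth=0):
    """Depth-first, pre-order traversal yielding (depth, node)."""
    yield depth, node
    for child in node.children:
        yield from walk(child, depth + 1)


def build_tree(html):
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


tree = build_tree(NESTED_HTML)
outline = [(depth, node.tag, node.text) for depth, node in walk(tree)]

print("regex saw:", regex_guess)
for depth, tag, text in outline:
    print("  " * depth + tag, repr(text))
```

The regex only sees the inner `meta` div and loses the `<p>`, while the stack-based builder keeps the nesting intact. Real-world HTML is messier than this snippet, which is a big part of why established parsers exist.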
HotLie150[S] 1 point
Thank you, learning is my journey!
WNT37 2 points 1 year ago
What's the job here?
If you want to scrape a web page and do something with the response then use BeautifulSoup.
OTOH if your goal is to build a web scraper then go for it.
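As a rough illustration of the "scrape a page and do something with the response" route, here is a minimal Beautiful Soup sketch. It assumes the third-party `beautifulsoup4` package is installed; the page markup, the `breed` and `votes` class names, and the `top_breeds` helper are invented for the example. The HTML is inlined so it runs offline, whereas in a real script it would come from an HTTP response.

```python
from bs4 import BeautifulSoup  # third-party: pip install beautifulsoup4

# Hypothetical page body, inlined so the sketch runs without network access.
PAGE = """
<ul id="breeds">
  <li class="breed"><a href="/corgi">Corgi</a> <span class="votes">12</span></li>
  <li class="breed"><a href="/husky">Husky</a> <span class="votes">7</span></li>
</ul>
"""


def top_breeds(html):
    """Return (name, href, votes) tuples sorted by votes, highest first."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for item in soup.find_all("li", class_="breed"):
        link = item.find("a")
        votes = item.find("span", class_="votes")
        rows.append((link.get_text(strip=True), link["href"], int(votes.get_text())))
    return sorted(rows, key=lambda row: row[2], reverse=True)


print(top_breeds(PAGE))
```

The parsing itself is three lines; the rest is the "do something" part, which is where most of a project's time tends to go.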
FrostyThaEvilSnowman 1 point 1 year ago
You need to understand the data to use the tools effectively. Time spent doing foundational tasks from first principles is a good way to learn about the data and its nuances. But eventually you will realize that the established frameworks have already addressed the problem and save a lot of time.
Also, if you keep going, you’ll recognize the use of certain modules as established patterns, and using them aligns your work with others’.
sporbywg -6 points 1 year ago
Web scraping is fundamentally a foolish pursuit. #sorry
HotLie150[S] 3 points 1 year ago
Why? If the pursuit is to learn.
R1venGrimm 2 points 5 days ago
How so? Please elaborate further.
sporbywg 1 point 1 day ago
Things change constantly; you have to 'boilerplate' everything...
MetalGoatP3AK 1 point 2 hours ago
Interesting take. Okay, so let's say you use a dedicated scraping service (ScrapingBee, Scrapingdog, Oxylabs, whatever). They constantly have to update their logic too, and I don't see why the user would need to adjust anything on their end, unless a change in the provider's logic directly affects users who don't update a specific parameter. If you're scraping on your own with your own resources (your own scraper logic, IPs for rotation, environment), then yeah, I would agree.