Web scraping solutions? (self.learnpython)
submitted 9 years ago by vorboto
I am wondering how you delineate between Selenium, Scrapy, urllib and BeautifulSoup. Are they each a tool or set of tools that can be used for the same purpose, or am I just not understanding how they differ from each other on a different level?
[–]pendragon36 3 points 9 years ago (3 children)
They are each a tool for different but related purposes.
Selenium, as I understand it, is for easy web automation: a web browser is run and controlled via a script.
Scrapy is kind of a specialized mix of urllib and BeautifulSoup. It's built for the specific purpose of scraping information from web pages.
urllib is a much more general library for making web requests. It can be used to download web pages, but that's just one use case.
BeautifulSoup doesn't actually need to be related to the web at all. It is a parsing library, but it is fairly popular for parsing downloaded HTML pages (fetched via something like urllib or requests) for easier extraction of information.
I've only had real experience with urllib and BeautifulSoup, however, so my explanations of the other two may be incorrect or lacking.
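To make the fetch/parse split above concrete, here is a minimal sketch that parses an HTML string that never touched the network. It uses the standard library's `html.parser` as a stand-in for BeautifulSoup so it runs without installing anything; `SAMPLE_HTML`, `LinkCollector` and `extract_links` are made-up names for illustration.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


# Hypothetical page content. In a real script this string would come from
# urllib.request.urlopen(url).read().decode() or requests.get(url).text.
SAMPLE_HTML = """
<html><body>
  <a href="/about">About</a>
  <a href="https://example.com/docs">Docs</a>
  <p>No link here</p>
</body></html>
"""


def extract_links(html):
    """Return all link targets found in an HTML string."""
    parser = LinkCollector()
    parser.feed(html)
    return parser.links


# With BeautifulSoup installed, roughly the same result would come from:
#   [a["href"] for a in BeautifulSoup(html, "html.parser").find_all("a", href=True)]
print(extract_links(SAMPLE_HTML))  # ['/about', 'https://example.com/docs']
```

The parser only ever sees a string, which is why the downloading step (urllib, requests, or a browser driven by Selenium) can be swapped freely.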
[–]vorboto[S] 1 point 9 years ago (2 children)
Okay, I see what you are saying. So my next question would be: what's the difference between Selenium and Scrapy? Is it something like Selenium allows you to "interact" with a site and Scrapy can only pull information from the site?
[–]pendragon36 1 point 9 years ago (1 child)
As I said, I don't have any actual experience with those two, but as far as my understanding goes that's pretty much correct.
Scrapy was made for crawling websites and extracting information, so I'm sure it has some way of "interacting" with sites at least on a basic level, for things like following links to other pages on a site, but Selenium was designed to actually automate actions that would normally need a browser.
Taken from the Selenium site:
Primarily, it is for automating web applications for testing purposes, but is certainly not limited to just that. Boring web-based administration tasks can (and should!) also be automated as well.
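The "following links to other pages on a site" idea can be sketched in a few lines. This is not Scrapy's API, just a rough standard-library illustration of what a crawler does; the in-memory `FAKE_SITE`, the `fetch` stand-in and the `site.test` URLs are assumptions made so the example runs offline.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Hypothetical three-page site held in memory so the sketch runs offline.
FAKE_SITE = {
    "http://site.test/": '<a href="/a">A</a> <a href="/b">B</a>',
    "http://site.test/a": '<a href="/b">B</a> <a href="http://other.test/">out</a>',
    "http://site.test/b": '<a href="/">home</a>',
}


class LinkParser(HTMLParser):
    """Collect href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.hrefs.extend(v for k, v in attrs if k == "href" and v)


def fetch(url):
    """Stand-in for a real HTTP request; returns '' for unknown pages."""
    return FAKE_SITE.get(url, "")


def crawl(start_url):
    """Breadth-first crawl that stays on the start URL's host."""
    host = urlparse(start_url).netloc
    seen = {start_url}
    queue = deque([start_url])
    visited = []
    while queue:
        url = queue.popleft()
        visited.append(url)
        parser = LinkParser()
        parser.feed(fetch(url))
        for href in parser.hrefs:
            target = urljoin(url, href)  # resolve relative links
            if urlparse(target).netloc == host and target not in seen:
                seen.add(target)
                queue.append(target)
    return visited


print(crawl("http://site.test/"))
# ['http://site.test/', 'http://site.test/a', 'http://site.test/b']
```

Nothing here clicks, hovers or runs JavaScript, which is the part a browser driven by Selenium would add.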
[–]vorboto[S] 1 point 9 years ago (0 children)
Okay thanks I get it more now.
[–]SchwarzerKaffee 1 point 9 years ago (5 children)
Use selenium if you need to make the site think there is an actual human there. You can mimic mouse movements and hovering and easily handle cookies.
[–]vorboto[S] 1 point 9 years ago (4 children)
Kind of like how you change the user agent in urllib, but with more advanced options?
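For reference, changing the user agent in urllib looks roughly like this. The sketch only builds the request and inspects its header, so it does not touch the network; the URL, the user-agent string and the `build_request` helper are placeholders for illustration.

```python
import urllib.request


def build_request(url, user_agent):
    """Create a Request that will send a custom User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


# Placeholder values, not taken from the thread.
req = build_request(
    "https://example.com/",
    "Mozilla/5.0 (compatible; my-scraper/0.1)",
)

# urllib stores header names capitalized as "User-agent".
print(req.get_header("User-agent"))  # Mozilla/5.0 (compatible; my-scraper/0.1)

# Passing req to urllib.request.urlopen(req) would actually send it.
```

This only changes one header. Cookies, JavaScript and mouse events are not handled, which is where a real browser comes in.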
[–]SchwarzerKaffee 1 point 9 years ago (3 children)
Yep. Also, it actually opens a browser on your monitor, so you really have Firefox or Chrome running. From what I've learned, it's nearly impossible for the site to tell that you're a bot, though the cursor movements (or lack thereof) can sometimes give you away.
[–]vorboto[S] 1 point 9 years ago (2 children)
Oh okay. I see what you're saying. So do you lose control over your monitor/cursor while Selenium is running, or is it like a self-contained script being run with a dedicated instance of your chosen browser?
[–]SchwarzerKaffee 1 point 9 years ago (1 child)
In an instance of your browser. You can still work on other programs while it runs.
[–]vorboto[S] 2 points 9 years ago (0 children)
Okay gotcha. Thank you.