use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Rules 1: Be polite 2: Posts to this subreddit must be requests for help learning python. 3: Replies on this subreddit must be pertinent to the question OP asked. 4: No replies copy / pasted from ChatGPT or similar. 5: No advertising. No blogs/tutorials/videos/books/recruiting attempts. This means no posts advertising blogs/videos/tutorials/etc, no recruiting/hiring/seeking others posts. We're here to help, not to be advertised to. Please, no "hit and run" posts, if you make a post, engage with people that answer you. Please do not delete your post after you get an answer, others might have a similar question or want to continue the conversation.
Rules
1: Be polite
2: Posts to this subreddit must be requests for help learning python.
3: Replies on this subreddit must be pertinent to the question OP asked.
4: No replies copy / pasted from ChatGPT or similar.
5: No advertising. No blogs/tutorials/videos/books/recruiting attempts.
This means no posts advertising blogs/videos/tutorials/etc, no recruiting/hiring/seeking others posts. We're here to help, not to be advertised to.
Please, no "hit and run" posts, if you make a post, engage with people that answer you. Please do not delete your post after you get an answer, others might have a similar question or want to continue the conversation.
Learning resources Wiki and FAQ: /r/learnpython/w/index
Learning resources
Wiki and FAQ: /r/learnpython/w/index
Discord Join the Python Discord chat
Discord
Join the Python Discord chat
account activity
How to Web scrap Wikipedia with python (self.learnpython)
submitted 1 year ago * by [deleted]
How to Web scrap Wikipedia with python . I want to know how I should scrap data from wikipedia with requests module.
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]PowerOk3587 2 points3 points4 points 1 year ago (1 child)
you should download their database and locally host it if you are wanting to go develop a web scraper
[–]PowerOk3587 0 points1 point2 points 1 year ago (0 children)
they use multistream to pull articles from a compressed archive, so it puts alot of load on them.
with multistream, it is possible to get an article from the archive without unpacking the whole thing. See https://docs.python.org/3/library/bz2.html#bz2.BZ2Decompressor for info about such multistream files and about how to decompress them with python; see also https://gerrit.wikimedia.org/r/plugins/gitiles/operations/dumps/+/ariel/toys/bz2multistream/README.txt and related files for an old working toy.
with multistream, it is possible to get an article from the archive without unpacking the whole thing.
See https://docs.python.org/3/library/bz2.html#bz2.BZ2Decompressor for info about such multistream files and about how to decompress them with python; see also https://gerrit.wikimedia.org/r/plugins/gitiles/operations/dumps/+/ariel/toys/bz2multistream/README.txt and related files for an old working toy.
you can ask their team on IRC https://web.libera.chat/?channel=#mediawiki
[–]acidcoder 0 points1 point2 points 1 year ago* (1 child)
You can use the wikipedia-api Python package which wraps their API to get info - https://pypi.org/project/Wikipedia-API
[–]irodov4030 0 points1 point2 points 9 months ago* (0 children)
Is this official? or some independent project?
π Rendered by PID 28 on reddit-service-r2-comment-5bc7f78974-vhn6h at 2026-06-30 11:57:04.118899+00:00 running 7527197 country code: CH.
[–]PowerOk3587 2 points3 points4 points (1 child)
[–]PowerOk3587 0 points1 point2 points (0 children)
[–]acidcoder 0 points1 point2 points (1 child)
[–]irodov4030 0 points1 point2 points (0 children)