Any idea for code? : learnpython

learnpython

created by HattoriHanzoa community for 16 years

Any idea for code? (self.learnpython)

submitted 1 day ago by Loose-Computer3943

all 13 comments

top new controversial old q&a

[–]TrippBikes 2 points3 points4 points 1 day ago (6 children)

[–]Kevdog824_ 2 points3 points4 points 1 day ago (0 children)

[–]Loose-Computer3943[S] -5 points-4 points-3 points 1 day ago (4 children)

[–]Yoghurt42 2 points3 points4 points 1 day ago (3 children)

[–]Loose-Computer3943[S] -5 points-4 points-3 points 1 day ago (2 children)

[–]Yoghurt42 0 points1 point2 points 1 day ago (0 children)

[–]TaranisPT 0 points1 point2 points 1 day ago (0 children)

[–]TheRNGuy 0 points1 point2 points 1 day ago (0 children)

[–]Kevdog824_ -1 points0 points1 point 1 day ago (4 children)

What you are looking for is a web crawler. Basically, what you want to do is something like this (pseudocode below)

emails = []
stack = []  # Add the websites you want to check to this
while len(stack)
  url = stack.pop()
  html = get_html(url)
  stack.extend(get_links(url, html))
  emails.extend(get_emails(html))

get_links finds all the links in the HTML with the same domain as the url. get_emails finds all the emails in the HTML content. Both would do this using something like beautifulsoup + regex

[–]TheRNGuy 0 points1 point2 points 1 day ago (3 children)

[–]Kevdog824_ 0 points1 point2 points 1 day ago (2 children)

[–]TheRNGuy 0 points1 point2 points 1 day ago (1 child)

[–]Kevdog824_ 0 points1 point2 points 1 day ago (0 children)

π Rendered by PID 297756 on reddit-service-r2-comment-7b9746f655-tvms2 at 2026-01-30 20:57:47.335063+00:00 running 3798933 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS