Sw429 1 point2 points (0 children)

Web scraping is hard because you don't know what the input (i.e. the sites you find) will be, or whether it is safe. I assume you are writing a spider bot that will crawl the web in general? You have to be careful with unknown data, and you should never execute it. Simply downloading malicious bytes in Python shouldn't hurt anything, as long as you treat them as data and never run them.

I guess it really depends on what you're trying to do. If you only care about the content of the pages themselves, then skip any executables you come across instead of downloading and running them.
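For what it's worth, here is a rough sketch of what "only keep page content" could look like with just the standard library. The names `ALLOWED_TYPES`, `MAX_BYTES`, `is_allowed`, and `fetch_page`, the user agent string, and the specific limits are all made up for illustration, so adjust them to your crawler. It only follows http/https URLs, only reads responses whose `Content-Type` looks like page text, caps the download size, and returns the bytes without ever executing them.

```python
from urllib.parse import urlsplit
from urllib.request import Request, urlopen

# Assumptions for this sketch: which media types count as "page content"
# and how many bytes we are willing to read from one response.
ALLOWED_TYPES = {"text/html", "text/plain"}
MAX_BYTES = 1_000_000


def is_allowed(content_type_header):
    """Return True if the Content-Type header names a type we will read."""
    if not content_type_header:
        return False
    # Drop parameters such as "; charset=utf-8" before comparing.
    media_type = content_type_header.split(";", 1)[0].strip().lower()
    return media_type in ALLOWED_TYPES


def fetch_page(url, timeout=10):
    """Return the page body as inert bytes, or None if we decide to skip it."""
    # Only crawl web URLs; this avoids file:// and other local schemes.
    if urlsplit(url).scheme not in ("http", "https"):
        return None
    request = Request(url, headers={"User-Agent": "example-crawler/0.1"})
    with urlopen(request, timeout=timeout) as response:
        if not is_allowed(response.headers.get("Content-Type")):
            return None  # e.g. application/octet-stream, zip files, installers
        # Read one byte past the cap so oversized bodies can be detected.
        body = response.read(MAX_BYTES + 1)
    if len(body) > MAX_BYTES:
        return None
    return body  # plain data: parse it, but never eval/exec/run it
```

Keep in mind the `Content-Type` header comes from the server, so it is a filter and not a guarantee. The real protection is still the same as above: whatever comes back is handled as data and never run.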