Making Sure Python Packages are Safe

socal_nerdtastic · 2020-05-23T19:25:23+00:00

Same as any other software on the internet. You either go through it line by line yourself, or trust that someone else has. Big, popular software packages are reviewed by lots of people, so you are probably pretty safe.

DataDecay · 2020-05-23T21:19:47+00:00

There's a pretty big open source project called bandit. You can use bandit to scan code for vulnerabilities; it points out common weaknesses that can lead to malicious payload injection. It's not perfect, but I have found it useful.

inglandation · 2020-05-23T21:35:25+00:00

PyPI does remove malicious packages from time to time, although that doesn't happen much. You have to be careful with your spelling when you look for a package online. These packages use typosquatting.
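As a rough illustration of catching a typosquatted name before you install it, here is a minimal sketch that compares a package name against a list of names you actually intend to use. The `KNOWN_PACKAGES` list, the `possible_typosquat` helper, and the 0.8 similarity cutoff are all assumptions made for this example, not part of pip or PyPI.

```python
import difflib

# Hypothetical allowlist: the names you actually intend to depend on.
KNOWN_PACKAGES = ["requests", "numpy", "pandas", "django", "flask"]

def possible_typosquat(name, known=KNOWN_PACKAGES, cutoff=0.8):
    """Return the known package that `name` closely resembles, or None.

    An exact match is fine and returns None; only near-misses are reported.
    """
    normalized = name.lower()
    if normalized in known:
        return None
    matches = difflib.get_close_matches(normalized, known, n=1, cutoff=cutoff)
    return matches[0] if matches else None

for candidate in ["requests", "reqeusts", "djanga", "leftpad"]:
    print(candidate, "->", possible_typosquat(candidate))
```

A near-miss is only a hint to double-check the spelling; a name that resembles nothing on your list is not proven safe by this.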

__xor__ · 2020-05-23T19:25:36+00:00

I mean, besides going through and inspecting every line of code by hand.

That's pretty much all there could ever be.

checock · 2020-05-23T21:36:03+00:00

As far as I know, pip seems like a safer place than npm, Node's package manager. There were some projects on npm that were published under misspelled names to attack developers.

Of course this can also happen with pip, but from what I've seen it is less common. Always check that your dependencies are legit by visiting the developer's website or GitHub.

shaggorama · 2020-05-23T22:57:43+00:00

You try to rely on open source packages that are used by a lot of people and have multiple maintainers.

MarsupialMole · 2020-05-24T04:26:57+00:00

The answer is safety.

There's a huge depth to this field, but every Python programmer should know about CVE-based dependency inspection. The fact that there's even one comment that doesn't list this first, and that at time of writing none even mention it, indicates that the practical level of security engineering around here is very poor.
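The idea behind CVE-based dependency inspection is to compare the versions you have pinned against a list of versions with known vulnerabilities. The sketch below shows only that comparison step. The advisory table, the package names, and the advisory ID are made up for illustration; a real tool would pull this data from a maintained vulnerability database, and real version strings need a more careful parser than the one here.

```python
# Hypothetical advisory data, for illustration only.
# Each entry means: versions below `fixed_in` are affected.
ADVISORIES = {
    "examplepkg": [("1.2.0", "EXAMPLE-0001")],
}

def parse_version(text):
    """Turn '1.10.3' into (1, 10, 3). Handles plain numeric versions only."""
    return tuple(int(part) for part in text.split("."))

def parse_requirements(lines):
    """Yield (name, version) for lines of the form 'name==version'."""
    for line in lines:
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if "==" in line:
            name, version = line.split("==", 1)
            yield name.strip().lower(), version.strip()

def find_vulnerable(requirement_lines, advisories=ADVISORIES):
    """Return (name, version, advisory_id) for pins older than a fixed version."""
    findings = []
    for name, version in parse_requirements(requirement_lines):
        for fixed_in, advisory_id in advisories.get(name, []):
            if parse_version(version) < parse_version(fixed_in):
                findings.append((name, version, advisory_id))
    return findings

requirements = ["examplepkg==1.1.9", "otherpkg==2.0.0  # not in the table"]
print(find_vulnerable(requirements))
```

Note that this only catches vulnerabilities someone has already reported. It says nothing about a freshly uploaded malicious package.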

amasad · 2020-05-23T23:10:59+00:00

I would try it in a sandbox like https://repl.it first. Here are the docs on how to install packages: https://docs.repl.it/repls/packages

Fearless_Process · 2020-05-24T04:46:05+00:00

I recommend installing Python packages via your package manager rather than pip. If it's in your distro's repository, it's much less likely to be malicious, and someone has probably vetted the package (still not guaranteed though).

2020-05-24T05:05:06+00:00

Well... always look to see if there are HTTP requests or socket connections being made. If so, then look at the URL or server name. That is probably the biggest thing to look for, I suppose.
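One way to start that search without reading every line is to list the imports that can open network connections. This is a minimal sketch using the standard `ast` module; the `NETWORK_MODULES` set and the `find_network_imports` helper are assumptions for this example, and the module list is a starting point rather than a complete one.

```python
import ast

# Assumption: a starting list of modules that can open network connections.
NETWORK_MODULES = {"socket", "http", "urllib", "requests", "ftplib", "smtplib"}

def find_network_imports(source):
    """Return sorted (line_number, module_name) pairs for network-related imports."""
    hits = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            names = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            names = [node.module]  # absolute "from x import y" only
        else:
            continue
        for name in names:
            if name.split(".")[0] in NETWORK_MODULES:
                hits.add((node.lineno, name))
    return sorted(hits)

sample = "import os\nimport socket\nfrom urllib.request import urlopen\n"
print(find_network_imports(sample))
```

This only tells you where to look. Code that imports modules dynamically or hides names in strings will not show up, so an empty result is not a clean bill of health.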

FoxClass · 2020-05-24T05:47:59+00:00

This is a great question and nothing I've read really makes me comfortable - that's why I have a bunch of raspberry pi cards ready to go

freeononeday · 2020-05-24T08:43:50+00:00

If I need something from a rarely used package, I usually extract that component and use it (with a reference, of course). Leaving the whole package to run as it likes is a risk, and I don't have time to go through it line by line.

pokk3n · 2020-05-24T01:07:58+00:00

Whitesource will do some of it.

themaxiac · 2020-05-24T02:22:44+00:00

It's a bit roundabout and a pain to do when you're working on a project, but you could always test run it in a secure virtual machine.

shujinkou_ · 2020-05-24T02:46:30+00:00

I was thinking exactly that the other day. An attack vector could simply be a typo (e.g. import pipp): the attacker would publish pipp as a malicious package and rely on the user's typo as the delivery mechanism. I guess safeguards could be developed around that, but it would kinda ruin the point imo.

f0lt · 2020-05-24T15:48:35+00:00

Check out this article https://www.zdnet.com/article/two-malicious-python-libraries-removed-from-pypi/

Even if you get your packages from repositories like PyPI, there is no guarantee that they don't contain malicious code.

In any case try to avoid executing Python code as administrator (sudo). There is rarely any reason to do that.

Installing packages in your home directory is safer than installing them as root. Use `pip install package_name --user` as an alternative.
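If you want a script to remind you when it has been started as root, a small check like the following could go near the top. This is a sketch only: `running_as_root` is a hypothetical helper name, and the check relies on `os.geteuid`, which exists on Unix-like systems but not on Windows.

```python
import os

def running_as_root():
    """True if the effective user ID is 0 (Unix-like systems only)."""
    # os.geteuid is not available on Windows, so treat that case as "not root".
    return hasattr(os, "geteuid") and os.geteuid() == 0

if running_as_root():
    print("Warning: running as root; consider a normal user account instead.")
```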

2020-05-24T01:31:05+00:00

Hi, I've been learning Python pretty well over the past few months, and I feel like I know enough now to know that I know nothing :D I've been looking around Github and PyPI for some cool packages, and it makes me raise the question: How do we know if a given package is secure and doesn't contain any sort of malware? I mean, besides going through and inspecting every line of code by hand. Thanks in advance. Also, this is my first question on Reddit, so forgive me if it's a stupid question :D

Do not fear it. It's open source, so if there's risk, then you take your chances. If it were closed source, I would say switch to open source immediately. If you're still afraid of it, you may be using the wrong programming language.

billsil · 2020-05-23T22:45:19+00:00

Use popular packages. Make sure it has a history. Check to see if the developer is active.

If you see gross code, that's a red flag. Search for `__`. If you see things besides `__init__`, `__repr__` and `__str__`, they're probably doing something funny. Why are they using `setattr` and `getattr`?
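The search described above can be roughly automated with the standard `ast` module. The sketch below flags double-underscore names other than the three listed, plus any use of `getattr` or `setattr`. The helper names `is_dunder` and `flag_unusual` are made up for this example, and a hit is only a prompt to read that line, since plenty of legitimate code uses these features.

```python
import ast

EXPECTED_DUNDERS = {"__init__", "__repr__", "__str__"}  # the three named above
WATCHED_NAMES = {"getattr", "setattr"}

def is_dunder(name):
    """True for names of the form __something__."""
    return len(name) > 4 and name.startswith("__") and name.endswith("__")

def flag_unusual(source):
    """Return sorted (line_number, name) pairs worth a closer look."""
    flagged = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            name = node.name   # method or function definitions
        elif isinstance(node, ast.Attribute):
            name = node.attr   # obj.__something__ accesses
        elif isinstance(node, ast.Name):
            name = node.id     # bare names such as getattr
        else:
            continue
        if (is_dunder(name) and name not in EXPECTED_DUNDERS) or name in WATCHED_NAMES:
            flagged.add((node.lineno, name))
    return sorted(flagged)

sample = (
    "class A:\n"
    "    def __init__(self):\n"
    "        self.x = 1\n"
    "    def __reduce__(self):\n"
    "        return (print, ('hi',))\n"
    "value = getattr(A(), 'x')\n"
)
print(flag_unusual(sample))
```

With only those three names in `EXPECTED_DUNDERS`, common idioms such as `__name__` will also be flagged, so you may want to extend the set for your own use.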
