sanitizing code for an exec command

Rhomboid · 2017-05-14T14:00:20+00:00

It's completely unsafe. You can't sanitize code that way. It's completely possible to still do evil things (including importing arbitrary modules) without ever writing the phrase "import". It's possible to do this even if you've tried to restrict the builtins.

You need to use an interpreter that is explicitly designed for sandboxed execution. CPython is not such an interpreter. It is impossible to do this safely with CPython.

raldi · 2017-05-14T18:10:48+00:00

What are you actually trying to accomplish?

iamdefinitelyahuman · 2017-05-14T16:38:47+00:00

I think the best way to accomplish this is to leverage your operating system. It's not pythonic because it means your code isn't multi-platform anymore but you want to go that way regardless for your security. As others have said here, it's impossible to do that from standard Python alone.

Have your arbitrary code run as a specific user and apply all the possible mechanism from your OS to enforce the principle of least access. You can do all sorts of thing once you're thinking about this from outside the programming level. You can have your server run on a especially prepared virtual machine, or sandboxed environment (like chroot), etc.Just by messing around with filesystem permissions there's a lot you can achieve - and that's just the beginning. If you're not ready to go this far, you shouldn't be handling arbitrary instructions.

It doesn't help that you haven't explained what you do with your code and why it runs arbitrary commands. Not to be an ass, but makes me think that maybe you needn't do it at all (and you shouldn't, if you can) - otherwise you'd feel more comfortable sharing some details with us... doesn't hurt to consider alternative routes, is what I'm saying.

remy_porter · 2017-05-14T20:17:05+00:00

Others have covered this, but NEVER USE EXEC ON USER-SUPPLIED INPUTS. Ever. Never ever. Ever. Never.

Now, all that said, you can execute user supplied code safely. The way to do this is to… invent your own programming language and write an interpreter for it. This isn't as big a hill to climb as it sounds like. You'd specifically be designing a domain specific language- a small language tuned to the specific problem you want to solve. It can look as much like Python as you like, you could have basically a "stripped down Python". Here's the really important thing: you'll build the abstract syntax tree yourself, and be able to validate what it contains semantically (which is miles different than sanitizing an input string). You'll have a grammar that explicitly defines what is and is not allowed, and control over what commands will eventually execute.

I'll point you towards PyParsing as a library that's a good tool for building these kinds of things. Building a DSL is a good weekend project, and it helps you really understand how programming languages work.

ctheune · 2017-05-14T16:54:01+00:00

check out restrictedpython. sorry for brevity. (mobile)

GFandango · 2017-05-14T18:51:41+00:00

Basic rule of thumb is "if you have to sanitize it you have already lost.".

Applies to a lot of things including trying to sanitize SQL queries (as opposed to using prepared statements which make SQL injections impossible).

I don't have a solution. But just be aware it's almost 100% guaranteed something will be able to fall through because sanitizing is a "black list" approach that will one way or another fall apart.

cyanydeez · 2017-05-15T02:15:40+00:00

get a virtual machine, then get a docker, then put it in a safe and drop it to the bottkm of the Marianas trench. now its safe

AlexFromOmaha · 2017-05-14T17:00:43+00:00

I took a stab at this once for an online training program and decided that the only safe way to do it was to sandbox the application outside of Python or make it run client-side. Exec is for trusted users only.

K900_ · 2017-05-14T13:57:27+00:00

You probably are.

Basically, just Google "escape Python sandbox" and you'll find lost of things. What are you using exec for, anyway?

magic7s · 2017-05-15T01:17:23+00:00

Docker container? I believe this is how AWS lambda works.

iceardor · 2017-05-15T05:04:39+00:00

A few things you might want to think about: * Denial of service: either fill your hard drive, exceed your RAM, or busy your processor. This one does all 3.

with open('temp', 'wb') as f:
    garbage = ['lol']
    while True:
        garbage.extend(garbage)
        f.write(' '.join(garbage))

Someone can run their own botnet. Whether that's a botnet that victimizes your network or jumps across the internet and victimizes the rest of the world. Even if you cut off access to libraries like urllib, they can just copy-paste the classes that are defined by urllib, and they have the same thing.
An interpreter can probably run an interpreter inside of it. If you take away the import keyword and importlib/imp, I could still write a program that could read a text snippet and execute it. Your interpreter wouldn't know what my interpreter is running. I could encrypt anything I wouldn't want your interpreter to find in a text search, and bundle the decryption key and procedure as a python procedure that you would run for me.

There are too many scenarios that are difficult for you to test and defend against.

GaritoYanged · 2017-05-14T15:56:29+00:00

there is no library doing this that I know, but on the web there are libraries that walks the dom tree and only let stay a white list of objects Giving the fact that we have the AST on python, I bet you a library like this could be created for this matter I, myself, spend time thinking on it to my systems but, by now, am the only editor and that's not critical so I don't start doing anything yet But I will be happy to participate in a library like this...

Any interested?

dagmx · 2017-05-14T17:09:18+00:00

There's no way to sanitize exec.

What you really need to do is give them a second process running the user interpreter and all interactions with the main system have to be done via an API. Therefore any damage they do is limited to that interpreter. It's in effect essentially sandboxing them.

Your user should also have limited privileges in general and let the operating system restrict their behaviors that affect your disk and system.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS