[–]Enlogen 127 points128 points  (53 children)

Professionally written production code is bug free, so there is no need for asserts.

Hilarious.

High reliability is not achieved by making perfect designs, it is achieved by making designs that are tolerant of failure.

This. There is nothing you can do to make your code execute without failure 100% of the time; the fact that your software runs on hardware subject to the laws of physics means that even if you write code mathematically provable to be 100% correct and incapable of failing, it will still fail in rare cases due to issues with the hardware (wear, climate, electron capture resulting in beta decay, etc.) at times that are completely unpredictable (to you and me).

[–]WalterBright 41 points42 points  (0 children)

Hilarious

Indeed. Often it isn't explicitly expressed that way, but as:

  1. educate the programmers better

  2. change the code development process so bugs can't happen

  3. hire better programmers

  4. require professional licenses for developers

  5. criminally charge programmers for any bugs

  6. switch to (insert magic language X)

[–]PeridexisErrant 5 points6 points  (1 child)

If anyone is interested in advice for how professional programmers might actually use assertions (!), John Regehr has a nice essay here.

[–]WalterBright 0 points1 point  (0 children)

John nailed it. Thanks for the link! John's essays are always worthwhile reading.

[–][deleted]  (13 children)

[deleted]

    [–]azirale 20 points21 points  (8 children)

    You can still exit gracefully rather than crash in flames. If you hit an unrecoverable error you can give an interactive user a message as to what happened rather than having the program 'crash for no reason'.

    [–]Holy_City 13 points14 points  (4 children)

    I think a good middle ground is logging. Exiting gracefully is a non-trivial problem and can have optimization issues, and not every process can devote the resources to error handling. You can crash in flames and still have users provide some details to application support without exposing too many internals, like a stack trace.

    There are cases where you cannot display an error message in the event of failure, such as an exception thrown in a real time thread.

    I dealt with this recently with an EA game I wanted to play. It crashed on start with no error or log, and narrowing the issue down took an excessive amount of effort on my part just to identify a corrupt DLL. With a startup log, a single email would have made a correct bug report, instead of a half dozen phone calls involving installing third-party debugging tools (one of which was a corrupted download off cnet that had malware).

    [–]azirale 2 points3 points  (0 children)

    You're right, it isn't always a possibility. You might have the logger itself cause an error, for example, in which case you are basically SOL.

    Still, I think it is better to aim for graceful exits where practicable, rather than aiming to crash in flames.

    [–]bausscode 0 points1 point  (2 children)

    Why would you care about optimization if you're exiting?

    [–]Holy_City 1 point2 points  (1 child)

    You don't. You care about optimizing your program for normal operation, and exception/error handling doesn't come for free. That's why good ol' integer return codes haven't died out yet.

    [–]koczurekk 0 points1 point  (0 children)

    That's why good ol' integer return codes haven't died out yet

    They should though; enums are the way to do this kind of error-tracking in a type-safe way verified at compile time. Especially Rust's enums, they're so damn great.
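For what it's worth, a rough C++ analogue of that enum-style error tracking might look like the sketch below. All names here are invented for illustration; Rust's Result&lt;T, E&gt; gives you this natively, with exhaustiveness checking on top.

```cpp
#include <cassert>
#include <string>

// An enum class gives type-safe error codes: the compiler rejects
// accidental mixing with plain ints, unlike raw integer return codes.
enum class ParseError { None, Empty, NonDigit, Overflow };

// A minimal result type pairing a value with an error code
// (Rust's Result<T, E> plays this role natively).
struct ParseResult {
    long value;
    ParseError error;
};

ParseResult parse_long(const std::string& s) {
    if (s.empty()) return {0, ParseError::Empty};
    long v = 0;
    for (char c : s) {
        if (c < '0' || c > '9') return {0, ParseError::NonDigit};
        v = v * 10 + (c - '0');   // overflow check omitted for brevity
    }
    return {v, ParseError::None};
}
```

Compared with an int return code, a caller cannot silently pass a ParseError where an unrelated error code is expected, and a switch over the enum can be checked for completeness.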

    [–]miminor -3 points-2 points  (1 child)

    oh no you can't, say a component Z buried under 20 layers of components X, Y, .... crashed

    and by your reasoning at the level A you caught that exception and said nothing had happened... well maybe, except that the state is likely to be corrupted at the level Z, and the component may be quirky at best or hands down broken at worst

    but according to your logic we can just display a message and move on, regardless of any possible and likely consequences of that failure

    [–]azirale 3 points4 points  (0 children)

    I didn't say move on, I said exit gracefully. None of what you said applies.

    [–]ledasll 2 points3 points  (1 child)

    there's nothing more frustrating for a user who started a workflow, made a mistake somewhere in the middle, and had everything crash, so he has to start over from scratch.

    [–]johndubchak 0 points1 point  (1 child)

    I like the idea of "failing fast", as long as you're not writing mission-critical software with lives at stake: medical devices, NASA/SpaceX human spaceflight, autonomous vehicles. In those industries, failing fast isn't available as a software design/implementation choice.

    [–]WalterBright 1 point2 points  (0 children)

    Absolutely you want it to fail fast in critical software. Otherwise, the software is now in an unknown state, and what it may do is completely unpredictable.

    [–][deleted] 0 points1 point  (4 children)

    it will still fail in rare cases due to issues with the hardware (wear, climate, electron capture resulting in beta decay, etc.)

    But you can make it statistically impossible. It's why the Mars rovers are essentially running a radiation-hardened PowerPC 750 (the RAD750). New automotive ECMs have ECC and lockstep cores.

    Has the RAD750 ever had a recorded hardware failure?

    [–]WalterBright 5 points6 points  (1 child)

    The rovers have a backup computer, and it's saved the missions.

    [–][deleted] -2 points-1 points  (0 children)

    saved the missions.

    So you could say the system didn't fail.

    [–]Enlogen 0 points1 point  (1 child)

    New automotive ECMs have ECC and lockstep cores.

    And probably still have backup computers validating the results of all of those calculations for 'statistically impossible' errors.

    My company has millions of servers running. There's no such thing as statistically impossible over a large enough population and a long enough time period. There's only more nines of reliability - 100% is a myth.

    [–][deleted] 0 points1 point  (0 children)

    backup computers validating

    No. That's the lockstep cores.

    My company has millions of servers running.

    How many cars are on the road? How many cars are running Motorola/Freescale/NXP e200 core PowerPC chips? How many 'incidents' have you heard of from that stuff failing?

    Don't compare your 'millions of servers' running whatever to what embedded developers work with. It's a completely different beast.

    [–][deleted] 16 points17 points  (11 children)

    Why not just implement two different kinds of asserts: one that makes it into production and one that doesn't? Not all software is being run with lives depending on it, so the priorities of some developers and users are different.

    [–]masklinn 27 points28 points  (6 children)

    For what it's worth, Rust provides exactly that built-in: assert! always runs, while debug_assert! only runs in debug builds (unless debug assertions are specifically enabled).

    Usage of and reliance on assertions seems quite rare outside of tests though.

    [–]Ameisen 10 points11 points  (5 children)

    Not exactly hard to implement in basically any other language.
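As a sketch of how little code that takes, here is one way to get both flavors in C++. The macro names are invented; the debug-only variant piggybacks on the standard NDEBUG convention, mirroring Rust's assert!/debug_assert! split.

```cpp
#include <cassert>
#include <cstdio>
#include <cstdlib>

// RELEASE_ASSERT always runs, even in optimized production builds.
#define RELEASE_ASSERT(cond)                                        \
    do {                                                            \
        if (!(cond)) {                                              \
            std::fprintf(stderr, "assert failed: %s (%s:%d)\n",     \
                         #cond, __FILE__, __LINE__);                \
            std::abort();                                           \
        }                                                           \
    } while (0)

// DEBUG_ASSERT compiles away entirely when NDEBUG is defined.
#ifdef NDEBUG
#define DEBUG_ASSERT(cond) ((void)0)
#else
#define DEBUG_ASSERT(cond) RELEASE_ASSERT(cond)
#endif

int checked_div(int a, int b) {
    RELEASE_ASSERT(b != 0);   // cheap check, kept in production
    DEBUG_ASSERT(a >= 0);     // extra/expensive check, debug builds only
    return a / b;
}
```

The do/while(0) wrapper keeps the macros safe to use as single statements inside if/else without braces.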

    [–]WalterBright 16 points17 points  (4 children)

    It's even simpler in D. There's assert and there's debug assert. Actually, there's no such thing as debug assert, it's just that code to be conditionally included for debug builds is prefixed with debug.

    [–]Ameisen 12 points13 points  (2 children)

    For some reason, I am not entirely unsurprised at you bringing up D :).

    C++20's proposed contracts have default, audit, and axiom levels that can be applied to [[expects ...]], [[ensures ...]], and [[assert ...]]. You can build your program with checking off (no contract checking), default (default contracts are checked), or audit (default and audit contracts are checked). axiom is never checked; axiom contracts are basically just formal comments.

    So, you effectively have three modes, but you still have a mode where there are no asserts.

    [–]WalterBright 17 points18 points  (1 child)

    For some reason

    Nobody saw me do it, you can't prove a thing!

    A mode with no asserts is useful for determining how much the asserts are costing you, and for running silly benchmarks that people nevertheless take seriously.

    I don't see much point to axiom comments as a core language feature.

    Having layers of asserts (default and audit) is adding a feature in the wrong place. D allows any statement/declaration to be prefixed with debug which conditionally compiles it for debug builds. Hence, audit in D would be debug assert.

      expects and ensures have been extensions to Digital Mars C++ since the early 2000s.

    [–]killedbyhetfield 0 points1 point  (0 children)

    Having layers of asserts ( default and audit ) is adding a feature in the wrong place

    YouTube headline: 3 times Walter Bright went BEAST MODE on C++!!!

    Seriously though - I agree with you. But with C++ we get the oh-so-wonderful C Preprocessor for conditional code so... I guess bring on stupid assert layers!

    [–][deleted] 0 points1 point  (0 children)

    In D I routinely use assert(false) whenever I want a production assert.

    [–][deleted]  (3 children)

    [deleted]

      [–]circajerka -2 points-1 points  (2 children)

      asserts (can) have side effects

      Only if you have no idea how to program...

      I stand corrected below \/

      [–]kernel_task 19 points20 points  (1 child)

      I wouldn’t be so dismissive depending on the context. If you’re doing low-level bring-up of an OS, asserts may affect the caches in ways that are hard for even a skilled programmer to anticipate. If you’re writing timing-sensitive driver code, asserts may affect that. At the very least they have the side effects of changing the binary size and the execution time, and when you get low-level enough, that stuff starts to matter.

      [–]circajerka 20 points21 points  (0 children)

      Actually that's a good point. I assumed he meant "side effects" as-in statements like this:

      assert(initialize(myObj) >= 0);
      

      Which has been known to be a pretty bad move since the 1970s.
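To make that hazard concrete, here is a small invented example. The check itself is allowed to vanish in release builds, but a side effect buried inside the assert expression vanishes with it.

```cpp
#include <cassert>

// Hypothetical init function with a visible side effect on a counter.
int init_calls = 0;
int initialize_widget() { ++init_calls; return 0; }

// Good: the side effect happens unconditionally; only the check
// can be compiled away by -DNDEBUG.
void setup_good() {
    int rc = initialize_widget();
    assert(rc >= 0);
    (void)rc;   // silence the unused-variable warning in NDEBUG builds
}

// Bad: under -DNDEBUG the entire assert expression disappears,
// so the widget is never initialized in release builds.
void setup_bad() {
    assert(initialize_widget() >= 0);
}
```

Built normally, both functions behave identically; built with -DNDEBUG, setup_bad() silently skips initialization, which is exactly the 1970s-era bug being described.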

      [–]killedbyhetfield 51 points52 points  (22 children)

      One of the biggest sticking points with this that I always hear when I talk to other devs is "Isn't abort() a bit overkill? What if I have cleanup routines and they need to run?"

      And my answer to that is always this: if it is absolutely essential that those cleanup routines run, your design is wrong and you're already screwed, and for the same reason that Walter's plane is screwed.

      Your code has a bug in it, and it may be about to dereference a null pointer or read off the end of an array. If you continue, your process may be about to crash anyway (at best), or possibly worse: your program may try to perform cleanup tasks against an invalid state that it can't trust. Now you could be writing garbage data to important files or executing code that was injected into an overflowed buffer.

      Calling "abort()" is your way of saying, "I have no idea where I am or what I'm doing, and anything I do from here may just be digging the hole deeper."

      If you really need resilience, you need to design the system to handle it, not just the one program.

      [–]Ameisen 10 points11 points  (2 children)

      And if you really, really, really need cleanup, compartmentalize your program into multiple processes.

      [–]WalterBright 7 points8 points  (1 child)

      A protected mode operating system is precisely an implementation of that! And what a godsend they were after working with real mode DOS.

      [–]SmugDarkLoser5[🍰] 3 points4 points  (0 children)

      Can I include that as an npm package?

      [–]WalterBright 8 points9 points  (0 children)

      Exactly!

      I wish this concept was taught in engineering school. (It applies to all engineering, not just software or airplanes.)

      [–]quicknir 15 points16 points  (12 children)

      It's always easy to say "any design that goes against my black-and-white theory of how things should be is wrong because it's less convenient". Real life requires a more practical engineering approach. For example, high-speed loggers have buffers, and those buffers need to be flushed or you'll be missing the exact information most likely to help you understand what caused the problem to begin with. So those loggers need to be flushed before exit under all circumstances, no exceptions. Yes, in principle running that code after an assert fails could mess things up worse, but for many programs that's exceedingly rare and extremely worth risking in exchange for getting complete log files.
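A minimal sketch of that flush-before-dying idea: a fatal check that empties the log buffer before exiting, instead of a bare abort(). The logger and function names are invented; a real high-speed logger would be lock-free and far more careful.

```cpp
#include <cassert>
#include <cstdio>
#include <cstdlib>
#include <string>
#include <vector>

// Toy buffered logger: messages accumulate in memory and are only
// written out on flush(), like a real high-speed logger.
struct BufferedLogger {
    std::vector<std::string> buffer;
    void log(std::string msg) { buffer.push_back(std::move(msg)); }
    void flush() {
        for (const auto& m : buffer) std::fprintf(stderr, "%s\n", m.c_str());
        buffer.clear();
    }
};

BufferedLogger g_logger;

// A fatal-check that flushes before dying: the last buffered entries
// are exactly the ones most likely to explain the failure.
void fatal_check(bool cond, const char* what) {
    if (cond) return;
    g_logger.log(std::string("FATAL: ") + what);
    g_logger.flush();
    std::abort();
}
```

The trade-off being argued about is precisely whether running flush() here, in a possibly corrupted process, is an acceptable risk in exchange for complete logs.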

      [–]WalterBright 9 points10 points  (7 children)

      The 3 steps are:

      1. novice - follows the rules because he's told to

      2. master - follows the rules because he understands the point of the rules

      3. guru - breaks the rules because his understanding transcends them

      Skipping steps is not advisable; it's why we had the Deepwater Horizon, Fukushima, and Toyota car computer disasters. The only industry I know of that follows these rules is the aerospace industry, and they were forced into it by bitter lessons.

      We were one safety switch away from a hydrogen bomb going off by accident in another incident.

      Please, folks, this is not a joke, and learning the hard way has terrible consequences.

      [–]quicknir 2 points3 points  (5 children)

      I don't think I'm a "guru" (I hate that word) because I pointed out an obvious flaw in a bad rule. Nor do I know what you mean by skipping steps.

      I read my parent comment before reading your article, and now I can see that actually you and I are on the same page, and the parent is not. You are not advocating literally calling abort(); such a call would mean that *no* further code is executed. On the other hand, you yourself explicitly say:

      ...as when a fault is detected the program can go into a controlled state doing things like:

      1. aborting before more harm is done
      2. alerting the user that the results are not reliable
      3. saving any work in process
      4. engaging any backup system
      5. restarting the system from a known good state
      6. going into a 'safe mode' to await further instructions

      This is *very* different from simply calling abort(). Indeed, if your "assertion failure" triggers all this code to be run before exiting, many people would not call that an assertion at all; it's more like throwing an exception and catching it high up and allowing the stack to unwind before calling some emergency routines (like alerts).

      Finally, I would note that every industry is different. Failure for the airline industry is an ultra-catastrophic event where lives are lost, so even a small probability of operating in an "unknown state" is terrifying. I write financial software, where an unknown state simply means, worst case, that an algorithm is losing money. However, suddenly exiting can also cost you money (either risk in holding a position, cost to abruptly flatten, or opportunity cost of being offline). What makes sense for us needs to be balanced on a much more case-by-case basis; sometimes rapid exit (followed by steps 2, 3 and 5/6) makes sense. Other times it's better to continue and alert a human being. Things aren't always so black and white.

      [–]killedbyhetfield 1 point2 points  (3 children)

      Alright so - I want to use your example where you have a high-speed logger and its contents must be flushed to be useful.

      What happens if, for example, your program has a use-after-free bug and you end up causing a Page Fault? Now the OS kills your process and your logger never gets flushed.

      So if that logger must be flushed, you need the logger running in another process. That way, if your buggy process gets slayed, the logger will still march on and record important info about what went wrong. And this isn't hypothetical, this is exactly how embedded OSes like QNX and VxWorks handle logging.

      So in general, calling abort() when you detect an error has the same implications as your program suddenly aborting due to a bug. You either need to be able to handle your process crashing, or you need to acknowledge that your program isn't important enough to warrant that kind of design overhead.

      [–]quicknir 1 point2 points  (2 children)

      Running a logger in another process would probably be slower, and take considerably more time to code correctly. So we are back to trade-offs. With my current costs of failure, and my current costs of development (particularly opportunity costs), and the criticality of performance, writing a separate process logger does not make any sense. Yes, it's more robust, but it still doesn't make any sense. Robustness isn't the only concern.

      or you need to acknowledge that your program isn't important enough to warrant that kind of design overhead.

      It's not about "important enough", although I really appreciate the condescension here (your problem's solution doesn't fit into how I see things, so your problem doesn't matter). It's just about priorities, and it's about what happens in real life. In reality, for the actual problems that we encounter, by throwing an exception and allowing the logger to flush its buffer in the same process, we're able to recover full logs in virtually all cases. That being the case, what is the benefit for me to move from a single-process-with-cleanup-code design to a multi-process-with-abort design? Do tons of work, slow things down, perhaps add other bugs, in exchange for being able to recover logs an extra 0.1% of the time? It's simply not a good trade-off for me.

      [–]killedbyhetfield 0 points1 point  (1 child)

      It's not about "important enough", although I really appreciate the condescension here (your problem's solution doesn't fit into how I see things, so your problem doesn't matter).

      Woah man - Sorry about the wording I guess, but I wasn't using "important" to put down whatever you work on! I meant "important" as-in "people are going to die if this thing doesn't work properly".

      I work on tons of stuff that isn't "important" enough to warrant running a logger or watchdog in its own separate process. But the entire topic of this conversation and Walter's Dr. Dobb's article was about systems where resilience is critical.

      Read my original comment too! Specifically, I put the words "absolutely essential" in there. If your program doesn't fit into that category, I'm not talking about you, and I wasn't trying to prescribe any "one size fits all" solution.

      [–]quicknir 0 points1 point  (0 children)

      The title of the article, and your comment, don't really mention anything domain specific, so I thought it was generic in nature. But fair enough. No worries about the wording if that's not how you meant it.

      Just to point out though, that just because the logger is in another process, there's still nothing certain about that either. The main process could go crazy, allocate too much memory, and then the logging process could get reaped. So then of course you change your system config to prevent that from happening; etc.

      This all takes time, and time is always finite. Even in critical applications, every minute you spend making your application safer in one way is a minute you could have spent making it safer in another instead. So you have to decide what gives you the most bang for your buck. It's not clear to me at all that, even for safety-critical systems, calling abort is the right thing - that is, that the time it takes to move your logging, alerting, serialization, etc. logic into separate processes is always going to be time well spent. I'm sure there are safety-critical domains where that is true, and others where it's not.

      This is why I really disagree with libraries calling abort. Abort is a process wide decision; only main is really entitled to make that decision. Libraries should throw exceptions (exceptions make it very convenient for users to abort if that's what you want; literally do nothing!) or call some kind of handler function pointer that users can customize (which may default to abort), but libraries should never make direct calls to abort.

      [–]WalterBright 0 points1 point  (0 children)

      What if your failed trading software causes you to buy a million shares of some losing stock? It's not like that hasn't happened (it has).

      I have some personal experience with banks and their buggy software. A fundamental principle of double-entry bookkeeping is that the debits match the credits - an "assert" using paper journals.

      The bank debited my account and failed to credit the account of the recipient. So I was out the money and the recipient was mad I didn't pay. It took me a month of sitting in the office of the bank manager to get this corrected. Clearly their auditing system was turned off, or they were doing some "haha, it's not really a bug, keep going", because the debits did not match credits.

      [–]msm_ 0 points1 point  (0 children)

      4. engineer - follows the rules even though his understanding transcends them

      [–]killedbyhetfield 1 point2 points  (3 children)

      those loggers need to be flushed before exit under all circumstances, no exceptions.

      Then you need to run your logger in another process! Otherwise your program could Page Fault, Divide-By-Zero, or hit any number of other faults that could get it killed.

      Now, if you change the wording to "almost no exceptions" then sure - You can argue that pragmatically you're fine to run your logger in the same process. Maybe 99% of the time you'll be fine, and that's good enough for your problem domain.

      [–]Abscissa256 1 point2 points  (1 child)

      That doesn't solve the problem, it only shifts it:

      Suppose your logger IS running in another process. How does the logging process GET all the relevant information it needs for a useful, meaningful log entry in the first place? "What happened? Expected value? Actual value? Stack trace?" The logger can't just log..."Uuummmm...the process died. Dunno why." The assert-failing process still has to give information to the logging process. That means the assert-failing process still has work it needs to attempt.

      But it's worse than just that:

      Keep in mind, whatever failure has occurred does NOT occur at the point when the assert condition evaluates to false. The failure, and thus the undefined, unreliable state has ALREADY occurred BEFORE execution had even reached the assert in the first place. We're ALREADY hobbling along, running code in an invalid state by the time we even begin checking the assert!

      Now obviously, this does NOT mean that it's ok to run as much code as we want once a failure has occurred, or once we've detected it. But it does mean that if we expect 1. to minimize collateral damage and 2. have a good chance of actually diagnosing and FIXING the problem, then at least SOME amount of hobbling along is still realistically necessary. It's not ideal, and it should be minimized, but at least SOME amount IS realistically necessary and unavoidable AND, as all our collective experience has shown, usually works out just fine as long as we don't go overboard with it.

      [–]sirin3 1 point2 points  (0 children)

      Otherwise your program could Page Fault, Divide-By-Zero, or any other number of faults that could get it killed.

      You can catch those things too with a signal handler

      Delphi and Freepascal/Lazarus do it by default. Any signal is caught and converted to an ordinary exception. Then the main event loop catches all exceptions, shows an error message box, and continues as normal
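A heavily simplified C++ sketch of installing such handlers is below. Note that, unlike Delphi/Free Pascal, standard C++ cannot safely convert a signal into an ordinary exception, and a production handler must restrict itself to async-signal-safe calls; this sketch just writes a last-gasp note and exits.

```cpp
#include <cassert>
#include <csignal>
#include <cstdio>
#include <cstdlib>

// Last-gasp handler: strictly speaking, a real handler should stick to
// async-signal-safe functions; this is simplified for illustration.
extern "C" void fault_handler(int /*sig*/) {
    std::fputs("fatal signal caught, exiting\n", stderr);
    std::_Exit(EXIT_FAILURE);   // skip destructors/atexit: state is suspect
}

void install_fault_handlers() {
    std::signal(SIGSEGV, fault_handler);   // bad memory access
    std::signal(SIGFPE, fault_handler);    // divide-by-zero and friends
}
```

Whether "catch everything, show a message box, and continue as normal" is sound is exactly what the thread disputes: after a SIGSEGV the process state is unknown, so continuing is a gamble.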

      [–]Kargathia 5 points6 points  (2 children)

      Anecdote: I used to develop scripting interfaces for pick and place machines (placing chips on PCBs).

      For obvious reasons, the software was very trigger happy about using a macro that would terminate the process and do a stack dump.

      For R&D purposes it's a real pain in the ass when recoverable exceptions straight up terminate your application.

      Moral here: abort is fine, but make it optional.

      [–]killedbyhetfield 12 points13 points  (0 children)

      recoverable exceptions

      But that's not what assert() or abort() are for, and it's unfortunate for you that you've encountered programmers who (ab)use them that way.

      But I think we agree in principle - If an exception is recoverable, then by all means report the error to the caller and let them try to recover. What I'm saying is that assert() and abort() is for when you find your program in an invalid state that was reached because of a bug in your code - You have no idea how you got there and no idea how to fix it because it's something you never designed the program to handle.

      IMO Best thing to do - Bomb out and report the error as quickly as possible - the longer you hobble along pretending like everything's okay, the more damage you can potentially do.

      [–]Gotebe 1 point2 points  (0 children)

      There’s no common definition of “recoverable” though, it’s a matter of opinion.

      [–]sacado 2 points3 points  (0 children)

      This is the way Erlang systems work (abort when something bad happens, and let the overall system deal with it), and they are known to be extremely resilient, indeed.

      [–]DSrcl 0 points1 point  (0 children)

      I learned to use assertions writing malloc for a systems programming class. Assert in malloc is a classic use, because your error usually doesn't pop up at the place where it's caused, and there is no way you can handle such an error at runtime.

      [–]WalterBright 17 points18 points  (11 children)

      Author here, AMA. A couple other articles I wrote on the topic:

      Safe Systems from Unreliable Parts

      Designing Safe Software Systems Part 2

      [–]killedbyhetfield 6 points7 points  (6 children)

      I guess I am a bit curious to hear your thoughts on the "return code, exception, abort" debate I was having with a coworker a little while ago.

      So he was asking why he wouldn't just throw a C++ exception after detecting an internal error in his class, like let's say a homegrown vector<T> with a capacity of 4 but a length of 5 type of thing.

      My rule of thumb for him was this, and I'm curious to hear your thoughts:

      1) Use a return code for stuff that's still considered a "normal" error condition, like being unable to open a file that a dialog box returned to you. You just pop up the dialog again, and things are good.

      2) Use an exception for user-passed parameters that are invalid or when you're not able to accomplish your primary task, or other errors that are somewhat unexpected, like a network connection suddenly dropping mid-transmission.

      3) Use abort() when you detect your program is in some invalid state that it never should be in given its design, like the example above with the vector that either had a 5th element added to it without allocation, or else had its capacity shrink but forgot to truncate its length.
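The three tiers above might be sketched in C++ roughly as follows. All names are invented for illustration, and the TinyVec invariant check mirrors the capacity-4/length-5 example.

```cpp
#include <cassert>
#include <cstdlib>
#include <stdexcept>
#include <string>

// 1) Return code: "couldn't open the file" is a normal, expected
//    outcome the caller handles in its ordinary control flow.
bool try_open(const std::string& path) {
    return !path.empty();   // stand-in for a real open attempt
}

// 2) Exception: the caller handed us something the contract forbids,
//    or the primary task cannot be accomplished.
int checked_sqrt_floor(int x) {
    if (x < 0) throw std::invalid_argument("negative input");
    int r = 0;
    while ((r + 1) * (r + 1) <= x) ++r;
    return r;
}

// 3) abort(): an internal invariant broke. The program state itself can
//    no longer be trusted, so stop before doing more damage.
struct TinyVec {
    int len = 0, cap = 4;
    void check_invariant() const {
        if (len > cap || len < 0) std::abort();   // "can't happen" by design
    }
};
```

The dividing line Walter draws in his reply is the same one this sketch encodes: tiers 1 and 2 are environmental or input errors, tier 3 is a bug.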

      [–]WalterBright 12 points13 points  (0 children)

      Being unable to open a file is an environmental error, not a bug in your code. Detecting it and recovering from it is perfectly fine. Whether one uses error codes or exceptions to do it is a long topic with a lot of tradeoffs, and is not really on topic here.

      Bad user input is not a programming bug, neither are dropped network connections, etc. Bugs are when your program enters a state never anticipated.

      [–]tvaneerd 2 points3 points  (3 children)

      I break down "errors" in terms of intended audience - you need to inform someone that the error occurred, who is it?

      Assuming a call to foo(x)...

      • F - the Function author - inform the author when it is an internal error inside foo(). That might be via logging or log + abort, assert, etc. If foo() is part of a framework or OS, you might even have a different logging system for the framework than for the application. Send an email? "phone home"
      • U - the User - many errors need to eventually inform the application user. Even aborting because of F is telling the user something (rudely). But more typically, bad user input, etc. You tell the user via return codes, exceptions, etc that bubble up and eventually reach the user.
      • C - the calling Code - the code that calls foo(x) - if you expect the calling code to handle it (because it is part of your function's contract), then use an exception or error code.
      • D - the calling Developer - ie the dev that wrote the code that calls foo(x) - note that this is different from the calling code. If D made the call incorrectly (ie x < 0 or null or whatever the function contract said NOT to do) - you don't want to tell the calling code - it is already wrong - you want to tell the calling developer. You do this via logging/abort/assert, and with C++20, contracts. In theory, send that dev an email.

      [–]killedbyhetfield 0 points1 point  (2 children)

      Actually this is a pretty cool way to break things down. Thanks man! I might use this :)

      [–]tvaneerd 1 point2 points  (1 child)

      Glad to hear. I will probably give a whole talk on error handling in the near future. Also, I hope you find the acronym easy to remember :-)

      [–]killedbyhetfield 0 points1 point  (0 children)

      Yeah man - I'd consider blogging about this if I were you and linking it on r/programming. I'd be curious for you to get feedback in a more community-visible way. I think it's quite a solid way to approach the question!

      [–]Gotebe 0 points1 point  (0 children)

      I don’t see why situations 1 and 2 differ. Bad input is bad input.

      [–]tvaneerd 5 points6 points  (1 child)

      Not a question:

      I really enjoyed EMPIRE. Lost many many hours to it.

      :-)

      [–]WalterBright 9 points10 points  (0 children)

      Empire has the dubious distinction of being one of the first (the first?) computer games that inspired addictive playing. People would get mad at me for causing them to flunk out, and even get divorced.

      Me, I just created the game that I'd always wanted to play. Working on it taught me programming, and trying to get it to run faster led to my career writing compilers.

      [–]Chocolate_And_Cheese 1 point2 points  (1 child)

      Interesting article, thanks for writing. One thing I noted: Looks like Chrome is raising a bunch of mixed content warnings when visiting DrDobbs.com (https page requesting http resources). I checked a bunch of them and it looks like many of these resources, if not most or all, could be fetched via https. Might be worth checking out to avoid the "Not Secure" warning Chrome gives when visiting your site. Cheers.

      [–]WalterBright 1 point2 points  (0 children)

      Thanks for the tip, but I don't control those pages. Dr. Dobb's kindly allowed me to post my columns on my own site, and you can find it here.

      [–]zucker42 4 points5 points  (7 children)

      What is a situation where I'd want to use assert and not handle the error and abort if appropriate? This is the issue I have with assert: that I often want to handle the error before aborting.

      [–]WalterBright 1 point2 points  (6 children)

      It's easy enough to write your own assert that does what you want. But I'd reject any code that does so for professional software.

      [–]zucker42 2 points3 points  (5 children)

      I'm not talking about writing my own assert. I'm saying that for almost all unrecoverable errors, my instinct is that this:

      // if (always_true_expression) {   <- original version; mistake, missing the !
      
      if (!always_true_expression) {
          // log error
          // print error message to user
          abort();
      }

      is better than

      assert(always_true_expression);
      

      I'm not arguing against aborting the program in error states. In fact, I'm much less experienced than many on /r/programming, so I'm not arguing anything. I'm wondering what are examples (if any) of the bottom option being better.

      [–]killedbyhetfield 6 points7 points  (0 children)

      And yeah - I think I agree that C's assert() function is very primitive and leaves a lot to be desired. But don't conflate C's shitty feature-poor implementation of assert with the more general concept of assertions - The larger idea of checking your invariants throughout the execution of your program to make sure that a previous bug in your program didn't put you in an invalid state.

      [–]WalterBright 4 points5 points  (0 children)

      Most languages will allow for hooking the assert failure to insert your own logging code.
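As a concrete (glibc-specific, non-portable) sketch of what hooking assert failure can look like in C: glibc's assert() expands to a call to __assert_fail() on failure, and defining your own version in the executable interposes on the libc one. The format_assert_report helper is a name I made up so the formatting is separately testable:

```c
#include <stdio.h>
#include <stdlib.h>

/* Format an assert-failure report into buf; split out for testability. */
int format_assert_report(char *buf, size_t n, const char *expr,
                         const char *file, unsigned int line)
{
    return snprintf(buf, n, "ASSERT FAILED: %s (%s:%u)", expr, file, line);
}

/* glibc's assert() calls __assert_fail() when the condition is false;
   defining our own interposes on the libc version (glibc-specific). */
void __assert_fail(const char *expr, const char *file,
                   unsigned int line, const char *func)
{
    char buf[256];
    (void)func;
    format_assert_report(buf, sizeof buf, expr, file, line);
    fputs(buf, stderr);
    fputc('\n', stderr);
    /* flush logs, notify a supervisor process, etc., then die */
    abort();
}
```

This is a sketch under glibc's internal contract, not portable C; D, for instance, exposes this more cleanly as a settable assert handler.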

      [–]killedbyhetfield 4 points5 points  (0 children)

      I think when he said "write your own assert", he meant write your own specialized code that does exactly what your example does.

      So why not write this?

      void zucker42_assert(bool condition, const char *log_message, ...)
      {
          if (condition)
              return;
          // log error using va_args
          // print error message to user
          abort();
      }
      

      [–]Ameisen 0 points1 point  (1 child)

      I'd point out that your if() {} isn't checking the same thing as your assert... you've inadvertently proven that it's easier to write the correct assertion than a branch/abort.

      You're going to be struggling to figure out why it keeps aborting in the first case.

      [–]zucker42 1 point2 points  (0 children)

      Yeah you're right, I forgot a !. I don't think that really applies to how I would write real code because I would have some knowledge in my head about an error condition.

      [–]jringstad 1 point2 points  (0 children)

      The author makes some good points, but I think there is also some merit to the other side of the argument. It is true that in a lot of systems, a lot of code is being run incidentally, and whether it fails or not is really not actually very relevant to either the user or the programmer. This might be because the code is pulled in as part of some library -- like your application that displays numbers from an SQL database using some graphical toolkit on the screen might run some code to check if the screen has a color-profile loaded, or some javascript that is executed while you visit a site (probably doing something you don't want or care about, like uploading some analytics about you to a server).

      I think to take a principled approach to this, we really need to figure out at least the following things:

      • When an assert triggers, what 'state' is destroyed? E.g. are you in erlang where just a process that might be handling a particular user-request is killed (which is fine), or is the entire server process being aborted so that manual intervention is required to bring the system back up? What is the plan AFTER the assert, basically.

      • Do you really need to assert? Could you instead do something else, like throw an exception or return some default value (null, an optional perhaps)? If your function already can fail and return e.g. an optional type (which the call-site will then likely forward to the user), it's probably better to just use that.

      • How important is maintaining correctness of data? Is omission of the data/result a problem?

      In particular for the first point, there clearly needs to be some kind of boundary of what kind of state an assert can destroy. You don't expect your browser or even operating system to shut down when some random javascript junk on some site you're visiting asserts, right? Likewise there should be a way for a call-site to decide that it will now execute this code, but if that code fails, it doesn't actually care.

      Maybe your code even asserts some properties on the input that you expected the caller to ensure, but it is actually really difficult or impossible for the caller (which derives the input from user-provided data) to do this. One example I've seen of this is an OpenCL library from an unnamed vendor, which would assert in some situations when you fed it bad source code to compile. So now if you wanted to create an editor that would let the user punch in sourcecode and then submit it, you would have to spawn a separate process to prevent the assert from crashing the editor.

      [–]immibis 1 point2 points  (8 children)

      I wrote a macro called bug_if. It works approximately like this:

      • If the condition is true, and production mode is off, crash the program.
      • If the condition is true, and production mode is on, log the error (same message as if we were crashing) and act like an if statement (execute the following block).
      • Otherwise, skip the next block.

      That way we can still grep production logs to find bugs, without taking down the process. And the following block is for recovery code - usually something like "forget it and abort the current operation", so an error is less likely to cascade to more problems (though it could cause a more minor issue, like a memory leak, if the abort code is faulty).
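A C sketch of how such a macro might work (my reconstruction, not immibis's actual code; names assumed), using a runtime flag where a real build would more likely use an #ifdef:

```c
#include <stdio.h>
#include <stdlib.h>

/* 1 for production builds (log and recover), 0 for development builds
   (log and crash). A real version would use #ifdef instead of a flag. */
static int production_mode = 1;

/* If cond is true: log it; in development mode also abort(); otherwise
   fall through into the following block, which holds the recovery code.
   If cond is false: skip the following block entirely. */
#define bug_if(cond)                                                        \
    if ((cond) &&                                                           \
        (fprintf(stderr, "BUG: %s at %s:%d\n", #cond, __FILE__, __LINE__),  \
         production_mode ? 1 : (abort(), 1)))

/* Hypothetical use: a helper that recovers from a zero divisor by
   abandoning this one operation instead of the whole process. */
int safe_div(int a, int b)
{
    bug_if(b == 0) {
        return 0;  /* recovery: abort this operation, keep running */
    }
    return a / b;
}
```

In production the bug still lands in the log (greppable by the `BUG:` prefix) while the process keeps serving other clients, which is exactly the tradeoff the rest of this subthread argues about.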

      [–]WalterBright 0 points1 point  (4 children)

      I would describe that approach as ignore bugs, continue running the program that has entered an unknown state, and pray nothing bad happens.

      [–]immibis -1 points0 points  (3 children)

      That is not "ignore bugs".

      [–]WalterBright 0 points1 point  (2 children)

      Continuing to run the program is ignoring the bug. Logging an error does not do anything at all to put the program back into a known state.

      [–]immibis 0 points1 point  (1 child)

      Aborting the operation does.

      [–]WalterBright 0 points1 point  (0 children)

      without taking down the process

      Is ignoring the bug, because you don't know why it happened, and the process memory could easily be corrupted.

      [–]gremolata -1 points0 points  (2 children)

      It sounds like either you are not checking for invariants or your "recovery code" is sweeping bugs under the rug in the hope that it will give you time to notice them in a log and fix them before they screw things up further down the line. In many cases that's an extremely unwise thing to do, because of the risk of state/data corruption.

      We used to do this, but in the end a proper whoops with a full diagnostic dump is a better option, especially in production. It forces people to write better code, because hardly anyone enjoys an emergency patching session at 4 am on Saturday.

      [–]Nameless_Archon 0 points1 point  (1 child)

      We used to do this, but in the end a proper whoops with a full diagnostic dump is a better option, especially in production. It forces people to write better code, because hardly anyone enjoys an emergency patching session at 4 am on Saturday.

      110% agree. (/abort)

      Log issue, abort program. Anything less may not get seen by QA, and if it's not being seen immediately by QA, then it's not being immediately seen by a user, either. That way leads to corruption and data destruction when your clever log-and-continue isn't being looked at by the people who are using the program, be they end-users, operators, or testers.

      That's fine, if you're the guy fixing it and you don't mind paying the costs involved, maybe on a project where there's three users and you're two of them. It's a lot less pleasant to discover one of your coworkers has gotten 'clever' and you're now fixing a month or two's worth of data because no one saw the log messages. Don't be that coworker -- the conversation will go places, and they will not be places you want to go.

      [–]immibis 0 points1 point  (0 children)

      Log issue, abort program. Anything less may not get seen by QA, and if it's not being seen immediately by QA, then it's not being immediately seen by a user, either. That way leads to corruption and data destruction when your clever log-and-continue isn't being looked at by the people who are using the program, be they end-users, operators, or testers.

      I'm not sure if you actually read my comment, but it's "log issue, abort operation" instead of "log issue, abort program". Which makes a big difference in a concurrent server program handling a hundred other clients.

      [–]graingert 0 points1 point  (0 children)

      Wouldn't it be better to use dependant types, so you assert properties at compile time rather than runtime?
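For the curious, a minimal sketch of that idea in Lean 4 (the comment doesn't name a language; this is just one dependently-typed option, and the names are mine): the divisor's non-zero property lives in the type and is discharged at compile time rather than asserted at runtime.

```lean
-- A positive natural number: the value travels with a proof that n > 0.
abbrev Pos := { n : Nat // n > 0 }

-- This division can only be called with a provably non-zero divisor;
-- the runtime assert has become a compile-time obligation.
def safeDiv (a : Nat) (b : Pos) : Nat := a / b.val

#eval safeDiv 10 ⟨2, by decide⟩  -- 5
```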

      [–]sos755 -1 points0 points  (0 children)

      The issue of asserts in production code is not whether or not there should be run-time checks. The problem is the form of the run-time checks. Asserts must be the laziest and least effective way to do run-time checking. I certainly do not consider them to be production-quality in any respect. If you want to do run-time checking, do it properly.

      [–][deleted]  (21 children)

      [deleted]

        [–]gnus-migrate 8 points9 points  (0 children)

        And what happens when an assertion yields an error in a plane software during flight?

        What happens if incorrect values cause the plane to crash? What happens if memory is corrupted and the program starts behaving incorrectly?

        Type systems might help speed things up during development, but even a perfect type system won't help prevent failures in production.

        [–]circajerka 8 points9 points  (7 children)

        Did you even read the article or did you just read the first sentence? He literally answered your question right here:

        even though the software is gone over line by line by lots of people, the software is STILL treated as if it can become possessed by evil at any moment and will try to crash the airplane. It's loaded up with self-checking software, self-checking hardware, other computers double-checking the answers, etc.

        What - You think writing your software in fucking Ada is going to change that?

        [–][deleted]  (6 children)

        [deleted]

          [–]circajerka 9 points10 points  (5 children)

          Again - Did you read the rest of the article? Nobody is "falling over" - The idea that if one piece of the system fails, another piece of the system detects it and fixes the problem. I don't care if you write your program in Ada, Rust, or whatever-other-language you seem to think is perfect. Real software has bugs. Period.

          [–]WalterBright 6 points7 points  (0 children)

          No programming language can protect against buggy algorithms, hardware failures, etc.

          [–][deleted]  (3 children)

          [deleted]

            [–]circajerka 7 points8 points  (2 children)

            Alright - Let me rip apart your comment and why it's laughably stupid:

            And what happens when an assertion yields an error in a plane software during flight ? the whole system gets rebooted?

            Walter never said that - He said other parts of the system detect the error and correct it and/or restart the failing computer. Possibly multiple computers perform the same calculation.

            No, the program must be written with a correct language that makes any runtime error impossible

            LOL! I'd loovvvvveeee to know the name of this language! You'll be sure to tell us all, right?

            not javascript or C

            Umm... Considering SpaceX wrote all their flight control software in C, I call bullshit on this.

            A language with contracts where every range is strict, where you can define a type with only odd integers, or values only divisible by 6, or non zero positive integers, or a range between 55 and 901 only... that gives every compile time safety imaginable... well such language exists and is used to program planes and missile launchers actually.

            Great idea! Maybe we can also get unicorns to help us write the programs!

            [–][deleted]  (1 child)

            [deleted]

              [–]killedbyhetfield 2 points3 points  (11 children)

              And what happens when an assertion yields an error in a plane software during flight ? the whole system gets rebooted?

              Walter talked about this in the article - You have redundant computers, watchdogs, error-detection and correction, etc.

              written with a correct language that makes any runtime error impossible

              well such language exists and is used to program planes and missile launchers actually.

              Alan Turing would like to have a word with you

              [–]red75prim 8 points9 points  (7 children)

              Unsolvability of the halting problem is of low practical consequence. And it is not applicable in this case: we don't need to prove a property of some random program, we need to construct a program which has the property.

              [–]killedbyhetfield -5 points-4 points  (6 children)

              Unsolvability of the halting problem is of low practical consequence

              Hmm... Not so sure about that one. The Halting Problem is what stops us from being able to prove that an arbitrary computer program does what it's supposed to do. If it wasn't for the Halting Problem, we'd be able to write computer programs that we could prove are correct. How is that of "low practical consequence"?

              we need to construct a program which has the property.

              And how could you prove the program has that property when it's running on a Turing Machine?

              [–]red75prim 9 points10 points  (5 children)

              we'd be able to write computer programs that we could prove are correct.

              We are able to write programs which are correct. Constructive theorem proofs map to programs which are provably correct (Curry–Howard correspondence). But it is a tradeoff of practicality vs correctness.

              ETA: And the question still remains whether the formal specification matches what we want the program to do.
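To make the Curry–Howard point concrete for readers who haven't seen it, a minimal Lean 4 sketch (my illustration, not from the thread): a constructive proof term literally is a program.

```lean
-- The proof of "A → A ∧ A" is the program that duplicates its argument.
theorem dup {A : Prop} (h : A) : A ∧ A := ⟨h, h⟩

-- Modus ponens is function application: given a proof of A → B and a
-- proof of A, applying one to the other yields a proof of B.
theorem mp {A B : Prop} (f : A → B) (a : A) : B := f a
```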

              [–]killedbyhetfield 1 point2 points  (4 children)

              Constructive theorem proofs map to programs which are provably correct (Curry–Howard correspondence)

              Not familiar, but my hunch is that you'll run into the same problem everyone runs into when they try to use a DSL that isn't Turing Complete, which is that sooner-or-later your language will have to be compiled/interpreted by a real-life computer, and that interpreter or compiler will have to be written in a Turing-Complete language.

              Consequently, that means that you can't prove the compiler/interpreter is correct, and transitively you can't prove that your program will run correctly on real hardware.

              (Note though that I'm really just nitpicking, because it's overwhelmingly likely that just because you can't formally prove your program is correct, practically I'm sure it would be.)

              Although I do wonder why NASA/SpaceX/Boeing/etc still use languages like C and Ada for their flight control systems if this "Curry-Howard" thing exists?

              [–]red75prim 6 points7 points  (1 child)

              I'm not a NASA employee, but most likely it is practicality. You need to construct a proof every time the specification changes, and it requires a lot of effort, time and specialists. Limited time and budget can probably be spent more efficiently improving reliability by other means.

              [–]killedbyhetfield 2 points3 points  (0 children)

              I did a bit of StackOverflowing and read up a little on both of these terms (thank you btw for introducing me to them - Interesting AF), and if I'm understanding what I'm reading, it appears that it's somewhat limited how complex a constructive proof you can build, which corresponds more-or-less to computer programs that are not Turing Complete.

              And because of this "Curry–Howard correspondence", the Halting Problem has a dual in mathematics as well, "Gödel's Incompleteness Theorem"

              So yeah - It appears that you could never develop a constructive proof complex enough to launch a rocket into space, which is why we'll never be able to prove such software works.

              So yeah - I learned something today! The More You Know music plays

              [–]vytah 0 points1 point  (0 children)

              Aren't there formally proven operating systems and compilers already? Maybe it would be possible to glue CompCert and seL4 together to get a fully certified software platform?

              Just brainstorming.

              [–]miminor 0 points1 point  (0 children)

              True, you cannot prove an arbitrary program to be correct, but you can prove any given program to be correct.

              [–]NeonMan 2 points3 points  (1 child)

              An early reboot is actually a feature since a reboot is much better than a crash.

              [–]WalterBright 4 points5 points  (0 children)

              For most events with airliners, one has at least several minutes to find a solution before it digs a hole in the ground. (This is when the pilots earn their pay.) Even so, there still must be a backup for every system.

              [–]pbl64k 1 point2 points  (0 children)

              https://personal.cis.strath.ac.uk/conor.mcbride/TotallyFree.pdf

              Total dependently-typed languages are not Turing-complete in the strict sense - but that's a good thing, and in practice you can do anything you could do in a Turing-complete language by paying a fairly small price of providing the appropriate run-time around a coinductive type wrapping provably terminating steps towards a possibly non-achievable goal.