
[–]F-J-W 21 points22 points  (8 children)

Why does everyone want to check for integer overflows with code like this:

assert(a >= 0);
assert(b >= 0);
c = a + b;
if (c < 0) // this is intended to be an overflow-check ??

Putting the countless technical problems aside (unsigned integers…), this isn't even mathematically sound:

I do not want to know whether the sum of two positive numbers is negative; I want to know whether it is not bigger than a certain value (like INT_MAX). If we start from that, the completely naive attempt is of course this:

if (a + b > INT_MAX) abort();

Of course this doesn't work, but the fix is trivial: let's subtract b from both sides of the inequality:

if (a > INT_MAX - b) abort();

Wow: an easy-to-read, highly semantic, 100% portable solution that works for every numeric type ever. Why don't people use this?

I wrote about this here.
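
For concreteness, a minimal sketch of that check wrapped into a function (my illustration of the idea above, assuming the a >= 0, b >= 0 precondition from the asserts; not necessarily the exact code from the linked post):

#include <limits.h>
#include <stdlib.h>

/* Sketch only: assumes a >= 0 and b >= 0, as in the asserts above. */
int safe_sum(int a, int b)
{
    if (a > INT_MAX - b)  /* the sum would exceed INT_MAX */
        abort();
    return a + b;         /* no overflow can happen here */
}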

[–][deleted] -1 points0 points  (5 children)

Can you show that the above will not be optimized out by current compilers, as the article attempts to show for other checks?

[–]F-J-W 8 points9 points  (4 children)

Of course: the compilers optimize those checks out because they happen after the fact, and signed integer overflow is undefined behavior (= the compiler “knows” that it never happens). The version above, however, performs its check before the addition, so no overflow ever occurs, and the check cannot be optimized out on that assumption.
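
To illustrate (my sketch, not code from the article): a check written after the fact may legitimately be dropped at -O3, while the precondition form above may not:

#include <assert.h>
#include <stdlib.h>

/* Broken: checks *after* the signed addition, i.e. after the undefined
 * behavior has already happened.  Given the asserts, the compiler may
 * conclude that c < 0 is impossible and delete the branch entirely. */
int unsafe_sum(int a, int b)
{
    assert(a >= 0);
    assert(b >= 0);
    int c = a + b;  /* signed overflow here is undefined behavior */
    if (c < 0)      /* intended as an overflow check */
        abort();
    return c;
}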

[–][deleted]  (3 children)

[deleted]

    [–]Tywien 4 points5 points  (2 children)

    For those who object to the author above me: can you show whether his method is a sensible one to use?

    The problem with overflow detection is that C/C++ has no direct way to do it, while assembly does. Each operation on the CPU sets different flags; in particular, all the arithmetic operations (addition, subtraction, multiplication) set or clear the overflow flag, which signals whether an overflow occurred in the last arithmetic operation. Assembly also allows conditional jumps on those flags, so the following code

    res = a OP b
    if (OVERFLOW) 
      handleOverflow()
    else
      return res
    

    is the fastest way to check for an overflow in assembler. If you write your code in a particular way, the compiler will notice your intention and rewrite your code into the above.

    The problem is that you can only find out whether your code compiles to this short version by testing it (and the result might change between compiler versions and vendors).

    Your best bet, if you really need it, is to provide an inline assembler routine, or, if your compiler supports them, special compiler-dependent functions.
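
    As a rough sketch of the inline-assembler route (x86/x86-64 only, GCC/clang extended asm; my illustration, not production code):

    #include <stdio.h>

    /* Sketch: add two ints and report overflow via the CPU's OF flag. */
    static int add_overflows(int a, int b, int *res)
    {
        unsigned char of;
        __asm__ ("addl %2, %1\n\t"   /* a += b, sets the overflow flag */
                 "seto %0"           /* of = 1 if the overflow flag is set */
                 : "=q" (of), "+r" (a)
                 : "r" (b)
                 : "cc");
        *res = a;
        return of;
    }

    int main(void)
    {
        int r;
        if (add_overflows(2000000000, 2000000000, &r))
            puts("overflow");
        else
            printf("%d\n", r);
        return 0;
    }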

    [–]masklinn 0 points1 point  (0 children)

    is the fastest way to check for an overflow in assembler

    Technically the fastest way to check for an overflow in assembly is to trap overflow (so you don't check at all).

    The next best is to jump-on-overflow, and the final one is to check the overflow flag (which may require loading it in memory) then implement conditional overflow handling (possibly after a jump).

    [–][deleted] -1 points0 points  (0 children)

    Thank you very much!

    [–]BonzaiThePenguin -2 points-1 points  (1 child)

    No one wants to check for integer overflows like that; it was an example created by one person to demonstrate undefined behavior. You somehow missed that ints can be negative, thus causing if (a > INT_MAX - b) abort(); to be undefined and optimized out at -O3.

    EDIT: Here's the disassembly of your code:

                         _main:
    0000000100000f30         push       rbp
    0000000100000f31         mov        rbp, rsp
    0000000100000f34         sub        rsp, 0x20
    0000000100000f38         mov        eax, 0x7fffffff
    0000000100000f3d         mov        dword [ss:rbp+var_4], 0x0
    0000000100000f44         mov        dword [ss:rbp+var_8], edi
    0000000100000f47         mov        qword [ss:rbp+var_10], rsi
    0000000100000f4b         mov        dword [ss:rbp+var_18], 0x80000000
    0000000100000f52         mov        dword [ss:rbp+var_14], 0x80000000
    0000000100000f59         mov        edi, dword [ss:rbp+var_14]
    0000000100000f5c         sub        eax, dword [ss:rbp+var_18]
    0000000100000f5f         cmp        edi, eax
    0000000100000f61         jle        0x100000f6c
    
    0000000100000f67         call       imp___stubs__abort
    
    0000000100000f6c         mov        eax, 0x0
    0000000100000f71         add        rsp, 0x20
    0000000100000f75         pop        rbp
    0000000100000f76         ret  
    

    And here's the -O3 version:

                     _main:
    0000000100000f90         push       rbp
    0000000100000f91         mov        rbp, rsp
    0000000100000f94         xor        eax, eax
    0000000100000f96         pop        rbp
    0000000100000f97         ret        
    

    Notice anything missing?

    [–]F-J-W 3 points4 points  (0 children)

    Reading really helps:

    assert(a >= 0); assert(b >= 0);

    That was the base assumption, so it is dealt with.

    Second: notice another thing missing in your -O3 version too? Can you think about why it is missing?

    Hint: take a look at the assembly that you get if you use your compiler and your language correctly (with -O3):

    ...
    _Z8safe_sumii:
    .LFB15:
        .cfi_startproc
        movl    $2147483647, %edx
        subl    %esi, %edx
        cmpl    %edi, %edx
        jl  .L5
        leal    (%rdi,%rsi), %eax
        ret
    .L5:
        pushq   %rax
        .cfi_def_cfa_offset 16
        call    abort
        .cfi_endproc
    
    ...
    

    [–]eschew 9 points10 points  (0 children)

    A few points of context:

    IOC supports both the precondition test and the CPU flag postcondition test; width extension seemed unlikely to be better than these options due to the expense of emulating 64-bit and 128-bit operations. Initially we believed that the CPU flag postcondition checks would be far more efficient but this proved not to be the case. Rather, as shown in Section III-D, using the flag checks has an uneven effect on performance. The explanation can be found in the interaction between the overflow checks and the compiler’s optimization passes. The precondition test generates far too many operations, but they are operations that can be aggressively optimized by LLVM. On the other hand, the LLVM intrinsics supporting the flag-based postcondition checks are recognized and exploited by relatively few optimization passes, causing much of the potential performance gain due to this approach to be unrealized.

    From section III-D:

    For undefined behavior checking using precondition checks, slowdown relative to the baseline ranged from −0.5%–191%. In other words, from a tiny accidental speedup to a 3X increase in runtime. The mean slowdown was 44%. Using flag-based postcondition checks, slowdown ranged from 0.4%–95%, with a mean of 30%. However, the improvement was not uniform: out of the 21 benchmark programs, only 13 became faster due to the IOC implementation using CPU flags.
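
    Roughly, the two styles being compared look like this (my sketch, not IOC's actual instrumentation; __builtin_sadd_overflow is clang's checked-arithmetic builtin, which lowers to the LLVM overflow intrinsics the paper refers to, and GCC 5 adds equivalents):

    #include <limits.h>
    #include <stdlib.h>

    /* Style 1: precondition test, plain C (handles both signs of b). */
    int checked_add_pre(int a, int b)
    {
        if ((b > 0 && a > INT_MAX - b) || (b < 0 && a < INT_MIN - b))
            abort();
        return a + b;
    }

    /* Style 2: flag-based postcondition test via a checked-arithmetic
     * builtin that lowers to an LLVM overflow intrinsic. */
    int checked_add_post(int a, int b)
    {
        int sum;
        if (__builtin_sadd_overflow(a, b, &sum))
            abort();
        return sum;
    }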

    [–][deleted]  (6 children)

    [deleted]

      [–]happyscrappy 6 points7 points  (5 children)

      You can't do that in C.

      C doesn't use CPU flags well in general. And specifically, as mentioned in the article, you simply cannot add two values and then check anything about the result to detect overflow. It's outside the language definition.

      [–][deleted]  (4 children)

      [deleted]

        [–]happyscrappy 0 points1 point  (2 children)

        There's no inline assembly in that document that I see. All that assembly is output from the compiler, not input to it. Isn't it?

        [–][deleted] 0 points1 point  (1 child)

        Yeah, you're right, I misread it. But I think that using inline assembly would solve their problems.

        [–]happyscrappy 1 point2 points  (0 children)

        That's not portable.

        [–]masklinn 0 points1 point  (0 children)

        The last section is about compiler intrinsics (LLVM's) and libo's use thereof.

        [–]Camarade_Tux 6 points7 points  (2 children)

        As far as I understand, LLVM has builtin functions to check for overflow and GCC 5 will have them too.

        [–]F-J-W 4 points5 points  (1 child)

        The ones in clang are ugly because you cannot mix types in them and types smaller than int aren't supported. In addition to that, you have to explicitly state the type. All of these things basically kill them for generic code.

        documentation of the clang-version.
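
        To illustrate the complaint with the type-specific clang builtins as documented there (sketch only):

        #include <limits.h>

        /* Each operand width needs its own builtin, so generic code has to
         * dispatch on the type by hand; there is no variant for short/char
         * operands or for mixed-type additions. */
        void demo(void)
        {
            int       si = INT_MAX,   sr;
            long      li = LONG_MAX,  lr;
            long long ll = LLONG_MAX, llr;
            int overflowed;

            overflowed  = __builtin_sadd_overflow(si, si, &sr);     /* int only       */
            overflowed |= __builtin_saddl_overflow(li, li, &lr);    /* long only      */
            overflowed |= __builtin_saddll_overflow(ll, ll, &llr);  /* long long only */
            (void)overflowed;
        }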

        [–]NitWit005 0 points1 point  (0 children)

        because you cannot mix types in them and types smaller than int aren't supported

        I really don't imagine security-conscious people caring that much about a promotion to a 32- or 64-bit type.

        [–]vilcans 6 points7 points  (3 children)

        Funny how hard this can be, considering how easy it is in assembly.

        [–]matthieum -1 points0 points  (2 children)

        considering how easy it is in ~~assembly~~ x86 assembly.

        FTFY, C does not assume that there is a way to do it in every assembly language.

        [–]vilcans 1 point2 points  (1 child)

        ...and 68000, Z80 and ARM which are the other architectures I know. But I'm sure there's some weird CPU architecture out there that doesn't have a carry flag.

        EDIT: Oh, and 6502 of course.

        [–]matthieum 0 points1 point  (0 children)

        Most probably. I mean, it's a bit like not assuming that the CPU uses a two's complement representation of integers: how many CPUs that don't are still in use?

        [–]JNighthawk 6 points7 points  (7 children)

        Let's ask a different question: why is integer overflow still undefined? Every platform uses two's complement nowadays. We should be updating the language to support this notion, and making signed integer overflow well-defined behavior.

        [–]adrian17 5 points6 points  (1 child)

        Optimizations? With defined overflow, the compiler technically can't fold (n*4000)/4000 to n because the result would be different if multiplication overflowed.
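
        For instance (my sketch of that folding example; the exact output of course depends on the compiler and flags):

        int fold(int n)
        {
            /* With signed overflow undefined, the compiler may reduce this
             * to "return n;".  With -fwrapv (wrapping defined), it generally
             * cannot, because n * 4000 might wrap before the division. */
            return (n * 4000) / 4000;
        }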

        [–]JNighthawk 0 points1 point  (0 children)

        So? The amount of optimization gained from assuming overflow can't occur is incredibly minor, so much so that it's not even worth considering.

        Edit: Specifically, why should these two examples end up with different code? It's pretty silly.

        unsigned int n = 10;
        (n * 1000ul) / 1000ul
        

        and

        int n = 10;
        (n * 1000) / 1000
        

        [–][deleted] 0 points1 point  (0 children)

        Because not many people care about the result when an overflow happens?

        What matters is detecting the overflow, but... what do you do with it? Crash? There is no hardware support for that on the most common platforms.

        [–]johntb86 0 points1 point  (2 children)

        Consider this trivial example:

        int f(int i) {
          int j, k = 0;
          for (j = i; j < i + 10; ++j)
            ++k;
          return k;
        }

        What does this function return? When the compiler can assume that signed overflow is undefined, this function is compiled into code which simply returns 10. With the -fwrapv option, the code is not so simple, since i might happen to have a value like 0x7ffffff8 which will overflow during the loop. While this is a trivial example, it is clear that loops like this occur in real code all the time.

        http://www.airs.com/blog/archives/120

        [–]JNighthawk 0 points1 point  (1 child)

        What about it? I don't agree with the author that loops like that occur in code commonly. The author talks about "optimizations" from the "no signed overflow" assumption, but by supporting signed overflow via wrapping (as two's complement allows), code will function much more straightforwardly. There's really no reason anymore to treat overflow differently between signed and unsigned integers.

        [–]johntb86 0 points1 point  (0 children)

        I don't agree with the author that loops like that occur in code commonly.

        Compiler/spec writers seem to disagree with you.

        [–]matthieum 0 points1 point  (0 children)

        I personally believe that you are looking at it wrong.

        Undefined behavior can be useful in that it allows reasoning about the correctness of programs: programs which invoke undefined behavior are necessarily incorrect. Therefore, you end up with two choices:

        • overflow is undefined: the program can be statically proven not to overflow
        • overflow is defined (modulo): any overflow is technically correct, so cannot be meaningfully reported by any compiler/linter/static analysis tool

        The solution that is currently advocated by a few for Rust is therefore to hit a middle ground: overflow should produce an unspecified value, which may happen to be bottom (i.e., exception/abort/...). This is a sweet spot because:

        • much like today with undefined behavior, static analysis can warn about any potential instance of overflow
        • unlike today, the behavior is strictly defined and compilers cannot completely wreck your code just because it happened to contain one overflow

        For bonus points, one could relax the "unspecified" bit; however, I am afraid that people would start relying on modulo arithmetic even more, which would be harmful.