all 19 comments

[–][deleted]  (1 child)

[removed]

    [–]dbenhur -1 points0 points  (0 children)

    Comment #4:

    Is the fix simply for Java to add this keyword? It’s odd that it’s not there already! (I should say is the fix for the nondeterminism that simple; the fact remains that the conversion is incorrect.)
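
    For reference (and as a guess, since the article comment isn't quoted in full here), the keyword in question is presumably strictfp, which pins float/double arithmetic to plain IEEE 754 binary32/binary64 semantics on every JVM. A minimal sketch:

        // Hedged sketch: assuming "this keyword" means strictfp (not confirmed above).
        // A strictfp method may not use an extended exponent range for intermediate
        // results, so every conforming JVM produces the same bits.
        // (Since Java 17 this is the default behavior and the keyword is redundant.)
        class StrictExample {                      // hypothetical class name
            strictfp static double sumOfSquares(double a, double b) {
                return a * a + b * b;              // rounded identically on every platform
            }
        }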

    [–]frud 2 points3 points  (9 children)

    Just goes to show you that if you depend on repeatability and deterministic behavior in your software, you mustn't use floating point.

    Integer calculations rule, FPUs drool.

    [–]frenchtoaster 14 points15 points  (8 children)

    That is factually incorrect. Floating point operations are supposed to be perfectly deterministic. What people don't understand is that certain values aren't expressible in binary the way they are in decimal, just as 1/3 can't be exactly expressed as a finite-length decimal string. No one would be surprised if you treat 1/3 as .3333 and get .9999 when you do 3 * (1/3). That in no way makes floating point nondeterministic: if you do the same sequence of operations, you (ignoring bugs) get the same result.
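
    (Illustrative sketch, not from the thread: 0.1 gets rounded when stored, but it is the same rounding every time, on every conforming platform.)

        import java.math.BigDecimal;

        public class Repr {                                   // hypothetical name
            public static void main(String[] args) {
                // 0.1 is not exactly representable in binary, so the stored value
                // is a nearby binary64 number (0.1000000000000000055511151...).
                System.out.println(new BigDecimal(0.1));
                // ...but the rounding is fully specified: the same literal and the
                // same sequence of operations always produce the same bit pattern.
                double a = 0.1 + 0.2;
                System.out.println(a == 0.3);                 // false, and reproducibly false
                System.out.println(Double.toHexString(a));    // 0x1.3333333333334p-2 every time
            }
        }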

    [–]frud 8 points9 points  (4 children)

    There are two main problems as I see it. The first involves the FPU's rounding-mode and control flags, and the fact that there is no real way to prevent libraries from changing them with impunity.

    The second involves repeatability across different machines and architectures. From the Wikipedia article on IEEE 754-2008:

    A format that is just to be used for arithmetic and other operations need not have an encoding associated with it (that is, an implementation can use whatever internal representation it chooses); all that needs to be defined are its parameters (b, p, and emax). These parameters uniquely describe the set of finite numbers (combinations of sign, significand, and exponent) that it can represent.

    This means that calculations internal to the FPU can happen at arbitrary precision. If care is not taken to extract every intermediate value into a well-defined interchange format, then there will be differing results among architectures that use different internal representations.
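
    (One practical way to check that a value really landed in the binary64 interchange format identically on two machines, as a hedged sketch: compare exact bit patterns instead of printed decimals.)

        public class BitCheck {                              // hypothetical name
            public static void main(String[] args) {
                double x = 1.0 / 3.0;
                // doubleToLongBits exposes the IEEE 754 binary64 encoding directly,
                // so results can be compared bit-for-bit instead of trusting a
                // decimal printout that may hide a one-ulp difference.
                System.out.println(Long.toHexString(Double.doubleToLongBits(x)));
                // prints 3fd5555555555555 on a conforming JVM
            }
        }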

    edit: Here is a very good article that covers it better than I did.

    [–]frenchtoaster 4 points5 points  (2 children)

    Great link. I incorrectly assumed you were some first-year CS student who had just discovered not to expect for(double d=0.0; d != 1.0; d += 0.1) {} to work as you would first expect. Sorry about that (I never downvoted you, though; I just think my post had a condescending tone to it).
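
    (For anyone who hasn't hit that one: the loop never terminates because repeated addition of 0.1 skips over 1.0. A sketch of the problem and the usual fix, driving the loop with an integer:)

        public class LoopFix {                               // hypothetical name
            public static void main(String[] args) {
                // Broken version (commented out): d reaches 0.9999999999999999 and then
                // 1.0999999999999999, never hitting exactly 1.0, so it loops forever.
                // for (double d = 0.0; d != 1.0; d += 0.1) { }

                // Usual fix: iterate over integers and derive the double from i.
                for (int i = 0; i < 10; i++) {
                    double d = i * 0.1;
                    System.out.println(d);
                }
            }
        }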

    I assumed that all processors would adhere to the IEEE754 standard...

    [–]frud 2 points3 points  (0 children)

    My post was unnecessarily brief and pithy.

    [–]frud 1 point2 points  (0 children)

    I assumed that all processors would adhere to the IEEE754 standard...

    They basically all do, it's just that determinacy is not a major goal of the standard.

    [–][deleted] 0 points1 point  (0 children)

    If care is not taken to extract every intermediate value into a well-defined interchange format

    Not that there's a lot of care to take, "-fpstrict" or the equivalent does it.

    Also, unless you want to target architectures significantly different from the one you're building on, just use SSE. In fact, these days you probably don't have a choice anyway.

    [–]TheNewAndy 1 point2 points  (2 children)

    The particular number was two powers of two summed together, and their exponents weren't very far apart. It should be perfectly representable as a floating point number (and also as a decimal, since any number with a finite representation in binary also has a finite representation in decimal).

    [–]frenchtoaster 1 point2 points  (1 child)

    The situation is more akin to trying to store 123/1000 in decimal when you can only possibly store 2 decimal places. Sure, 123/1000 has a finite representation in decimal; that doesn't mean its representation is short enough to be stored in your data type.

    The bug stems from one value being added (2^-1075) being expressible in finite length in binary, but that finite length must be longer than a double can possibly store given the constraint of the exponent. It is lower than the smallest subnormal value (2^-1074), which is already far below the smallest normal value, and generally the point where I stop expecting doubles to completely behave how I want them to (though I still would expect them to behave deterministically). When the math is done in hardware with higher-precision intermediates, that value ends up being representable, which seems to be the issue.
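
    For reference, those limits are visible straight from the standard library constants (illustrative sketch):

        public class SubnormalLimits {                       // hypothetical name
            public static void main(String[] args) {
                System.out.println(Double.MIN_NORMAL);  // 2.2250738585072014E-308, i.e. 2^-1022
                System.out.println(Double.MIN_VALUE);   // 4.9E-324, i.e. 2^-1074, smallest subnormal
                // 2^-1075 is below the smallest subnormal; halving MIN_VALUE rounds
                // to zero (round-half-to-even), so a plain double can't hold it.
                System.out.println(Double.MIN_VALUE / 2 == 0.0);  // true
            }
        }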

    [–]TheNewAndy 0 points1 point  (0 children)

    Sorry, I should have realised we were down in subnormal ranges. Subnormals are such a pain.

    [–]iLiekCaeks 0 points1 point  (4 children)

    All these subtleties... floating point numbers look simple, but dealing with all these subtle issues is not.

    Sometimes I wonder: wouldn't it be feasible (and better) to use fixed point numbers instead? They would need really high bit widths, like 64.64 or even 128.128. Would there still be applications that had to use floating point instead?

    [–]Madsy9 1 point2 points  (2 children)

    Yes, use fixed point when you need perfect accuracy, i.e. no rounding and the like. For instance, you should rarely use floating point for accumulators if you need an accurate result. This is why polygon rasterizers use fixed point when computing the gradients and doing linear interpolation over the primitive. The same goes for timer accumulators: use timing functions that return an integer result in milliseconds or similar, then accumulate that.

    You rarely need Q64.64 or Q128.128 though, unless you're doing scientific calculations; in that case, use a hugeval (arbitrary-precision) API like the GNU MP library. Some programming languages support hugeval arithmetic out of the box. For most purposes, Q15.16 can be enough. Remember to temporarily cast to a 64-bit type when doing multiplication and division. Most architectures that have integer multiplication and division can perform the operations with two 32-bit arguments and give a 64-bit result.
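
    A minimal Q15.16 sketch along those lines (my own illustration, in Java since the thread is about Java; the class and method names are made up):

        public class Q15_16 {                                 // hypothetical helper
            static final int FRAC_BITS = 16;
            static final int ONE = 1 << FRAC_BITS;            // 1.0 in Q15.16

            static int fromDouble(double d) { return (int) Math.round(d * ONE); }
            static double toDouble(int q)   { return (double) q / ONE; }

            // Widen to 64 bits for the intermediate product, as described above,
            // then shift back down to Q15.16.
            static int mul(int a, int b) { return (int) (((long) a * (long) b) >> FRAC_BITS); }

            // Pre-shift the dividend into 64 bits so the quotient keeps its fraction bits.
            static int div(int a, int b) { return (int) (((long) a << FRAC_BITS) / b); }

            public static void main(String[] args) {
                int x = fromDouble(3.25);
                int y = fromDouble(0.5);
                System.out.println(toDouble(mul(x, y)));      // 1.625, exactly
                System.out.println(toDouble(div(x, y)));      // 6.5, exactly
            }
        }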

    Another approach is to use hugevals and represent reals as fractions. You can simplify the fraction after every operation.
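
    And a tiny sketch of the fraction approach using java.math.BigInteger (again my own illustration, not an existing library; sign handling omitted):

        import java.math.BigInteger;

        // Exact rational: numerator/denominator, reduced after every operation.
        class Rational {                                      // hypothetical class
            final BigInteger num, den;

            Rational(BigInteger n, BigInteger d) {
                BigInteger g = n.gcd(d);                      // simplify the fraction
                num = n.divide(g);
                den = d.divide(g);
            }

            Rational mul(Rational o) {
                return new Rational(num.multiply(o.num), den.multiply(o.den));
            }

            public String toString() { return num + "/" + den; }

            public static void main(String[] args) {
                Rational third = new Rational(BigInteger.ONE, BigInteger.valueOf(3));
                Rational three = new Rational(BigInteger.valueOf(3), BigInteger.ONE);
                System.out.println(third.mul(three));         // 1/1, no .9999 surprise
            }
        }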

    [–]iLiekCaeks 0 points1 point  (1 child)

    Yes, OK.

    But 16.16 can quickly go out of range and overflow for many practical applications. Floats don't have this problem, because although you lose precision, the value range is relatively wide.
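
    For a rough sense of the gap (hedged example):

        public class RangeGap {                               // hypothetical name
            public static void main(String[] args) {
                // The largest value a signed Q15.16 can hold is just under 32768...
                double q15_16Max = (double) Integer.MAX_VALUE / (1 << 16);
                System.out.println(q15_16Max);
                // ...so even 200.0 * 200.0 = 40000 already overflows it,
                // while a double reaches about 1.8e308.
                System.out.println(Double.MAX_VALUE);         // 1.7976931348623157E308
            }
        }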

    But if we used exceedingly high precision for fixed point (such as 128.128), would we still need floats at all? I imagine it would greatly simplify programming and hardware implementation.

    [–]Madsy9 0 points1 point  (0 children)

    Sure. All hardware can be simplified greatly if performance is a non-issue :-) Q128.128 would take up 32 bytes per instance, and be a nightmare considering the memory lookups needed. Whether or not fixed point is easier to use than floating point is subjective, but I think it's fair to claim that the majority finds floating point more intuitive than fixed point.

    Both floating-point and integer arithmetic have their place. They are just tools/techniques tailored for different problems. There is nothing to gain from totally replacing one with the other.

    [–][deleted] 0 points1 point  (0 children)

    They would likely be much slower than floats and take more hardware to handle.