> Floating-point users today are accustomed (or resigned, sometimes) to compilers that make invalid optimizations by assuming all arithmetic is mathematically correct instead of rounding.
Huh? Modern compilers don't do that. They will only reorder or re-associate operations if they know that it will produce the exact same result. Anything else is considered a severe compiler bug. The exception is if you explicitly ask for such sketchy optimizations with the `-ffast-math` flag, of which the GCC man page says: "This option is not turned on by any -O option besides -Ofast since it can result in incorrect output for programs." This option is off by default. The rest of the article is interesting, if correct – unfortunately, it's a little hard not to be skeptical when the first sentence is so flagrantly wrong.
Depends on what you're calling "invalid" I suppose.
> The exception is if you explicitly ask for such sketchy optimizations with the `-ffast-math` flag,
MSVC defaults to /fp:precise, which doesn't round doubles after every operation (you have to opt out of this optimization with /fp:strict). This can and does cause problems with, e.g., lockstep networking, where you need consistent results on both sides of the pipe, and one end is accumulating in 80-bit registers while the other is accumulating in 64-bit.
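For a concrete illustration (a hedged sketch, not specific to any one compiler), here is the kind of computation whose result can depend on whether intermediates are kept in an 80-bit x87 register or rounded to a 64-bit double at every step:

```cpp
#include <cstdio>

int main() {
    double big = 1e16;
    // In strict 64-bit double arithmetic, big + 1.0 rounds back to big
    // (the ulp at 1e16 is 2.0), so r is 0.0. If the compiler keeps the
    // intermediate in an 80-bit x87 register, big + 1.0 is representable
    // exactly there, and r comes out as 1.0.
    double r = (big + 1.0) - big;
    std::printf("r = %g\n", r);
}
```

Two machines disagreeing on r is exactly the kind of divergence that breaks lockstep simulation.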
Maybe you don't consider this "invalid", or don't consider the latest MSVC compiler "modern", but if either of these is your argument, I'd argue you're writing off the article over minor quibbles of semantics - not over it being "flagrantly wrong."
Specs have holes and ambiguities. And C++ in general likes to avoid guarantees to my frustration. In this case, we're not even guaranteed which specs apply.
In this case, the C++ standard references the C standard, which contains the (optional, normative) Annex F, which in turn references the IEC 60559 standard (which has the same content as IEEE 754). Annex F defines float and double in terms of the IEC 60559 single & double formats... and the standard C++ operators in terms of their IEC 60559 counterparts... and recommends that long double be implemented as an extended double format (such as the 80-bit format commonly used by x86).
Annex F also lists allowed optimizations and disallowed optimizations (involving various corner cases, rounding modes, implementation-defined signaling exceptions, etc. ad infinitum). I don't see temporarily treating a double as an IEC 60559 extended double format anywhere on the list of allowed optimizations, but I can't tell for certain whether it runs afoul of any of the specifically disallowed optimizations of Annex F either.
I'm guessing /fp:strict is meant to imply strict compliance with some standard, but MSDN talks about /fp:precise maintaining enough precision to be within "strict ANSI compliance" as well (under Annex F? Or perhaps they mean within Annex E's epsilon precision requirements...)
MSVC (2008-2013) doesn't appear to define __STDC_IEC_559__, which would definitively bind it to Annex F, but it does define std::numeric_limits<...>::is_iec559 = true for float/double/long double, which at least binds it to IEC 60559...
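A quick probe for checking both signals from inside a program (a sketch; the output will of course vary by compiler and target):

```cpp
#include <cstdio>
#include <limits>

int main() {
#ifdef __STDC_IEC_559__
    std::puts("__STDC_IEC_559__ defined: implementation claims Annex F conformance");
#else
    std::puts("__STDC_IEC_559__ not defined");
#endif
    std::printf("float:       is_iec559 = %d\n", std::numeric_limits<float>::is_iec559);
    std::printf("double:      is_iec559 = %d\n", std::numeric_limits<double>::is_iec559);
    std::printf("long double: is_iec559 = %d\n", std::numeric_limits<long double>::is_iec559);
}
```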
Plenty of modern compilers will optimize e.g. a/c + b/c to (a+b)/c when dealing with floating point. There is nothing in the C standard which makes that invalid.
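For what it's worth, the two forms are observably different. Here is one deliberately extreme sketch where the "optimized" version overflows and the original doesn't:

```cpp
#include <cstdio>

int main() {
    double a = 1e308, b = 1e308, c = 2.0;
    std::printf("a/c + b/c = %g\n", a / c + b / c); // 1e308: each quotient stays in range
    std::printf("(a+b)/c   = %g\n", (a + b) / c);   // inf: a + b overflows before the divide
}
```

Even away from overflow, the two forms perform different roundings, so they generally disagree in the last bits.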
> There is nothing in the C standard which makes that invalid.
C only permits¹ optimizations that do not change semantics, so changing the result of a floating-point expression is invalid. Example 5 on p16 of C11² makes this explicit: “Rearrangement for floating-point expressions is often restricted because of limitations in precision as well as range. The implementation cannot generally apply the mathematical associative rules for addition or multiplication, nor the distributive rule, because of roundoff error, even in the absence of overflow and underflow. Likewise, implementations cannot generally replace decimal constants in order to rearrange expressions.”
¹ Of course many compilers have options that invoke something other than standard-conforming C.
Thanks for that; I've purchased the C11 standard, but haven't gotten around to reading it, since I'm not likely to write anything in C11 for a few years yet.
60559 itself specifically allows value-changing optimizations, so long as they have not been disabled. I have seen more than one compiler that enables at least some value-changing optimizations with its equivalent of gcc's -O2.
Also, as far as I can tell, C99 (even with Annex F) does not follow parts of IEC 60559 with regards to:
> A language standard should also define, and require implementations to provide, attributes that allow and disallow value-changing optimizations, separately or collectively, for a block.
[edit]
And note that one example of a "value-changing optimization" is applying the associative and distributive rules. Fused multiply-add is another that is really common.
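A small sketch of fused multiply-add being value-changing: a fused operation does not round the product before the addition, so a term that separate rounding discards can survive.

```cpp
#include <cstdio>
#include <cmath>

int main() {
    double a = 1.0 + std::ldexp(1.0, -30);    // 1 + 2^-30, exactly representable
    double c = -(1.0 + std::ldexp(1.0, -29)); // -(1 + 2^-29), exactly representable
    // Mathematically a*a + c = 2^-60. Rounding a*a to a double first
    // discards the 2^-60 term; a single-rounding fma keeps it.
    std::printf("a*a + c      = %g\n", a * a + c);         // 0
    std::printf("fma(a, a, c) = %g\n", std::fma(a, a, c)); // ~8.67e-19
}
```

(Ironically, a compiler that contracts a*a + c into an fma on its own will print the same nonzero value twice, which is exactly the point.)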
Yes. Compilers at that time ran on machines with as little as 4K words of RAM in the whole computer. Everything that could be simplified in a single pass made a huge difference.
The "not grouped by parentheses" is already where things go wrong. Parentheses should determine the syntactic parse of the expression, not the order of evaluation. Optimization works with an abstract representation of the program where the parenthesis tokens are long gone.
That is to say, writing (a + b) + c should make no difference relative to a + b + c, at least if addition is already left-associative, because then a + b + c already means (a + b) + c; adding the parentheses doesn't change the parse whatsoever.
If parentheses are needed to establish the precise meaning, it means that the compiler writer has neglected to document, and commit to, an associativity for the operator; the compiler writer has not specified whether a + b + c means (a + b) + c or a + (b + c). That is a mistake: a gratuitous ambiguity in the syntax of the programming language.
If there is such a mistake in the design, a stray ambiguity, then you cannot call the optimization wrong. If it is not specified whether a + b + c means (a + b) + c, or a + (b + c) or something else like add3_in_any_order(a, b, c), then the compiler is in fact free to use any of the possible parses as a guide for the operation order.
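Whatever one thinks of the language-design question, reassociation is observable in floating point. A minimal sketch:

```cpp
#include <cstdio>

int main() {
    double a = 1e16, b = -1e16, c = 1.0;
    std::printf("(a + b) + c = %g\n", (a + b) + c); // 1: a + b is exactly 0
    std::printf("a + (b + c) = %g\n", a + (b + c)); // 0: b + c rounds back to -1e16
}
```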
Indeed: "The ANSI Fortran standard is less restrictive than the C standard: it requires the compiler to respect the order of evaluation specified by parentheses, but otherwise allows the compiler to reorder expressions as it sees fit."
That piece of text, if taken literally and perhaps out of context, means that if we have a * b + c, the Fortran compiler is allowed to add b + c first and then multiply by a, because there are no parentheses. It's just a question of whether the compiler writer "sees fit" to do such a thing.
In C, the order of operations is not ambiguous, and it is distinct from the evaluation order. In a * b + c, three operands a, b and c have to be evaluated (the variables reduced to the values they hold). This evaluation takes place in an unspecified order (possibly leading to undefined behavior). On the resulting values, however, the arithmetic order of operations is clear: multiplication takes precedence and so is done first.
The evaluation order makes a difference if the expressions a, b and c themselves have side effects.
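A sketch of the distinction: the multiplication below is always performed before the addition (precedence), but the three calls that produce the operands may run in any order the compiler chooses.

```cpp
#include <cstdio>

static int trace(const char *name, int v) {
    std::printf("evaluating %s\n", name); // side effect that reveals evaluation order
    return v;
}

int main() {
    // r is always 2*3 + 4 = 10, but "a", "b", "c" may print in any order.
    int r = trace("a", 2) * trace("b", 3) + trace("c", 4);
    std::printf("r = %d\n", r);
}
```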
If you want really accurate math, the best approach for a program is to maintain a representation of the expression for as long as it possibly can, since every time the math is condensed into actual numbers there is usually some loss of precision. It makes sense even for humans: we rarely write 0.33333 instead of 1/3 unless circumstance forces us to (or at least you shouldn't).
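A minimal sketch of that idea, using exact rationals instead of a full symbolic expression tree (the frac type and helpers here are hypothetical illustrations, not a real library):

```cpp
#include <cstdio>

// Keep 1/3 as the pair (1, 3) instead of the rounded 0.33333...
struct frac { long num, den; };

static long gcd(long a, long b) { return b ? gcd(b, a % b) : a; }

static frac add(frac x, frac y) {
    frac r = { x.num * y.den + y.num * x.den, x.den * y.den };
    long g = gcd(r.num < 0 ? -r.num : r.num, r.den);
    r.num /= g; r.den /= g;
    return r;
}

int main() {
    frac third = {1, 3};
    frac sum = add(add(third, third), third);
    std::printf("%ld/%ld\n", sum.num, sum.den); // exactly 1/1: no rounding ever happened
}
```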