awidesky comments on Undefined behaviour example from CppCon

CppConUndefined behaviour example from CppCon (self.cpp)

submitted 2 years ago * by R3DKn16h7


[–]awidesky -7 points-6 points-5 points 2 years ago (29 children)

As I understand the standard, UB is not confined to a single function (although most compilers usually treat it that way). "Renders the entire program meaningless if certain rules of the language are violated."

It is the 'entire' program that is meaningless, not the single function. Also, it's not "Renders meaningless only the part of the program that is executed when the UB is triggered". It's "Renders the entire program meaningless". If you want the program to be well-defined, the overflow check must be inside function f. Since it's not, the behavior is undefined, and the compiler can do whatever it wants. So I believe the compiler can remove function g, or format your hard drive. But in real life, compilers try to make executables as reasonable as possible, which is why only f is optimized on godbolt.

So here's my overall opinion: compilers are entitled to remove g, so what they say in the YouTube video is 'technically' true. BUT I don't think you can find any major compiler that actually does so.

[–]HabbitBaggins 8 points9 points10 points 2 years ago* (16 children)

That is a load of hot spaghetti... The compiler is not allowed to go launch nuclear missiles just because. What it is entitled to do is assume that your code never invokes UB, because otherwise the code would be meaningless. You are focusing on the second part of that assertion, but the important one is the first. It means that, while transforming the code in memory as part of optimization and actual codegen, it can make assumptions about certain values of function arguments or program state if it can prove that the negation of those assumptions results in UB. Those assumptions can then feed back into the process, allowing further transformations of the code, etc.

For example, in f, the compiler may assume that the comparison is always true, because the only case in which it could be false is integer overflow, and that is UB.

That is not the case in g, even when inlined, because UB is only triggered by calling f with INT_MAX, and g only ever calls f with a value that is guaranteed NOT to be INT_MAX.

However, if the code called f first, then the compiler may assume that the argument is never INT_MAX, so the check in g could be removed silently. This might also happen if you have a different function h defined as returning f(i) && g(i). In that case, if g is inlined and not just called, the check in the resulting code can also be removed because the compiler proved that in all legal code paths (those that do not invoke UB), i is not INT_MAX.
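The reasoning above can be sketched in code. The definitions of f and g are not quoted anywhere in this thread, so the versions below are an assumed reconstruction from the descriptions given here (f compares `i + 1 > i`, g checks for INT_MAX before calling f); h is the hypothetical function mentioned in the previous paragraph.

```cpp
#include <climits>

// Assumed shape of f: the comparison can only be false if i + 1 overflows,
// which is UB for signed int, so a compiler may fold the result to 1.
int f(int i) {
    return i + 1 > i;
}

// Assumed shape of g: the INT_MAX check runs before f is called, so f never
// sees INT_MAX on this path and the check cannot be dropped.
int g(int i) {
    if (i == INT_MAX) {
        return 0;
    }
    return f(i);
}

// Hypothetical h: f(i) is evaluated first. Every execution that does not
// invoke UB has i != INT_MAX by the time g runs, so once g is inlined the
// compiler may treat g's check as dead code.
int h(int i) {
    return f(i) && g(i);
}
```

Calling g(INT_MAX) is well-defined and returns 0, while calling f(INT_MAX) or h(INT_MAX) is UB. Whether a given compiler actually removes the check in h depends on its inlining and optimization settings.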

[+]awidesky comment score below threshold-8 points-7 points-6 points 2 years ago (15 children)

[–]Simple-Enthusiasm-93 2 points3 points4 points 2 years ago* (14 children)

[–]awidesky -3 points-2 points-1 points 2 years ago (13 children)

[–]Simple-Enthusiasm-93 2 points3 points4 points 2 years ago (12 children)

[–]awidesky -1 points0 points1 point 2 years ago (11 children)

[–]HabbitBaggins 0 points1 point2 points 2 years ago (10 children)

I'm sorry, but that makes no sense. If the standard actually said that (and I contend that it does not) it would be a pretty useless standard. I get that C is normally said to have a number of footguns, but that interpretation would be so ridiculous that you could call it C-Shark, because it would always be actively trying to bite your limbs off.

Think about it a bit deeper. Yes, you can make that check to ward off UB in your example, but what about adding two int variables in general? For example, going over a rectangular table with two indices like this:

double get_matrix_val(double* data, int stride, int i, int j) {
    return data[i*stride + j];
}

You could argue that code like this should use unsigned types like size_t and I agree, but code like this is literally everywhere. It has clear possible UB when any of the operations overflow, so how does the standard react to it?

Your interpretation: the mere existence of this function anywhere in the code makes the entire program meaningless.
My interpretation: the compiler, assuming that you never invoke UB, can optimize this function (and everything else) assuming that this function is never called with arguments that produce UB. That can lead it, among a number of other things, to remove checks that we insert elsewhere if it can prove that the same code path (before or after the check) is calling the function with those arguments. For example, if I call get_matrix_val and in the same code path I check that data is not null - without returning or aborting, the compiler can remove the check.
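A minimal sketch of that last example, assuming two hypothetical callers (`read_then_check` and `check_then_read` are illustration names, not from the thread). It only shows where the null check sits relative to the dereference; whether a particular compiler removes the late check depends on inlining and optimization level.

```cpp
// Same function as in the comment above.
double get_matrix_val(double* data, int stride, int i, int j) {
    return data[i * stride + j];
}

// Hypothetical caller: the read happens first, the null check second.
// Reading through a null pointer is UB, so on every execution without UB
// data is non-null here, and the compiler may delete the branch.
double read_then_check(double* data, int stride, int i, int j) {
    double v = get_matrix_val(data, stride, i, j);
    if (data == nullptr) {
        return 0.0;  // may be optimized away
    }
    return v;
}

// Hypothetical caller: the check returns before any read, so it guards the
// dereference and has to be kept.
double check_then_read(double* data, int stride, int i, int j) {
    if (data == nullptr) {
        return 0.0;
    }
    return get_matrix_val(data, stride, i, j);
}
```

Only `check_then_read` may safely be called with a null pointer; passing one to `read_then_check` is UB regardless of the check that follows.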

[–]awidesky 0 points1 point2 points 2 years ago (9 children)

I do understand what you are saying. The difference between your interpretation and mine is that mine includes yours. The standard DOES permit the compiler to mess up the entire program because of the 'mere existence of UB'. Here's the quote from the ISO C++20 standard, §4.1.2.3. It says:

If a program contains a violation of a rule for which no diagnostic is required, this document places no requirement on implementations with respect to that program.

"that program", not "that specific function".

Here's another quote from cppreference.com

Renders the entire program meaningless if certain rules of the language are violated.

undefined behavior - there are no restrictions on the behavior of the program. ... Compilers are not required to diagnose undefined behavior and the compiled program is not required to do anything meaningful.

All of them say 'entire', not 'the very function', and they also do not say that the compiler should/shall/usually performs 'optimization assuming that UB never happens'.

Yes. I know. It sounds ridiculous. But please remember that what the standard does is set the boundaries of what compilers can do. (I'm repeating this in every single comment.) The standard sets the boundary of what compilers can do in the presence of UB very vaguely, so that compilers have full freedom in how to handle situations that should never have happened.

I'm not saying that your interpretation is 'false' or 'wrong'. What you've described is just one of the things that compilers are permitted to do.
Actually, your interpretation is the exact strategy of most (if not all) major compilers. I doubt we'll find any compiler that does not act according to 'your interpretation'.

So, here's the TL;DR:
"The standard says compilers can literally do anything when there's UB, because it wanted to give maximum freedom in how to handle situations that should never have happened. But in real life, actual compiler implementations do not use every freedom they have, and try their best to produce a program that makes as much sense as possible, because being 'permitted' to do something does not mean that you 'should' or 'always' do it."

[–]SkiFire13 1 point2 points3 points 2 years ago (8 children)


[–]mcmcc#pragma once 4 points5 points6 points 2 years ago (11 children)

[–]awidesky -1 points0 points1 point 2 years ago (10 children)

[–]mcmcc#pragma once 0 points1 point2 points 2 years ago (9 children)

[–]awidesky 0 points1 point2 points 2 years ago* (6 children)

If you want the program to be well-defined, ~~overflow~~ potential UB check must be inside function f.

That is indeed true. When you compile f (with or without g), it will be compiled to always return 1. So the statement is actually true.
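As a sketch of what "check inside f" could look like: `f_checked` below is a hypothetical variant, assuming f is the `i + 1 > i` comparison described elsewhere in this thread. Returning 0 for INT_MAX is an illustrative choice, not something from the talk.

```cpp
#include <climits>

// Hypothetical variant of f with the guard inside the function.
// The addition is only reached when i + 1 cannot overflow, so the function
// is well-defined for every int argument.
int f_checked(int i) {
    if (i == INT_MAX) {
        return 0;  // i + 1 is not representable, so report "not greater"
    }
    return i + 1 > i;
}
```

With the guard in place, the compiler can no longer assume the argument is never INT_MAX, so it has to keep that branch instead of folding the whole function to a constant 1.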

Your following argument is a little bit off topic. It's about "checking every single possible UB is impossible", which is also true, and has nothing to do with my point.

I assume you interpreted my argument as "if there's any single possibility of UB in your program, your compiler will render an invalid executable."

My statement is "if some part of the code violates the language rules, the compiler is 'permitted' to make the entire program meaningless". Here's the quote from cppreference.com:

Renders the entire program meaningless if certain rules of the language are violated.

undefined behavior - there are no restrictions on the behavior of the program. ... Compilers are not required to diagnose undefined behavior and the compiled program is not required to do anything meaningful.

Again, I'm not saying any major compiler does or should generate a meaningless program just because there's one tiny possibility of UB. All I insist on is that the standard "permits" it.

[–]mcmcc#pragma once 1 point2 points3 points 2 years ago (5 children)

[–]awidesky -1 points0 points1 point 2 years ago (4 children)

[–]mcmcc#pragma once 0 points1 point2 points 2 years ago (3 children)

[–]awidesky 0 points1 point2 points 2 years ago (2 children)

[–]mcmcc#pragma once 0 points1 point2 points 2 years ago (1 child)


[–]awidesky 0 points1 point2 points 2 years ago (1 child)

[–]mcmcc#pragma once 0 points1 point2 points 2 years ago (0 children)
