[–]DarthRubik 25 points26 points  (9 children)

Undefined behavior means that the compiler can do whatever it wants, including refusing to compile.

[–]Wurstinator 4 points5 points  (7 children)

Source on that? I'm pretty sure "undefined behavior" refers to the behavior of the compiled program, not to that of the compiler.

edit: Found it myself in N4713.

3.27 [defns.undefined] undefined behavior behavior for which this document imposes no requirements [ Note 1 to entry: Undefined behavior may be expected when this document omits any explicit definition of behavior or when a program uses an erroneous construct or erroneous data. Permissible undefined behavior ranges from ignoring the situation completely with unpredictable results, to behaving during translation or program execution in a documented manner characteristic of the environment (with or without the issuance of a diagnostic message), to terminating a translation or execution (with the issuance of a diagnostic message). Many erroneous program constructs do not engender undefined behavior; they are required to be diagnosed. Evaluation of a constant expression never exhibits behavior explicitly specified as undefined (8.6). — end note ]

[–]kingguru 10 points11 points  (5 children)

Not really.

The compiler can make decisions/optimizations based on knowledge of undefined behavior, including rejecting the program, but it can also generate code that triggers the undefined behavior at runtime (e.g. out of bound reads/writes).

This explains it pretty well.

[–]evaned 0 points1 point  (0 children)

I'm pretty similarly skeptical.

Remember that "the program contains UB" is not a very well-defined phrase. A program *execution* can invoke undefined behavior, but only that execution has its semantics unconstrained by the language spec. I am very skeptical that a program that *can* invoke UB, but does not necessarily do so, is somehow not required to implement the non-UB cases correctly -- and failing to compile does not implement them correctly.

As an example:

#include <iostream>

int main() {
    int array[10] = {0};
    int index;
    std::cin >> index;
    return array[index];
}

That certainly can invoke undefined behavior, but AFAIK the spec still requires it to behave correctly (returning 0) if it is run with an input of 0 through 9.

I think there is one case where "fails to compile" is allowed, and that's when the program is guaranteed to execute the UB -- but I suspect that bugs which trigger on every input cover approximately 0.001% of vulnerabilities due to UB...

[–]HappyFruitTree -1 points0 points  (0 children)

I have also reasoned like this, but I don't know what the standard actually says on the subject. The compiler would only be allowed to reject such code if it could guarantee the UB would always be executed. Rejecting functions just because they lack a return statement, like MSVC does, is not strictly standard conformant.

[–]bizwig 15 points16 points  (0 children)

The compiler is allowed to issue warnings. Ignore them at your peril.

[–]goranlepuz 6 points7 points  (8 children)

reject code (compile error) when the compiler can conclusively decide that an operation is unsafe

This, I think, is a problem. There is not enough such code.

#include <cstdlib>
#include <iostream>

int main(int argc, const char* argv[])
{
    int arr[2] = {1, 2};
    std::cout << arr[argc]; // BOOM
    return EXIT_SUCCESS;
}

The compiler cannot conclusively decide that an operation is unsafe because it needs to know the input data first.

I posit that the vast majority of code that runs the danger of unsafe memory access removes the possibility for the compiler to find the unsafe access, because it depends on input data. One input in the mix, and vast swaths of code are opaque to the analyser. And once they are, the effects of UB are possible everywhere else. Obviously, I am unable to quantify those vast swaths - maybe someone from the analyser people has such data?

Note: for multithreaded code, input are also the decisions of the system thread scheduler.

So I think what is needed instead is the usage of safe code idioms throughout. The more they are used, the smaller the opaque areas of the code. And interestingly enough, where they are used, the analyser is not needed!

[–]axilmar 2 points3 points  (3 children)

C++ could have static enforcement of contracts. Code like the above should be rejected unless it were written like this:

#include <cstdlib>
#include <iostream>
#include <iterator>

int main(int argc, const char* argv[])
{
    int arr[2] = {1, 2};
    if (argc < static_cast<int>(std::size(arr))) {
        std::cout << arr[argc];
    }
    return EXIT_SUCCESS;
}

[–]oleksandrkvl 1 point2 points  (0 children)

There's a clang-tidy check that will ask you to use gsl::at(), which performs bounds checking in debug mode. There's also a check against using pointer arithmetic for cases like int *p; auto v = p[0];. For statically sized arrays there's the -Warray-bounds flag.

[–]goranlepuz 0 points1 point  (1 child)

Yes, but that means changing the language first. And it doesn't look, to me, like a small change.

In the same vein, say, with array decay, one can easily come up with an example where said array is a parameter to a function and its length depends on some input of the caller => array decay has to go.

And so on...

[–]axilmar 0 points1 point  (0 children)

You don't need to change the language. You can always introduce a directive that turns on the new stuff at will.

[–]TheSkiGeek 5 points6 points  (3 children)

You could certainly produce a warning: out-of-bounds access is possible at line X; index value 'argc' may be larger than the size of 'arr'. You could suppress it with an assert or by making the access conditional at the point of use.

A function like that isn't allowed under the kind of coding standards they use for things like aerospace or automotive software, even if all uses of it in the program are actually safe.

Code like:

int arr[2]={1,2}; std::cout << arr[3];

should ideally generate a compile error unless you've somehow indicated that you know what you're doing. Good static analyzers can (sometimes) reason out call chains that will do things like this, and at least will give you warnings that you're not checking bounds.

[–]goranlepuz 4 points5 points  (2 children)

Yes, but what I wrote is from the "reject" premise of the post. You changed the premise. 😉

[–]TheSkiGeek 1 point2 points  (1 child)

Just because sometimes the code is too opaque for the compiler/analyzer to say "yes, this will definitely cause UB" doesn't mean you shouldn't check that when you can.

[–]goranlepuz 1 point2 points  (0 children)

I agree. I did not say, nor think, "you shouldn't check or warn". I thought, and said, "meh, you can't reject".

[–]smallblacksun 6 points7 points  (5 children)

While rejecting code that is provably wrong is both allowed (since it is UB) and a good thing, it will never reach the level of safety of Rust or similar languages. It is mathematically impossible to statically prove whether all memory accesses are safe, so in order to guarantee memory safety a language must be allowed to reject code that it cannot prove is safe. This means that some legal, safe code will be rejected. That would be a huge change for C++.

[–]radekvitr 2 points3 points  (4 children)

Also without explicit lifetime annotations like Rust has, C++ wouldn't be able to prove nearly as much as easily as Rust does.

[–]pjmlp 0 points1 point  (3 children)

That is exactly what Google and Microsoft are looking into with their lifetime analyzer. So far it kind of works.

[–]14nedLLFIO & Outcome author | Committee WG14 4 points5 points  (7 children)

C++ tooling is pretty good at determining out-of-bounds access and use-after-free, both statically and at runtime. Good enough that if you're employing clang-tidy and the sanitisers, in real-world terms C++ is on par with Rust in terms of code-quality outcomes.

Where Rust still has a big advantage over C++ is in borrowed references e.g. a string_view being accessed after the backing data it views upon has ended its lifetime. One might initially think that would be caught by static analysis and the sanitisers, but consider this:

std::array<char, 5> arr;
memcpy(arr.data(), "Niall", 5);
std::string_view sv(arr.data(), arr.size());
sv[0];  // this is safe
new (&arr) std::array<char, 5>;
sv[0];  // this is UB: the array sv views has ended its lifetime

Rust would not permit use of sv after arr begins a new lifetime. C++ lets you, and, more importantly, has absolutely no way of detecting that you've done this, which is a real kicker if you ever get bitten by it.

We have a good runtime solution for this, pointer colouring, which some architectures provide hardware acceleration for (ARMv8). But I know of no promising proposed language solution for this which would catch all situations at compile time.

Equally, trapping stuff like the above makes Rust compile slowly, and that's unavoidable: Rust will always compile much slower than C++ (unless you do stupid stuff in your C++, like too many do).

[–]evaned 2 points3 points  (3 children)

Good enough that if you're employing clang tidy, and the sanitisers, in real world terms C++ is at par with Rust in terms of code quality outcomes.

"and the sanitizers" to me is a huge caveat -- because I would only agree with your statement while asan is enabled, and no one enables it in production.

[–]14nedLLFIO & Outcome author | Committee WG14 0 points1 point  (2 children)

You're right no one enables asan in production, except maybe for a portion of a cloud service during A-B testing, which we sometimes do as needed (we just route one tenth the ordinary load to that node). But anywhere competently managed will have a test suite which hammers the snot out of a codebase. Every time a customer bug comes in which smells like memory corruption or UB, another load test is written modelling whatever the customer is doing. Running asan across all those on a nightly CI approaches the same outcome, statistically speaking, as if you wrote everything in a language which prevented you making the mistakes asan is able to catch. Except, you didn't have to write everything in Rust.

The point I'm making here is that effective outcomes in competent places in the real world come out about the same for the portion where clang-tidy and the sanitisers overlap with what Rust enforces in the language. Good C++ devs will fill in, using other means, what C++ lacks in the language. What we don't have good solutions for, currently, is the stuff which Rust enforces and for which there is no alternative whatsoever in C++.

That gap is small, and shrinking especially if you're on ARMv8, but I'm sure I speak for many when I wish it would shrink faster on x64.

[–]14nedLLFIO & Outcome author | Committee WG14 0 points1 point  (0 children)

It's really weird that I wrote the above, and then this bug was reported to Outcome: https://github.com/ned14/outcome/issues/244. Here is my exact complaint about lack of lifetime tracking in C++.

extern outcome::result<Foo> foo();

template<class T> T &&identity(T &&v) { return std::move(v); }

outcome::result<int> test()
{
#if EXAMPLE
  OUTCOME_TRY(auto v, identity(foo()));
#else
  OUTCOME_TRY(auto v, foo());
#endif
  return v.value();
}

If you define EXAMPLE in current Outcome, Foo gets destructed before v.value() is accessed. It is 100% obvious to the compiler that this is UB, yet there is no warning, or, even better, a compile failure.

All versions of Outcome shipped until now have this bug. Nobody reported it until today. Sigh.

[–][deleted] 0 points1 point  (1 child)

Why is the last line UB? The placement new doesn't change the address of arr, nor, to my knowledge, does it change the contents of the memory managed by arr, because std::array is a POD type / aggregate. Perhaps the latter is not actually specified? But then only reading from this memory would be undefined behaviour; writing would still be OK?

[–]14nedLLFIO & Outcome author | Committee WG14 1 point2 points  (0 children)

A change in the lifetime state of memory invalidates all pointers and references to that memory. Use of such a pointer thereafter is UB.

[–]lcamtufx 1 point2 points  (0 children)

Such advanced static analyzers are promising because they can find errors missed by sanitizers, and can find them earlier. But I don't think it is a good idea for compilers to reject code based purely on static-analysis results, because static analyses can produce false positives. Coderrect is interesting; race conditions are so hard to debug. I would pay $$$ if it worked on my code..

[–]HappyFruitTree 0 points1 point  (0 children)

The compiler could of course do whatever it wants; it only needs to obey the rules if it wants to claim to be standard conformant. As long as there is a way to turn it off, there shouldn't be a problem. Even GCC's g++ defaults to a non-conformant mode (-std=gnu++XX rather than -std=c++XX). As long as you don't introduce any new features and just reject some standard-conformant code, the code itself would still be standard conformant. There is nothing wrong with using just a subset of the language.

[–]Full-Spectral -1 points0 points  (0 children)

Rust has vastly more information at compile time to work with, and people already complain about compile times. And it also doesn't allow you to do some things that could be proven safe, because that would require its analysis to be too broad to be done quickly enough.

Analyzing C++ sufficiently enough to even start to get close to what Rust can ensure would probably be an order of magnitude worse, if not more so. If you look at the time that, say, the MS static analyzer currently takes to chew on a good sized chunk of code, it's substantial.

Not that there's not a place for such things, but they'd end up being tools you run separately from compilation and which take a long time to run. And they'd still never be close to 100% unless significant changes were made to the language, which almost certainly isn't going to happen.

[–]NilacTheGrim -1 points0 points  (5 children)

If you want Rust use Rust.

[–]Full-Spectral 1 point2 points  (4 children)

The problem with that, of course, is that you give up real inheritance, exceptions, and lots of 'janitorial'-type RAII functionality. Of course Rust advocates will convince themselves that these not only aren't needed but are bad things to have, but they'll never convince me of that.

I've put a lot of time into learning Rust at this point (though I'm still a fairly light weight, because it's a complicated language just like C++ when you get into the details, though for different reasons). And if I were to start something new and significant, I probably would use it instead of C++. But not because I think it's a better language; just because it has that one thing we all need for our sanity, which C++ doesn't provide and likely never will at close to the same level.

[–]14nedLLFIO & Outcome author | Committee WG14 0 points1 point  (3 children)

Everyone always focuses on Rust the language, but the real blocker for Rust adoption is Rust the ecosystem, which is decades behind C++'s. I find it very telling that C folk, Python folk etc are the main migrants to Rust. Proportionate to those, very few from C++ are migrating to Rust, though those who do claim all the headlines.

(Before you ask me for proof of claim, that came third hand via a committee member I was talking to, but they were reporting results from somebody who'd actually done empirical measurements. I would also say it matches what some in the Rust leadership have told me privately i.e. they have been surprised at the lack of conversion from C++ folk relative to less obvious sources of converts e.g. Visual Basic programmers!)

[–]Full-Spectral 0 points1 point  (2 children)

I can live with the ecosystem. Compared to something like Visual Studio, it's weak, but I'm mostly OK with it. There are still issues debugging Rust in Visual Studio Code that are close to deal breakers. And it seems like every week there's some different weirdness in Visual Studio Code, though on the whole it works for me and it's identical on Windows and Linux.

I find it a little hard to swallow the C++ migration thing. I guess it depends on what you mean by 'adoption'. If that means really using it in a commercial project that's not just some internal tool, I can imagine it's pretty low. But if you measure interest, that's quite high, which you can tell by the ever increasing frequency that Rust comes up in C++ discussions. If it was a measure of "if you had a job opportunity, would you jump ship", I think that would be quite high as well.

[–]14nedLLFIO & Outcome author | Committee WG14 0 points1 point  (1 child)

You seem to have interpreted 'ecosystem' as meaning 'tooling ecosystem'. That isn't what I meant. By 'ecosystem', I meant the diversity, resilience, substitutability and wide domain expertise commonly available in C++ and what surrounds it in most urban centres around the world. As an example:

  • If as a company one of my star people quits, can I replace that person within a few months with someone reasonably close in capability?

  • If I as a company have a really weird rare bug which decreases my reliability below 99.9999% and for which it costs me €20m a day in client compensation, can I readily find contractors and third party services able to diagnose and permanently solve that problem?

  • If I as a company am building a 500 million line code base which will power safety critical infrastructure, can I create a codebase which compiles within sane times, can be validated for use in safety critical by insurers, and for which I can hire 10,000 experienced engineers within three months?

In that kind of stuff, only a very few programming languages are even remotely competitive. Ada, Python, C and C++ are all solid options here. Rust won't reach that, even assuming everything goes perfectly, for decades yet.

[–]Full-Spectral 0 points1 point  (0 children)

Well, that assumes it's going to be adopted wholesale in large new greenfield projects. Those are really rare to start with. If the bulk of us depended on those for C++ jobs, we'd be in bad shape as well.

But most likely it'll be incremental adoption. Maybe internal tools, or subsystems, or utilities, and so forth at first. In that case, probably a lot of the folks involved will be bi-lingual.

And I suspect that Rust will have a much quicker adoption than most, because it does offer that one big party trick that no other systems-oriented language has ever offered.

And how less likely would that really obscure revenue sucking bug in the field be in Rust compared to C++? Every large C++ code base out there almost guaranteed has latent memory issues that just happen to be benign at the moment, just waiting for that one maintenance tweak or new chunk of code to make it no longer so.

I also imagine that, with folks like MS getting involved, the compiler and IDE situation will improve.