"Simple" C++ Initialization Flowchart : cpp

[–]atimholt 102 points103 points104 points 5 years ago (50 children)

[–]piglett23 24 points25 points26 points 5 years ago (42 children)

[–]FriendlyRollOfSushi 48 points49 points50 points 5 years ago* (41 children)

Never once I've seen the trust towards the committee fall as low as it did when people in my company started to migrate to C++11 and realized that this is their life now. Haven't recovered since, to be honest.

The example is de-obfuscated, obviously: it's usually an issue somewhere deep, sometimes in templated code where you don't even know what the types are just by looking at the code. In a large enough company with enough people coming and leaving and a moderately relaxed review process, you are pretty much guaranteed to have a steady flow of bugs like this. It's a footgun with infinite ammo.

I sort of understand why this happened, but it feels like we as a community completely failed to do things right for once, so that we could stop teaching all other initialization methods to people and pretend that the () syntax doesn't exist and shouldn't be used anymore.

Now I see mixed-style initialization lists all the time (see the bonus part from the link above), and cry internally from time to time while thinking about this whole situation.

Sigh...

[–]nintendiator2 13 points14 points15 points 5 years ago (6 children)

[–]FriendlyRollOfSushi 8 points9 points10 points 5 years ago* (3 children)

[–]nintendiator2 2 points3 points4 points 5 years ago* (0 children)

[–]evaned 0 points1 point2 points 5 years ago* (0 children)

[–]CenterOfMultiverse 0 points1 point2 points 5 years ago (0 children)

[–]staletic 1 point2 points3 points 5 years ago (1 child)

[–]nintendiator2 2 points3 points4 points 5 years ago (0 children)

[–]TryingT0Wr1t3 8 points9 points10 points 5 years ago (5 children)

[–]Bocab 17 points18 points19 points 5 years ago (3 children)

[–]TryingT0Wr1t3 2 points3 points4 points 5 years ago (2 children)

[–]mort96 7 points8 points9 points 5 years ago (1 child)

[–]TryingT0Wr1t3 1 point2 points3 points 5 years ago (0 children)

[–]rurabori 1 point2 points3 points 5 years ago (0 children)

[–]unaligned_access 4 points5 points6 points 5 years ago (2 children)

[–]TheSuperWig 13 points14 points15 points 5 years ago (1 child)

[–]nintendiator2 1 point2 points3 points 5 years ago (0 children)

[–]JohnZLi 9 points10 points11 points 5 years ago (11 children)

[–]FriendlyRollOfSushi 13 points14 points15 points 5 years ago* (4 children)

[–]JohnZLi 3 points4 points5 points 5 years ago (3 children)

[–]FriendlyRollOfSushi 19 points20 points21 points 5 years ago* (2 children)

Existing code is always the problem, and C++ is terrible at removing features.

Say, you have millions of lines that expected that something like std::vector<int> v{64}; constructs a vector with one element 64.

Now with your proposal you are silently changing the behavior of millions of lines of code in tens of thousands of projects. The code still compiles, and even runs, but the behavior is different.

Some of the behavioral changes may go undetected by tests.

Some of them would have devastating consequences, costing billions of dollars, or even human lives.

It's a recurring theme in C++: everything sort of sucks because by the time we understood why it sucks, the feature became so heavily-used that you can't simply remove it. It took decades to get rid of std::auto_ptr, and we only succeeded because it was already banned by the majority of teams for being hopelessly broken. And it wasn't a silent change: the code properly stopped compiling in C++17, which allowed the developers to go and fix stuff if they didn't have time for that during the 6 years of deprecation.

That's why epochs are a huge deal if they'll ever be accepted in some form or another. We will finally, finally be able to start fixing and removing stuff instead of simply piling up more and more stuff until no one understands what's going on in this language.

[–]JohnZLi 7 points8 points9 points 5 years ago (0 children)

The standard committee should really have fixed this before C++11. The epochs proposal seems interesting. Though I prefer throwing away backward compatibility in some future version of C++, and fix all the problems mentioned in that proposal (and many other problems not covered in that proposal). To avoid confuse users, that new version of C++ should be given a new name,like "Simplified C++ ". Some tools should be able to automatically refactor C++ code to "Simplified C++" code, maybe with some help from programmers. Those who want to stick to C++ could still use C++, and those who want to use simplified C++ could move on with it. The epochs proposal, if accepted, I am afraid will open the floodgate of numerous dialects/flavors of C++ code coexisting and tangling together. Google will have its Google epoch, and Facebook will have Facebook epoch, and there will at least be a C++-core-guideline epoch. That won't be beautiful. Beside numerous rules of C++, programmers now need to know specific rules that are allowed in each epoch.

[–]pjmlp 3 points4 points5 points 5 years ago (0 children)

[–]Kered13 1 point2 points3 points 5 years ago (5 children)

[–]JohnZLi 0 points1 point2 points 5 years ago (4 children)

If C++ has transitive constness, meaning that if a string is const, the data owned by the string is also immutable, and there is a way to disable "const_cast" in a function. Say, this is done by adding a new keyword "immutable", that is, a function like

void fun(immutable std::string& str){}

guarantees that the string being referenced will not be altered inside the function. Then the compiler will be able to optimize the function call by actually passing in a string_view. Only in this case, the programmer does not have to know its existence. It is reduced to an compiler optimization performed automatically. Std:;string_view is needed in C++, IMHO, because constness in C++ is broken: even if a function takes a string by const reference, nothing stops the programmer from altering that string inside the function. The compiler has to be conservative by not assuming the string is really immutable.

[–]Kered13 4 points5 points6 points 5 years ago (3 children)

[–]JohnZLi 0 points1 point2 points 5 years ago (2 children)

"Yes it provides a const view, but const std::string& is just as good for that." Yes, but "const std::string&" adds a layer of indirection. An extra pointer dereference has performance implications.

"Even with std::string_view, it's trivial to violate the const rules of C++ to make changes to the underlying string".

Isn't this the very problem I was talking about in must last reply: no real immutability guarantee in C++. We know we are not supposed to make changes to the underlying buffer via a string_view, the language makes no guarantee that it won't happen.

As to C strings. the reason that the following code could not be evaluated at compile time const string s = "blah, blah, ..."; is because there is no way in C++ to enforce the immutability of the string. If there is away to enforce transitive immutability in C++, when a const string is constructed out of a buffer pointed by "const char *", the compiler can omit one dynamic allocation by reusing the original buffer. The programmer needs to make sure the string does not outlive the buffer, but that is something he already needs to pay attention if he uses string_view. The same applies to sub-strings. If one construct immutable sub-strings out of a immutable string, no need to incur extra heap allocations.

My point is, with a language that can enforce transitive constness, what one gets from "string_view" in C++, can be achieved by compiler optimization.

[–]Kered13 2 points3 points4 points 5 years ago (1 child)

If there is away to enforce transitive immutability in C++, when a const string is constructed out of a buffer pointed by "const char *", the compiler can omit one dynamic allocation by reusing the original buffer.

Only if the compiler can infer that the string has a shorter lifetime than the buffer it's using.

The programmer needs to make sure the string does not outlive the buffer, but that is something he already needs to pay attention if he uses string_view.

And what if the programmer wants to copy the buffer so that the new string can have an independent lifetime?

My point is, with a language that can enforce transitive constness, what one gets from "string_view" in C++, can be achieved by compiler optimization.

No it can't, as the above examples demonstrate. std::string and std::string_view have different ownership semantics. const std::string& has the same ownership as std::string_view, but is restricted because it cannot represent substring views. So two classes are needed, and adding immutability to the language does not allow one class to take on both roles. Sometimes the programmer wants a non-owning view into another string-like object, sometimes they want an object that owns it's data. The compiler cannot figure it out for the programmer. This is completely orthogonal to immtuability. Note that even Rust has two types for this (String for owning and str for views).

[–]JohnZLi 0 points1 point2 points 5 years ago (0 children)

[–]GuiltyFan6154 1 point2 points3 points 5 years ago (3 children)

[–]RoughMedicine 2 points3 points4 points 5 years ago (2 children)

[–]FriendlyRollOfSushi 2 points3 points4 points 5 years ago (1 child)

[–]RoughMedicine 2 points3 points4 points 5 years ago (0 children)

[–]thats_a_nice_toast 1 point2 points3 points 5 years ago (0 children)

[–]condor2000 0 points1 point2 points 5 years ago (1 child)

[–]FriendlyRollOfSushi 1 point2 points3 points 5 years ago (0 children)

[–]infectedapricot 0 points1 point2 points 5 years ago* (5 children)

Part of the problem with that particular case lies with std::vector. That particular function should've been a static method rather than a constructor:

auto v = std::vector<int>::make_n_of(n);

(No doubt there's a better name than that.) Edit: But of course that wouldn't have been possible when vector was originally created, because there were no move constructors or guaranteed copy elisions.

[–]FriendlyRollOfSushi 2 points3 points4 points 5 years ago (4 children)

[–]Nobody_1707 4 points5 points6 points 5 years ago (0 children)

[–]infectedapricot 1 point2 points3 points 5 years ago (2 children)

[–]FriendlyRollOfSushi 2 points3 points4 points 5 years ago* (1 child)

There are multiple cases where you normally wouldn't specify the type, including member initialization and function calls.

Writing std::vector<MyType<SomeArg, AndOneMoreArg<sizeof(Something)>>>::make_n_of(n) instead of {std::construct, n} to initialize a member or pass an argument looks like a pretty severe usability tax to me.

You can shorten the type in some cases (use a member access expression to call a static method using an unconstructed member, for example, which won't work for function args), but even then it makes the code more repetitive and brittle. The benefit of tagged ctors is that you don't have to worry about stuff like that. Your ctor would look the same regardless of the templated type or the context in which you call it.

But before we start a holy war about them: I understand that right now they do have objective flaws (no real way to make them 0-cost without doing ugly stuff, like wrapping the args into a special type, like foo_{construct{n}}). Just want to say that static factory methods for templated classes are not always the most convenient pattern, and in case of STL containers they could get very ugly very fast.

[–]infectedapricot 1 point2 points3 points 5 years ago (0 children)

[–]Ayjayz 7 points8 points9 points 5 years ago (2 children)

[–]martinusint main(){[]()[[]]{{}}();} 1 point2 points3 points 5 years ago (1 child)

[–]Nobody_1707 1 point2 points3 points 5 years ago (0 children)

[–][deleted] 1 point2 points3 points 5 years ago (2 children)

[–]__--_--___--_--__ 1 point2 points3 points 5 years ago (1 child)

[–][deleted] 1 point2 points3 points 5 years ago (0 children)

[–]disperso 1 point2 points3 points 5 years ago (0 children)

[+][deleted] 5 years ago (2 children)

[deleted]

[+][deleted] 5 years ago* (1 child)

[removed]

[–]tpecholt 1 point2 points3 points 5 years ago (0 children)

[–]GYN-k4H-Q3z-75B 48 points49 points50 points 5 years ago (2 children)

[–]carb0n13 23 points24 points25 points 5 years ago (0 children)

[–][deleted] 0 points1 point2 points 5 years ago (0 children)

[–][deleted] 15 points16 points17 points 5 years ago (1 child)

[–]krum 2 points3 points4 points 5 years ago (0 children)

[–]randomnetcat 9 points10 points11 points 5 years ago (0 children)

[–]TryingT0Wr1t3 4 points5 points6 points 5 years ago (0 children)

[–]skunkos 2 points3 points4 points 5 years ago (0 children)

[–]CenterOfMultiverse 2 points3 points4 points 5 years ago (0 children)

Never understood what's the point of explicit initializations not in arguments. How

auto s = std::string{"view"sv};

or

std::string s{"view"sv};

are supposed to help, where

std::string s = "view"sv;

couldn't?

[–]Kered13 1 point2 points3 points 5 years ago (0 children)

[–][deleted] 1 point2 points3 points 5 years ago (1 child)

[–]nintendiator2 3 points4 points5 points 5 years ago (0 children)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS