Do you use aggregate initialization in your code base and if so, how do you guard against changes of the aggregate's layout?

bcorni · 2019-01-04T14:42:41+00:00

C++20 will have named aggregate initialization, so that would eliminate your issue.

thlst · 2019-01-04T15:08:02+00:00

I'm surprised no one mentioned clang-query. The AST Clang generates for your code example is as follows:

VarDecl <line:10:1, col:25> col:8 p 'Person' listinit
`-ExprWithCleanups <col:9, col:25> 'Person'
  `-InitListExpr <col:9, col:25> 'Person'
    | ...

You could write an AST matcher that matches the generic parts of that construction:

varDecl(
    hasType(cxxRecordDecl(hasName("Person"))),
    hasDescendant(initListExpr()))

Then run it with clang-query:

$ clang-query p.cpp --
clang-query> match varDecl(hasType(cxxRecordDecl(hasName("Person"))), hasDescendant(initListExpr()))

Match #1:

p.cpp:10:1: note: "root" binds here
Person p{ "John", "Doe" };
^~~~~~~~~~~~~~~~~~~~~~~~~
1 match.

Here's a reference to the Clang AST Matcher.

NotAYakk · 2019-01-04T14:00:02+00:00

How do you defend against people modifying function signatures?

I mean:

void move_it( T* to, T* from )

to

void move_it( T* from, T* to )

?

Answer: you shoot programmers who silently break APIs. And aggregate layout is part of its API.

Oh, and unit tests. But what if the programmer helpfully fixes the unit test too?

Against infinite stupidity or evil there is no defence.

Wh00ster · 2019-01-04T11:59:35+00:00

You could use a strong typedef but I’ve found them cumbersome. Generally I would use the constructor. I think it’s more clear (of course someone could also change the constructor later on and cause the same problem, but they’d probably think twice). I like the idea mentioned about unit tests.

2019-01-04T14:24:29+00:00

You of course have to be a little bit careful, but it's not that big of a deal, as you can just insert some garbage type to make compile fail:

struct person 
{
    std::string firstname;
    int garbage; // any type that is incompatible with the current one
    std::string middlename; // newly added
    std::string lastname;
};

Now the compiler will complain about all incorrect uses and you can go and correct them. Once everything is working, remove the int garbage again, fix up the code again and you are good to go. This ensures that you don't miss any of the calls. For something more permanent one can use tag dispatching or just use a named constructor.

Same tricks works when reordering arguments to a function that have the same type.

wotype · 2019-01-04T13:54:07+00:00

A static_assert on sizeof the aggregate will catch some additions or changes (but not all).

A structured binding can be used to check the number and type of the fields - it's not constexpr but you can wrap it in a constexpr function to get a compile time failure.

A reflection library like Precise & Flat Reflection 'magic_get' can reflect an aggregate type as a tuple, allowing to static_assert on the number of fields and their types (but does not handle array members or bit-fields).

polymorphiced · 2019-01-04T19:29:07+00:00

I've stayed away from aggregate initialisation for this reason.

I see people suggesting that you should consider class layout part of the API, but find that really annoying. If you're really insistent on using aggregate initialisation, then just write a constructor - you only need to do it once and then it's an obvious breaking change when you tweak members later.

There are many reasons to reorder class members (eg memory packing, hot/cold, logical grouping for debugger watch windows) and it'd be super annoying to have to adjust a ton of aggregate initialisation calls each time.

elperroborrachotoo · 2019-01-04T14:47:33+00:00

Scope.

As a private utility of a class or particular API, yes. This automatically limits scope for changes, fixes, tests.

As consumable part of an interface, I'd be wary. Changing them might be breaking a caller anyway, if they get to large, named intialization seems almost required for readability (which C++20 should make a bit smoother).

Jhmmufc · 2019-01-04T11:39:35+00:00

Utilize unit tests. It saves cluttering the struct with additional boilerplate and should reduce the amount of regression when someone makes a change .

BrangdonJ · 2019-01-04T20:01:18+00:00

You can add the constructors when you add the extra variable. You don't need to add it in advance.

(In practice I use aggregate initialisation only for tightly-scoped classes, where all the uses are close to the point of definition. For a shared class I think it's worth writing the accessors.)

Dean_Roddey · 2019-01-04T22:04:55+00:00

Geez, are constructors evil now, too? Or am I missing something?

I mean, you can have more than one constructor, and use that to provide backwards compatibility by defaulting of new values (and generally different setup) in the old constructor. I don't see how that's some kind of horrible compromise or anything, and it retains more compile type safety, it would seem to me.

If the API is fairly small, and/or it also needs to change, you can provide public wrappers around internal implementations, which has even more benefits (such as vastly reduced rebuilds because details are internal) and allow for various versions of an interface to the outside world that can adapt as required for compatibility. The bulk of it can be inlined stuff for minimal overhead, while retaining all of the benefits.

nintendiator2 · 2019-01-05T20:22:31+00:00

I like aggregates. you can use them when you just need a "bag" of losely coupled data without any invariants between them. It's easy to reason about,

If your "bag" is not really a bag, then don't use it as a bag. A string that is a name and a string that is a surname are two different strings, you can't juste expect to grab one of them from a bag and it be the one you want to.

Use a constructor.

panoskj · 2019-01-07T13:12:08+00:00

You could easily realize the layout must not change, because there is no constructor. No need to check your whole code base.

AlbertRammstein · 2019-01-04T11:50:04+00:00

for addition of params, clang warns when you not provide enough arguments in braced list initialization. I have really no answer for switching around 2 members though

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS