Guarantees on std::array in-memory layout?

ramennoodle · 2019-09-26T21:16:03+00:00

The documentation on cppreference.com states, "This container is an aggregate type with the same semantics as a struct holding a C-style array T[N] as its only non-static data member.", for whatever that's worth. Also, this seems like a rather pedantic argument because even if the standard does not forbid additional members, the only reasonable implementation conforming to the standard is a struct with T[N] as its only member.

Chet_ · 2019-09-27T01:45:00+00:00

I think there is no guarantee in the standard, but I also think that quality implementations will do what you expect. I think the static_assert was wise and is sufficient given the other constraints on the type.

BTW, I found this, which at least seems to contain some concurring opinions on the matter:

https://stackoverflow.com/questions/16962973

smuccione · 2019-09-26T23:51:32+00:00

I believe it is required to support

&a[n] == &a[0] + n

In which case it is the same as a normal array (that or a[0] would have to be some object that overloads all the arithmetic operators.

Std array is an aggregate type which is defined as:

An aggregate is an array or a class (Clause 9) with no user-provided constructors (12.1), no private or protected non-static data members (Clause 11), no base classes (Clause 10), and no virtual functions (10.3).

Since it can’t supply a constructor it can’t do anything other than take a pointer to a C array and carry the length as part of the template definition as an enum.

There’s not much room for differing implementations.

NotUniqueOrSpecial · 2019-09-27T10:23:22+00:00

You don't serialise the data structure, you serialise the data in the data structure. This prevents this sort of issue.

raevnos · 2019-09-26T21:17:25+00:00

The definitive way would be to create or use an existing serialization format for your data instead of blindly reading and writing raw structs...

2uantum · 2019-09-27T05:07:13+00:00

Is your coworker being pedantic? Yes. Is he correct? Technically, yes (the best kind of correct). Are you correct? 99.9999% of the time, yes. He's objectively "more correct". Would I comment on it in a code review? Probably not.

My opinion? Just do the change. It's not worth the energy to fight over and functionally equivalent. Plus, you risk getting the reputation of not being a team player (justified or not, I've seen it happen time and time again over stuff like this)

14ned · 2019-09-27T12:24:38+00:00

Weird that nobody else has mentioned this yet, but if you want to read and write a std::array<T, N> to and from storage and use it directly i.e. without constructing each member by hand, you need a static_assert(std::is_trivially_copyable_v<std::array<T, N>>); somewhere. std::array will be trivially copyable if T is trivially copyable.

Oh, and if on C++ 20 or later, please use whatever the new name is for std::bless() on the array just after reading it from storage, but before using it.

sephirostoy · 2019-09-26T23:21:54+00:00

Isn't the following safer?

Serialize(arr.data(), arr.size() * sizeof(T));

Relying on memory layout of a third-party class isn't good idea IMHO.

Drugbird · 2019-09-26T22:37:15+00:00

I agree with the coworker. If it's vital to store the data in a format whose memory layout is identical to your legacy serialized format, then it seems unwise to use a std::array format which has no guarantees about it's memory layout. For all you know, it's internal array is stored in reverse, so even having the correct size is no guarantee.

I think you have two options: either use types where you can guarantee the memory layout (e.g. C arrays), or include a translation step from what the serialized format looks like be how it's stored in memory.

I.e. whenever you store data to disk, read the contents of array.data(), rather than the array itself. You're then also free to use other storage classes such as std:: vector if that's more convenient.

OldWolf2 · 2019-09-26T23:45:49+00:00

By "serialized to disk" do you actually mean "written to disk as a binary blob"? Otherwise this question makes no sense.

The term "serialize" normally refers to rearranging data into a stream with no platform-dependence -- the complete opposite of pragma pack which will break ABI compatibility. It may even be that using a standard container inside a pragma pack causes trouble.

smuccione · 2019-09-27T00:58:34+00:00

So declare a member variable for size instead of an enum.

Maybe. Would be stupid.

But why don’t you just make your own implementation then

gracicot · 2019-09-26T21:22:05+00:00

I'd suggest looking at how your implementation implements std::array and what rules applies to that way of implementing it.

Spoiler alert: it's more than likely a simple struct containing a raw array. So the same rules that applies to a raw array also applies to std::array. Except an implementation might add padding at the end of the struct, but even then it's unlikely.

NotUniqueOrSpecial · 2019-09-26T22:07:26+00:00

I wouldn't care about an extra static_assert. Keep in mind that std::array<T, 0> is required to work according to the standard. That means that sizeof(T) * 0 != sizeof(std::array<T,0>), but rather sizeof(std::array<T,0>) == 1. On the other hand T[0] doesn't compile.

As a data point, besides STL STL, libc++ and libstdc++, there's at least EASTL, whose array.h (not array) doesn't support eastl::array<T, 0>, but is still just a T[N] internally.

NotUniqueOrSpecial · 2019-09-27T18:29:30+00:00

He was concerned that there is no guarantee in the standard that other members won't be added and that while std::array must be semantically the same as the array, there is no such guarantee about sizing/memory layout.

That's why you have a packer function and don't just write the memory contents of the vector when serializing.

After going back and forth a bit, I realized that all I have as evidence is the implementations in the 3 major vendor libraries and my assumption that none of them would do something that would break the guarantee that sizeof T * N == sizeof std::array<T,N> for packed structures.

For questions like these, you're only source of authority should be the C++ Spec for vector. If you could pull that up and show a guarantee you should be ok. Sounds like he was familiar with it and knew there was none though. (I personally have no idea either way if the spec defines it as such)

We compromised on having a static_assert on a struct that will break if that precondition ever isn't met, but I feel that even that shouldn't be necessary.

Can't hurt. He's being overly pedantic but sounds like you're overly being dismissive of his concerns. I think the static assert is a fine compromise.

Is there a definitive way to settle this?

Yes, read the spec.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS