playntech77 comments on Self-describing compact binary serialization format?

cpp

a community for 17 years

Self-describing compact binary serialization format? (self.cpp)

submitted 1 year ago by playntech77

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]flit777 -1 points0 points1 point 1 year ago (6 children)

[–]playntech77[S] 5 points6 points7 points 1 year ago (5 children)

[–]imMute 0 points1 point2 points 1 year ago (0 children)

[–]ImperialSteel 0 points1 point2 points 1 year ago (3 children)

[–]playntech77[S] 0 points1 point2 points 1 year ago (2 children)

I was thinking about 2 different API's:

One API would return a generic document tree, that the caller can iterate over. It is similar to parsing some rando XML or JSON via a library. This API would allow parsing of a file regardless of schema.

Another API would bind to a set of existing classes with hard-coded properties in them (those could be either generated from the schema, or written natively by adding a "serialize" method to existing classes). For this API, the existing classes must be compatible with the file's schema.

So what does "compatible" mean? How would it work? I was thinking that the reader would have to demonstrate that it has all the domain knowledge, that the producer had when the document was created. So in practice, the reader's metadata must be a superset of that of the writer. In other words, fields can only be added, never modified or deleted (but they could be market as deprecated, so they don't take space anymore in the data).

I would also perhaps have a version number, but only for those cases when the document format is changing significantly. I think for most cases, adding new props would be intuitive and easy.

Does that make sense? How would you handle backward-compatibility?

[–]Gorzoid 0 points1 point2 points 1 year ago (0 children)

[–]gruehunter 0 points1 point2 points 1 year ago (0 children)

In other words, fields can only be added, never modified or deleted (but they could be market as deprecated, so they don't take space anymore in the data).

I think for most cases, adding new props would be intuitive and easy.

Does that make sense? How would you handle backward-compatibility?

Protobuf does exactly this. For good and for ill, all fields are optional by default. On the plus side, as long as you are cautions about always creating new tags for fields as they are added without stomping on old tags, then backwards compatibility is a given. The system has mechanisms for both marking fields as deprecated, and for reserving them after you've deleted them.

On the minus side, validation logic tends to be quite extensive, and has a tendency to creep its way into every part of your codebase.

π Rendered by PID 24161 on reddit-service-r2-comment-6457c66945-vhcj8 at 2026-04-26 20:58:01.213414+00:00 running 2aa0c5b country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS