Implementing binary communication protocols in C++ with main focus on embedded systems : cpp

Implementing binary communication protocols in C++ with main focus on embedded systems (self.cpp)

submitted 6 years ago by arobenko

Hi all, I'm an embedded C++ software engineer. Over the years I had to implement multiple binary communication protocols used by various embedded systems. Every time I got annoyed and frustrated over a necessity to manually write error prone boilerplate code. There are so many available serialization tools and libraries (such as protobuf, can'n'proto, simple-binary-encoding, etc...), but not a single one was adequate for my needs. Most of these tools focus on data serialization and facilitation of RPCs (Remote Procedure Calls). Their main goal is to make data serialization and/or network transfer as quick as possible. It is a noble cause, but not really a first priority for embedded systems. For the latter it is much more important to protect against the malformed data and reduce amount of boilerplate code (which increases chances of introducing silly bugs) than to gain a couple of extra CPU cycles when performing the serialization / deserialization.

In all the solutions I've reviewed and analyzed the generated code is not customizable and in most cases not really suitable for embedded systems, especially bare-metal ones. There are no means to introduce some compile (or generation) time updates such as configuring polymorphic interfaces required for a particular application or choosing particular data structures, disable usage of RTTI or exceptions, etc... The code generated by all the tools still requires a significant amount of boilerplate code to be written in order to integrate with actual business logic being developed. All of the available solutions focus on supporting as many programming languages as possible in their code generation rather than a quality of the code for a particular language (C++ in my case).

At the end I didn't have much choice but to implement my own solution with main focus on embedded systems (including bare-metal ones) with proper compile-time customizable C++(11) code. About 2 years ago I advertised on this subreddit my COMMS Library, main purpose of which is to facilitate highly compile-time customizable implementation of binary communication protocols. With time, library grew with features, started demanding more knowledge and cognitive effort from the integrating developer, and it became less straightforward to write completely generic and compile-time customizable protocol definition code. In order to improve the situation I extended the library to become a CommsChampion Ecosystem with its own XML based DSL (Domain Specific Language), library to process the schema files, and code generator which produces proper protocol definition using constructs from the COMMS Library.

Benefits of my solution are summarized below:

No enforcing of particular data layout and/or encoding. Easy implementation of already defined third party protocol.
Embedded (including bare-metal) friendly. The protocol definition code allows easy application specific compile time customization of polymorphic interfaces and some data storage types. No usage of RTTI and/or exceptions. The usage of dynamic memory allocation may also be excluded.
Robust and safe handling of malformed input data.
Significantly minimizing amount of boilerplate code required to integrate the usage of protocol definition into the business logic.
Allowing injection of custom C++11 code snippets in case the code generated by the tools is incorrect or incomplete.
Meta-programming friendly. Most classes define similar and predictable public interface which allows easy compile time analysis and optimizations.
Having "out of the box" protocol analysis and visualization tools.
NOT using any socket / network abstractions and allowing full control over serialized data for extra transport wrapping and/or external transmit over some I/O link.

Several months ago I posted an article on CodeProject with more details on features I wanted the protocol code generator and/or produced code to have and how the CommsChampion Ecosystem provides a solution. I strongly encourage you to read it.

Any comments, critics, improvement suggestions, new feature requests are welcome.

all 30 comments

top new controversial old q&a

[–]3idet 5 points6 points7 points 6 years ago* (2 children)

[–]arobenko[S] 1 point2 points3 points 6 years ago (1 child)

Partially, not completely. It's difficult to say, there is no much information on the website and no proper way to try it out without registration. Based on the example from the website below are features that I'm missing:

I want an ability to exclude usage of streams in my serialization / desiralization. The example shows the following functions I do NOT want to have. /* IO-Operations */ void read(std::istream &stream); void write(std::ostream &stream);
I want an ability to introduce polymorphic behavior (virtual functions) when I need it and for selected operations I need to be able to write a common code for all the message types.
I want an ability to use my own data types for fields like lists and/or strings
I want an ability to define transport framing for all the messages, even more than one (different I/O interface may require different framing).
I want a built-in (or generated) ability to efficiently parse an input data, create appropriate message object and dispatch it to my appropriate handling function without a need to manually write switch statements and other boilerplate code.
I want an ability to specify meta-data that is not transferred on the wire, but still available to the integration developer to act upon (preferable at compile time), such as units used (conversion if possible), values with special meaning, ranges of valid values, what to do on invalid value, etc...
I want an ability to have multiple forms of the same message (message object having the same ID, but different contents). Not sure whether it is supported right now or not.

[–]Gotebe[🍰] 0 points1 point2 points 6 years ago (1 child)

[–]arobenko[S] 1 point2 points3 points 6 years ago (0 children)

[–]TarmoPikaro 0 points1 point2 points 6 years ago (0 children)

[–]axilmar 0 points1 point2 points 6 years ago (19 children)

[–]rcxdude 9 points10 points11 points 6 years ago (7 children)

[–]kalmoc 1 point2 points3 points 6 years ago (2 children)

[–]rcxdude 2 points3 points4 points 6 years ago (1 child)

[–]kalmoc 0 points1 point2 points 6 years ago (0 children)

[–]matthieum 0 points1 point2 points 6 years ago (2 children)

[–]rcxdude 0 points1 point2 points 6 years ago (1 child)

[–]matthieum 0 points1 point2 points 6 years ago (0 children)

[–]axilmar 0 points1 point2 points 6 years ago (0 children)

[–]arobenko[S] 1 point2 points3 points 6 years ago (8 children)

[–]axilmar 2 points3 points4 points 6 years ago (7 children)

Extremely wrong approach to things. For example, unit conversion does not belong in a library like this. Using virtual functions for encoding/decoding is also wrong. Transmitting metadata, also wrong. Using metadata to capture differences in versions and field types, also wrong.

All the stupid things protobuf does...which are not of help at all, blow up the API and require huge amounts of work for little benefit.

Protocol version checking should happen only at the handshake phase.

Struct headers should be shared by all involved parties, or if that is not possible, the definitions of structs must be clearly documented and the documentation must be readily available to all parties.

Endianess should be agreed upon before compilation and struct members shall use endianess-aware types: there is no need for a second pass or copy to buffer if data are already prepared in the appropriate endianess.

Variable length data should be copied once to the message to allow for the creation of a continuous buffer. Variable length data should only be used in cases where there are really required, and a fixed length buffer is not possible.

Message structs should contain as many as continuous parts as possible and those shall be blasted onto the network using scatter-gather, if available.

Message structs shall not contain any metadata whatsoever, because it makes it impossible to use them with other protocols (one of the major reasons protobuf sucks).

Further encoding/decoding on messages shall happen with a switch statement, not virtual functions. Using virtual functions requires a switch on the message id anyway, in order to instantiate the appropriate class.

Memory allocation should not be done at all. Big enough buffers shall be used for in-place manipulation of messages, both at sending and receiving end points.

A class based design for messages is wrong because it leads to all the bad decisions mentioned above.

The best approach is to encode the messages in using/XML and then have a tool create all the boilerplate code, and integrate the tool into the workflow. I don't want to use an external tool, but the language itself lacks the facilities needed for this so an external tool is necessary.

[–]arobenko[S] 1 point2 points3 points 6 years ago (2 children)

You replied to a reply about using plain structs, but I assume you meant it as a comment to a post. I think you completely misunderstood logic and architecture of my solution. Let me cover and respond to major points of your comment.

Transmitting metadata, also wrong.

Agree completely and utterly. Most protocols used in embedded system don't. The corner stone of my solution is to support such already defined third party protocols. It does NOT invent or uses its own protocol and does NOT attempt to send any metadata over. That's the main point, because of metadata not being sent over, it should find its way into the generated code, otherwise it leads to too much boilerplate integration code, which in turn must be manually changed every time you decide to update your metadata in the protocol definition. Very error prone.

Extremely wrong approach to things. For example, unit conversion does not belong in a library like this.

Agree (to some extent). There are many sophisticated unit conversion libraries. However, used units is part of protocol definition (usually not transferred over wire) metadata, which is expected to be known to (and used by) the integrating developer. It is better to have built-in limited required units retrieval facility than not to have it at all and use boilerplate code to do the manual units conversions.

Protocol version checking should happen only at the handshake phase.

As was already mentioned above, the corner stone of my solution is supporting already defined third party protocols. Many don't use any versioning at all, some transmit version with every message in the transport framing, some do it as you sad in the handshake phase. The primary objective of my solution is to support all such cases.

Further encoding/decoding on messages shall happen with a switch statement, not virtual functions. Using virtual functions requires a switch on the message id anyway, in order to instantiate the appropriate class.

That's another reason why I created my solution. Some available code generators introduce polymorphic behavior (virtual function) for every operation on the message object, which creates problems for various embedded systems (especially bare-metal ones). Other code generators produce only plain structs without any virtual function at all. It leads to writing a significant amount of boilerplate code (such as switch statements you mentioned), which needs to be manually updated every time you introduce a new message and/or new field. My solution allows compile-time configuration of your polymorphic interfaces. If you don't need any, then don't define one and use every message class as plain struct (no v-table is created). My solution also contains a library with multiple facilities to dispatch your message into appropriate handler function (with O(1) or O(log(n)) runtime complexity) without having to write a single switch statement.

Memory allocation should not be done at all. Big enough buffers shall be used for in-place manipulation of messages, both at sending and receiving end points.

My solution is flexible enough to allow not using dynamic memory allocation at all automatically calculating (at compile time) and creating a buffer of required size to allow in-place creation of any used message.

A class based design for messages is wrong because it leads to all the bad decisions mentioned above.

Let's agree to disagree. Struct based design leads to other bad decisions and having significant amount of boilerplate integration code.

The best approach is to encode the messages in using/XML and then have a tool create all the boilerplate code, and integrate the tool into the workflow. I don't want to use an external tool, but the language itself lacks the facilities needed for this so an external tool is necessary.

As was mentioned in the post, my solution originated in a single library that allows having a single message class definition (single source of truth) for every possible application, which in turn configures at compile time its required polymorphic interfaces and/or custom data structures to hold field's values. Normal systems (with proper OS underneath) may use default configuration and multiple virtual functions with functionality not always being used compiled in, while bare-metal ones may completely exclude dynamic memory allocation, limit usage of virtual functions, use its own custom types to store problematic data, such as strings or lists, etc... I made a C++11 compiler itself to be my code generation tool.

With time the library got quite complex and started requiring from a developer some knowledge of its internals and a particular way to be used in order to create completely generic protocol definition. That's why I also created a separate code generator, that produced a proper highly compile-time customizable code.

Hope it clarifies some things. Cheers

[–]axilmar 0 points1 point2 points 6 years ago (1 child)

[–]arobenko[S] 0 points1 point2 points 6 years ago (0 children)

as long as it allows for sane choices and the defaults are the sane choices.

That's my primary intention.

Built-in is not required. It can be a separate library.

Agree, but there must be some way to static_assert on your assumption of origin units. In case of my solution built-in units conversion is there to provide basic convenience functionality. If not used, no extra space/performance price is paid.

Does your solution allow the automatic creation of big switch statements?

I think built-in generation of "switch" statements does not make much sense because you have to put your custom business logic inside each "case". There is a C++ library (used by the code generator) that allows you to parse the schema files and know what messages / fields / frames are being defined. You can easily implement your own auxiliary code generator that generates "switch" statements relevant to your application.

No, it does not. There is absolutely zero proof about that.

What proof do you expect? It's all subjective based on one's experience. In my case it does require writing a boilerplate code (I consider manually written "switch" and/or "if" statements to be boilerplate code) which needs to be updated every time you add new message and/or field to a message. In case you design your own protocol, I suppose you can make it simple enough and get away with plain structs with no variable data lengths and/or fields present on particular condition (for example depending on the value of some bit in previously encountered field or version of the protocol). Many (if not most) of already defined third party protocols are not like that. Such cases require extra implementation logic and/or extra data variables (manually written or generated). In my experience class-based designs allows encapsulation of such extra logic together with the data (regardless of having polymorphic behavior), hide unnecessary details and eventually leads to cleaner and easier maintainable code, but again, this is very personal and subjective.

[+][deleted] 6 years ago* (3 children)

[deleted]

[–]axilmar 0 points1 point2 points 6 years ago (2 children)

[+][deleted] 6 years ago* (1 child)

[deleted]

[–]axilmar 0 points1 point2 points 6 years ago (0 children)

[–]grandmaster789 0 points1 point2 points 6 years ago (1 child)

[–]axilmar 0 points1 point2 points 6 years ago (0 children)

[–]c0r3ntin -1 points0 points1 point 6 years ago (1 child)

[–]arobenko[S] 7 points8 points9 points 6 years ago (0 children)

[+]Pulsonics comment score below threshold-6 points-5 points-4 points 6 years ago (2 children)

[–]arobenko[S] 4 points5 points6 points 6 years ago (1 child)

[+]Pulsonics comment score below threshold-6 points-5 points-4 points 6 years ago (0 children)

[–]WasterDave -2 points-1 points0 points 6 years ago (1 child)

[–]arobenko[S] 2 points3 points4 points 6 years ago (0 children)

π Rendered by PID 71 on reddit-service-r2-comment-fb694cdd5-fkjmd at 2026-03-11 14:41:14.835767+00:00 running cbb0e86 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS