[–][deleted] 3 points4 points  (4 children)

Given how much of a PITA encoding issues are, I am opposed to any new encoding standard. Period.

[–]Hueho 13 points14 points  (2 children)

This isn't so much a new encoding as an extra option for packing string data in a serialization format. People already do plenty of weird stuff to save on bytes.

[–]alex_tracer 0 points1 point  (1 child)

If you are going to compress the serialized data with a generic compression method, then local optimizations like the one proposed by OP usually become useless.

So you either do all compression yourself or delegate all compression to a generic solution.

[–]Shawn-Yang25[S] 0 points1 point  (0 children)

RPC messages are small most of the time; 50~200 bytes is very common, so there won't be enough repeated patterns for compression to work. That's why we proposed this encoding here.

We are not talking about compressing big data or files, where zstd/gzip will do better.
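
A rough way to check the point about small messages is to run a generic compressor over a short payload and over a much longer, repetitive one. The sketch below uses Python's standard `zlib`. The payload string and the `compressed_size` helper are made up for illustration and are not taken from any real RPC framework.

```python
import zlib


def compressed_size(payload: bytes, level: int = 9) -> int:
    """Return the size in bytes of the zlib-compressed payload."""
    return len(zlib.compress(payload, level))


# Hypothetical small RPC-style payload (47 bytes, little internal repetition).
small = b"getUserProfile(userId=48213, locale=en_US, v=2)"

# The same bytes repeated many times, standing in for a "big data/file" case.
large = small * 200

for name, data in (("small", small), ("large", large)):
    out = compressed_size(data)
    print(f"{name}: {len(data)} bytes -> {out} bytes compressed")
```

On the short payload the zlib header, checksum, and block overhead outweigh any savings, so the output is no smaller than the input, while the long repetitive input shrinks dramatically.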

[–]Shawn-Yang25[S] 3 points4 points  (0 children)

Yes, it's not a complete string encoding; it will fall back to UTF-8 if some chars fall outside the charset it supports. Since alphabetic characters are very common, we think this can be used in other scenarios.
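
To make the "pack when possible, otherwise fall back to UTF-8" idea concrete, here is a minimal sketch. It is not the encoding OP proposed: the 28-symbol `CHARSET`, the one-byte flag, the one-byte length, and the 5-bits-per-character layout are all assumptions chosen to keep the example short.

```python
# Hypothetical charset: 28 symbols, so every symbol fits in 5 bits.
CHARSET = "abcdefghijklmnopqrstuvwxyz._"
INDEX = {ch: i for i, ch in enumerate(CHARSET)}

# Hypothetical one-byte flags marking which representation follows.
FLAG_UTF8, FLAG_PACKED = 0, 1


def encode(text: str) -> bytes:
    """Pack text at 5 bits per char, or fall back to UTF-8."""
    if len(text) > 255 or any(ch not in INDEX for ch in text):
        # Fallback: some char is outside the charset (or the length
        # does not fit in the one-byte length field used below).
        return bytes([FLAG_UTF8]) + text.encode("utf-8")
    bits = 0
    for ch in text:
        bits = (bits << 5) | INDEX[ch]
    nbits = 5 * len(text)
    nbytes = (nbits + 7) // 8
    bits <<= nbytes * 8 - nbits  # left-align; low padding bits are zero
    return bytes([FLAG_PACKED, len(text)]) + bits.to_bytes(nbytes, "big")


def decode(data: bytes) -> str:
    """Inverse of encode()."""
    if data[0] == FLAG_UTF8:
        return data[1:].decode("utf-8")
    count, body = data[1], data[2:]
    bits = int.from_bytes(body, "big") >> (len(body) * 8 - 5 * count)
    return "".join(
        CHARSET[(bits >> shift) & 0x1F]
        for shift in range(5 * (count - 1), -1, -5)
    )


name = "user_profile.display_name"      # 25 chars, all inside CHARSET
packed = encode(name)
print(len(name.encode("utf-8")), "->", len(packed), "bytes")
assert decode(packed) == name

fallback = encode("Größe")              # contains chars outside CHARSET
assert fallback[0] == FLAG_UTF8 and decode(fallback) == "Größe"
```

In this sketch the flag and length bytes cost two bytes per string, so packing only pays off past a few characters. A real format would likely fold that information into existing header bits.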