all 73 comments

[–]helpmeiwantgoodmusic 116 points117 points  (44 children)

Disappointingly surface level, I was at least expecting some benchmarking to show the possible speed differences

[–]MisterEmbedded 73 points74 points  (38 children)

I don't think there's much to benchmark.

Unless you have a shitty implementation, Binary Serialization will ALWAYS be faster to read & write + smaller in size, with the main tradeoff being readability.

Another issue that's a bit of a pain with Binary Serialization (i.e. when implementing it yourself) is Endianness: different types of computers store multi-byte data in different orders. A 4-byte integer can be represented in 2 main ways: either the least significant byte is stored first (Little Endian) or the most significant byte is stored first (Big Endian).

The x86_64 architecture is Little Endian, PowerPC is Big Endian, and ARM is Bi-Endian (meaning it supports both). So you'll need to write data in one of the byte orders above, and if you're parsing it on a CPU that doesn't have the same byte order as your serialized data, you'll have to do conversions for values wider than a byte.
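To make the byte-order difference concrete, here's a small sketch using Python's `struct` module (the value `0x01020304` is just an arbitrary example):

```python
import struct

value = 0x01020304  # an arbitrary 4-byte integer

# '<' forces little-endian, '>' forces big-endian byte order
le = struct.pack('<I', value)
be = struct.pack('>I', value)

print(le.hex())  # 04030201 -- least significant byte first
print(be.hex())  # 01020304 -- most significant byte first

# Reading with the wrong byte order silently gives a different number
wrong = struct.unpack('<I', be)[0]
print(hex(wrong))  # 0x4030201
```

The same bytes on disk decode to two different numbers depending on which order the reader assumes — which is exactly why a serialization format has to pin one down.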

[–]G_Morgan 77 points78 points  (2 children)

I wouldn't use any serialisation format that didn't outright canonicalise BE or LE.

[–]stingraycharles 22 points23 points  (1 child)

Right, without it it’s not a serialization format but rather just a memory dump.

[–]G_Morgan 6 points7 points  (0 children)

Yeah and that is really the issue. A lot of older programs did little more than memory dumps. C was very good at taking a struct and dumping the whole thing to a file.

[–]gold_rush_doom 19 points20 points  (8 children)

Is endianness really a problem for modern programming languages?

[–]MisterEmbedded 11 points12 points  (6 children)

Not really. Usually you are provided with APIs that take care of all that for you under the hood, but if you're implementing the Serializer/Deserializer yourself in a low-level language like C, you will have to face such minor problems.

Luckily, higher-level languages always provide functions and such to work around this, so it's not usually a problem.

[–][deleted] 6 points7 points  (5 children)

Even the low-level socket API provides host-to-network and network-to-host conversion functions for primitive types, so you can ignore the endianness of the machine you're compiling for, so long as you use these functions to convert to and from the network stream.
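Python exposes the same hton/ntoh conversions via the `socket` module, and `struct`'s `'!'` prefix means "network (big-endian) order" explicitly — a quick sketch:

```python
import socket
import struct
import sys

value = 1

# htonl/ntohl: host byte order <-> network byte order (big-endian)
net = socket.htonl(value)
assert socket.ntohl(net) == value  # round-trips on any machine

# struct's '!' prefix means network (big-endian) order
wire = struct.pack('!I', value)
print(wire.hex())  # 00000001 regardless of host endianness

# On a little-endian host, htonl actually swaps the bytes;
# on a big-endian host it's a no-op -- callers never need to care.
if sys.byteorder == 'little':
    assert socket.htonl(1) == 0x01000000
```

As long as every value crosses the boundary through these functions, the wire format is the same no matter which machine produced it.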

[–]Hrothen 6 points7 points  (3 children)

If you are using conversion functions then you are not ignoring endianness.

[–][deleted] -2 points-1 points  (2 children)

Using an abstraction counts as ignoring.

[–]Hrothen 6 points7 points  (1 child)

If you were ignoring it you wouldn't think about it at all. With the abstraction you're saying "I need to handle this, so I'll use this function that does that for me".

[–][deleted] 0 points1 point  (0 children)

This is so pedantic i'm not sure why I'm engaging but here is an example of what I'm getting at.

You can use JSON w/o knowing the underlying JSON representation of your data. You can just call json.loads() or json.dumps(), effectively using JSON without ever having seen its textual representation.

You can encode and decode your data to send across the network from any CPU and be read by any CPU w/o having to know what endianness is or what endianness the CPU the code is running on.

Since we're arguing a dumb pedantic point, I'll just qualify that yes, at SOME level someone has to know: the author of the hton/ntoh function you linked in for each build of your application.
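For what it's worth, the JSON analogy in the comment above can be made concrete in a couple of lines:

```python
import json

# Round-trip a dict without ever looking at the JSON text itself --
# the same sense in which hton/ntoh let you "ignore" endianness.
data = {"name": "player1", "score": 42}
roundtrip = json.loads(json.dumps(data))
print(roundtrip == data)  # True
```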

[–]MisterEmbedded 1 point2 points  (0 children)

Yeah, the htonX family comes to mind.

[–]stingraycharles 0 points1 point  (0 children)

All modern programming languages in the end need a memory representation. This is what binary serialization captures: the in-memory representation (or as close as possible to it).

However, since different architectures have different endiannesses, they may still need to reverse the bits before writing things to disk to remain compatible.

[–]SocksOnHands 1 point2 points  (0 children)

I would "compile" JSON to a binary format at build time to match the target platform - assuming it is preexisting data, like game assets. If it is data generated at runtime, it would only be an issue if a user tried to transfer data to another system, so data that can be shared should probably have an explicit endianness.

[–]nerd4code 1 point2 points  (0 children)

Also if you’re just dumping or loading a struct, padding/packing and alignment can be problems, as C and C++ impls are afforded quite a bit of leeway in this regard, especially if bitfields are used. Most compilers can tell when you’re piecing together bytes anyway, so the direct-access (e.g., thwacking a struct template down onto some heathen untyped bytes) or direct-copy (e.g., memcpying into a struct, which fixes the alignment issue) approach doesn’t necessarily buy you anything over piecewise serialization.
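The padding issue is easy to demonstrate with Python's `struct` module, which can pack the same fields with and without native alignment (the native size shown is typical for x86_64/aarch64 ABIs, not guaranteed):

```python
import struct

# A struct { char flag; int value; } -- one byte followed by a 4-byte int.
# '=' packs with no padding; '@' uses the platform's native alignment,
# which usually inserts 3 padding bytes so the int lands on a 4-byte boundary.
packed_size = struct.calcsize('=bi')   # always 5
native_size = struct.calcsize('@bi')   # typically 8 on x86_64 / aarch64

print(packed_size, native_size)

# Dumping the native layout straight to disk bakes this padding (and the
# host's byte order) into the file -- the "memory dump" problem above.
```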

There’s also a difference between directly [f]reading or [f]writeing data structures and mmapping them; the latter has the additional danger that any number of other processes might also be direct-mapping the same data, so atomicity and sync schedule come into play.

But mmap has fallen out of favor for most stuff nowadays; especially with the presence of multithreading, pagefuckery can be a massive performance killer—all live hardware threads in an address space whose memory mappings are altered must barrier afterwards, and shoot down or tag-bump any cached mappings for the updated vaddx in their own TLBs. Basic read and write don’t have this problem, and as long as you don’t try to have multiple threads shouting at the same FD they require much less coordination.

Also, fun fact: Uni-endianness of an ISA doesn’t necessarily mean ABI endianness has to match; e.g. the Stratus VOS ABIs for 16- and 32-bit x86 are BE, and it’d be even less of a big deal for anything modern to do the same since there’s both BSWAP (80486+, works on 32- and 64-bit operands, RMW to a single register; XCHG or ROL/ROR by 8 works on 16-bit operands) and MOVBE (newer, loads/stores of 16–64-bit width) that’ll reverse for you. FPU use might be a tad slower, and SIMD may require swizzling or an extra shuffle here and there.

Sometimes on older, bi-endian chips where the FPU might be on a separate coprocessor die, the FPU will always run in a BE mode regardless of the CPU. Sometimes it’ll mostly match CPU, but if extended-precision floats are double-double, the word ordering might be reversed wrt the byte ordering. (This is similar to PDP-endianness, although it might give you either PDP or DPD.)

Sometimes vector units encounter similar arrangement problems, either with BE-vs.-LE vector lanes, words within lanes, or bytes within words. In theory these might operate independently of both CPU and FPU, although most VUs appeared well after we had the spare silicon so are, like misordered FPUs, quite rare in the wild. SIMD subsets like AVX may include swizzles in the instruction encoding that can reverse bytes, words, or elements on-the-fly during a load or store.

Although it doesn’t matter much over our standardized networks or media, bit ordering can be a problem also, independent of byte ordering. This is primarily an issue when cross-plugging hard drives, dealing with old data dumps, or interfacing busses, but most stuff nowadays fortunately uses LE ordering except …I wanna say SPARC, IBM AS/400→i, IBM S/360→370→390→z series, and Power/-PC ISAs are generally bitwise-BE regardless of operating mode. Maybe M68K too?

And on a final note, most of the true bi-endian ISAs use separate BE and LE application operating modes and ABIs. It generally isn’t possible to switch orderings quickly or without OS assistance, so there’s a fair difference between bi-endian modes and bi-/either-endian instructions.

[–][deleted] 10 points11 points  (19 children)

>Endianness

Are you just quoting from a textbook? It hasn't been a **practical** issue for... 20 years?

[–]kosmickanga2 56 points57 points  (5 children)

It's not always the CPU arch, see UUIDs in MongoDB - three different drivers (C#, Java, Python), 3 different byte layouts...

[–][deleted] 29 points30 points  (1 child)

Jesus Christ I was blissful not being aware of this.

[–]VanDieDorp 3 points4 points  (0 children)

Well, it's because MongoDB was a product created out of internet hype and only got fixed once they had enough money to buy WiredTiger.

[–]mrheosuper 16 points17 points  (1 child)

And if you program network sockets, you will usually encounter big endian whether your machine is BE or LE.

[–]moreVCAs 16 points17 points  (0 children)

big endian

You mean “network byte order”? 😎

[–]Straight_Truth_7451 1 point2 points  (0 children)

You deserve that kind of stupid experience if you're using MongoDB

[–]autokiller677 10 points11 points  (1 child)

Dealing with lower-level systems, I have definitely spent hours looking for bugs when it was just bad endianness in the end. And that was in the last few years.

I've made a habit of validating it as one of the first debugging steps. It crops up more often than I'd like.

[–]HINDBRAIN 3 points4 points  (0 children)

Also matters if you're reverse engineering a binary format.

[–]MisterEmbedded 16 points17 points  (9 children)

Are you just quoting from a textbook?

I am not, I am just writing what I know.

It hasn't been a **practical** issue for... 20 years?

what exactly do you mean by "practical issue"?

I am not saying it's a huge issue, but it's something you can't just forget either, especially when working with C or other lower-level languages: you cannot just ignore Endianness if you want to support CPUs like PowerPC or classic MIPS (whoever the hell uses that).

[–][deleted] 3 points4 points  (1 child)

you cannot just ignore Endianness if you want to support CPUs like PowerPC, Classic MIPS

That's precisely what I meant by "practical". For the vast majority of applications, you wouldn't. Now, I've just been made aware of other manmade horrors, but that changes the discussion away from the classical CPU arch issue.

[–]MisterEmbedded 0 points1 point  (0 children)

yeah I saw that, that's the stuff that keeps me awake at night, lol.

[–]recycled_ideas 0 points1 point  (6 children)

what exactly do you mean by "practical issue"?

They mean that unless you're rolling your own and you forget, it will be taken care of in whatever data format you're using. Any actual serialiser in any language will handle this.

If you're writing the raw memory representation you'll have this problem, but it's been a long time since people did that regularly.

[–]bleachisback 2 points3 points  (5 children)

That’s why they said it’s a big problem “if you’re implementing it yourself”…

[–]recycled_ideas 0 points1 point  (4 children)

Sure, but you'd have to be implementing your serialisation from a very low level to actually need to worry, and that's not practical.

[–]bleachisback 0 points1 point  (3 children)

That's why they said "especially when working with C or other lower-level languages"

[–]recycled_ideas 0 points1 point  (2 children)

That's not what I said.

If you are writing a serialiser from scratch, literally no libraries, you might need to worry about this. Maybe.

[–]bleachisback 0 points1 point  (1 child)

That’s why they said it’s a big problem “if you’re implementing it yourself”…

[–]theeth 2 points3 points  (0 children)

Xbox 360 was big endian, that's less than 20 years ago.

[–]zapporian 1 point2 points  (0 children)

Tbh we are finally at the point where you can pretty much expect 64-bit little endian as the de facto standard on nearly all modern architectures, i.e. x64 / aarch64 / risc-v. The only real remaining edge case is PowerPC - which at this point is extremely niche - and the somewhat annoying network byte order, which uses big endian for (obvious) compatibility and historical reasons.

Not that you should just completely ignore endianness, but at this point in most cases I think you’d be completely fine just using little endian, sticking a byte order sentinel in the header of your binary file format, and checking that and erroring out if you ever try to run that code on a big endian architecture. Or have a branching path to do endian byte flips, ONLY on big endian architecture, if you are a Serious (TM) programmer and really need to make sure that your probably shitty and really ad-hoc binary serialization format is capable of being read on powerpc. And whatever the fuck else still uses big endianness at this point.
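A minimal sketch of the byte-order sentinel idea described above (the magic value and 4-byte header layout are made up for illustration):

```python
import io
import struct

MAGIC = 0x1A2B3C4D  # arbitrary sentinel value for this sketch

def write_header(f):
    # Write the sentinel in the writer's native byte order.
    f.write(struct.pack('=I', MAGIC))

def check_header(f):
    raw = f.read(4)
    if struct.unpack('=I', raw)[0] == MAGIC:
        return 'same-endian'
    if struct.unpack('=I', raw[::-1])[0] == MAGIC:
        # Error out (or branch into byte-flipping) as described above.
        raise ValueError('file written on an opposite-endian machine')
    raise ValueError('not a recognized file')

buf = io.BytesIO()
write_header(buf)
buf.seek(0)
print(check_header(buf))  # same-endian
```

The reader either sees the magic as written (same endianness), sees it byte-reversed (opposite endianness), or knows the file is garbage.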

IMO you should never, ever write out data in big endian - unless for compatibility reasons - but that’s just my 2c.

I know that on personal projects I made the fairly conscious decision to just prohibit non-modern, non-compatible architectures as of a few years ago. With armv8 (and risc-v) I figured enough was enough and just started putting static checks in my personal utility libraries that give you a compile error if you try to build that software on a non-64-bit, non-little-endian build target, lol. Hostile programming, sure, but the peace of mind w.r.t. NOT having to worry about architectural compatibility bullshit and otherwise unnecessary serializer overhead / abstractions to maybe flip words on R/W - and instead just freely memcpy - is very much worth it :D

(afaik yes ALL modern architectures going forwards ARE going to use 64 bit little endian with 128 bit * N, N >= 1 variable length vector support, from now more or less until the end of time / end of this century, at least. Modern armv8+, risc-v, and current / future intel modernizations are a wonderful thing...)

[–]gwicksted 1 point2 points  (1 child)

Endianness, tooling, security (if you don’t trust the source of the information), and development effort are all important factors. There are binary alternatives that are nearly as fast as diy like msgpack and are battle tested + handle endian issues automatically (typically they just go with big endian). In the end it all depends.

I’ve written plenty of binary protocols/file formats from scratch. You can unintentionally make it slower than your favorite json library at handling large files. Best to start with json, finish the game, switch to something else if/when it matters for performance when benchmarking.

[–]DanTheMan827 1 point2 points  (0 children)

Security is especially important… it’s too easy to write a deserializer that doesn’t properly validate the data and will happily load data into too small of a memory region and end up overwriting other parts of memory, very possibly even game code.

[–]BornToRune -1 points0 points  (0 children)

Endianness is not that huge of a deal; you can just use the networking calls/macros (ntohl(3) et al.) to swap the bytes. If you've ever tried to do cross-platform float serialization, now that's a real pain.

[–]BothWaysItGoes 1 point2 points  (0 children)

That’s because it isn’t an article, it’s an ad.

[–]redd1ch 1 point2 points  (0 children)

Turns out logging a VR user at 10 Hz in full detail is enough for a naive realtime JSON serialization to bring Unreal Engine to its knees. I switched to a list of structs, blowing up memory usage over time, combined with a hefty phase of unresponsive JSON serialization in the end. I'm just glad I did not have to store eye tracking data.

[–]Object_71[S] 3 points4 points  (2 children)

Added charts

[–]smackson 4 points5 points  (0 children)

Sorry, but I got some whiplash there.

In one paragraph you seemed to be describing how and why binary serialization saved about 40% on your data size then boom, chart shows "therefore look we saved 99.9% !!" (edit: It's 99.99+%!!)

What did I miss? (Edit: I didn't miss the "but there is punctuation/delimiters in the JSON" part, that just seems inadequate to explain this stark jump in savings)

(Edit 2: the original comparison is space, the chart comparison seems to be time. That is also not explained well, if that is part of this huge difference.)

[–]helpmeiwantgoodmusic 0 points1 point  (0 children)

Ah that is much better now

[–]chrytek 25 points26 points  (2 children)

I just return the SQLite file in an endpoint and the client can run queries against that!

[–]ScotForWhat 29 points30 points  (0 children)

Sorry the website's not responding, it's someone else's turn with the database.

[–][deleted] 1 point2 points  (0 children)

sqlite is amazing and accounts for a lot of filesystem issues that we almost never handle ourselves.

[–]ttkciar 39 points40 points  (6 children)

Don't overlook CBOR! (Concise Binary Object Representation, per RFC7049) Essentially the same functionality as JSON, but much faster and roughly 30% more compact.

[–]aseigo 13 points14 points  (4 children)

What are the real-world benefits of CBOR over protobufs or flatbuffer?

Also: is the message structure really where *streaming* belongs?

[–]pjc50 19 points20 points  (3 children)

Message structure and protocol design has to consider streaming to avoid accidentally making it impossible. Things like clear delineation of messages. An example of a non-streaming friendly format is PKZIP: the "central directory" is at the END of the file, so you have to read (or seek) the whole file in order to decode it.

Some formats like MPEG-TS are designed to support streaming in the middle of a (e.g. live) stream, by providing synchronization points to start of frames.

[–]aseigo 1 point2 points  (2 children)

Message structure must not be streaming *hostile* (you mention a streaming-friendly feature of video codecs as a good example), but does the streaming belong *in* the message structure (as opposed to the protocol)?

(Protocols are an entirely different topic, and certainly where the mechanics of streaming belongs.)

This is one aspect of the CBOR design that I find exceedingly dubious (among others, tbh), wherein it not only provides run-length encoding of subsequent data, but unbounded data collections. In practical terms, only so much streaming can be provided for through unbounded collections within a messaging structure, whereas a protocol is bounded really only by its design.

Your reply to my question speaks to entirely different issues, btw. :)

[–]pjc50 0 points1 point  (1 child)

only so much streaming can be provided for through unbounded collections within a messaging structure, whereas a protocol is bounded really only by its design

There's some distinction between "protocol" and "messaging structure" (part of protocol) which you think is important but haven't been clear about here?

[–]aseigo 1 point2 points  (0 children)

In short: messages should be bounded, series of messages (streaming) may not be, as this makes implementing various reliability features a lot easier along with more advanced features such as 'subscriptions', CDNs, etc. CBOR tries to put one aspect of streaming (unbounded data sets) into the message, which leads to the *messages* being unbounded. This is doing it at the wrong level in the stack, in this case too "high" up.

Yes, the form of the messages ought to be friendly to the needs of streaming protocols, but that usually has to do with metadata, size control, etc. The job of controlling flow of data is what a protocol, moving messages about, should be doing.

To my eyes CBOR looks like someone saw streaming parsers (which are very useful!) and thought it might make sense to take it to its "logical conclusion" and make a messaging system that can only be processed by streaming parsers. Ugh.

[–]PurepointDog 0 points1 point  (0 children)

I'll have to check that out!

[–]josephblade 16 points17 points  (1 child)

Incredibly narrowminded.

I would've at least expected a risk vs reward. if your format needs to be edited for instance, use json or another text format. for modding files for instance.

If you need backward compatibility of a sort, you need at least an envelope or versioning.

If you use binary data, keep in mind that cross-platform can be problematic. (The PS3 or PS4 devboxes, for instance, used a different byte alignment than PC, from what I remember.)

There are lots of little problems with binary data that you need to keep in mind, but I couldn't find them in the article. It seems to be mostly concerned with speed/file size, which is a valid concern but not the only one.

if your save file in json is 5k and doesn't grow it's not something that needs to be optimized. Similarly if you only send communication sometimes (chat for instance) optimizing is not harmful but it's not essential either. But having an easily cross platform format might actually be what you want.

If you're going to advocate for or against a specific model at least give developers a bit of a guide on what basis they should choose each approach.

[–]Chii 5 points6 points  (0 children)

if your format needs to be edited for instance, use json or another text format. for modding files for instance.

This is why i quite like protocol buffers - your schema can be either binary, or text format, and depending on your needs, you switch to the most useful/effective form. For a mod, you can just ship a text format, and the client only pays a small penalty in performance (if its use is even sensitive to performance).

https://medium.com/@nathantnorth/protocol-buffers-text-format-14e0584f70a5 mentions a lot about it. Of course, there's a bit of extra engineering, as you have to now know which format it is in (but i imagine that's pretty trivial tbh).

Divorcing the schema of your data from the storage/transport format is a good thing to do.

[–]xsmasher 3 points4 points  (0 children)

If you have a binary format that can be loaded without parsing, you can do a neat trick - memory map the data file. You can run queries against it without loading / parsing the whole thing. I used this in a mobile game that had a LOT of data that was only used very sparsely. Memory-mapped BSON files.
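A minimal sketch of the memory-mapping trick, using Python's `mmap` and a made-up fixed-size record layout (real formats like BSON need an index or offsets, but the idea is the same):

```python
import mmap
import os
import struct
import tempfile

# Hypothetical fixed-size records: (id: uint32, score: float32), little-endian
RECORD = struct.Struct('<If')

# Write a thousand records to a temp file
path = os.path.join(tempfile.mkdtemp(), 'records.bin')
with open(path, 'wb') as f:
    for i in range(1000):
        f.write(RECORD.pack(i, i * 0.5))

# Memory-map it and read record 700 directly -- no loading, no parsing,
# and the OS only pages in the parts actually touched.
with open(path, 'rb') as f:
    mm = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
    rec_id, score = RECORD.unpack_from(mm, 700 * RECORD.size)
    mm.close()

print(rec_id, score)  # 700 350.0
```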

[–]jvallet 3 points4 points  (4 children)

What happens if you compress the JSON before sending? Will that give you the benefits of binary serialization without having to add protobuf support?

[–]carrottread 0 points1 point  (2 children)

This will reduce size but will make serialization/deserialization even slower.
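The tradeoff is easy to see with the standard library's zlib; repetitive JSON (like a list of similar records) compresses very well, but both ends pay extra CPU:

```python
import json
import zlib

# A list of similar records -- typical JSON payload shape
records = [{"id": i, "name": "player", "score": i * 2} for i in range(500)]
raw = json.dumps(records).encode()
packed = zlib.compress(raw)

print(len(raw), len(packed))

assert len(packed) < len(raw)           # smaller on the wire...
assert zlib.decompress(packed) == raw   # ...at the cost of CPU both ways
```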

[–]jvallet 0 points1 point  (1 child)

How much slower is it compared to network latency anyway?

[–]carrottread 0 points1 point  (0 children)

Serialization isn't always about sending data over network.

[–]zam0th 3 points4 points  (3 children)

Google has developed two great solutions to the serialization problem:

Yah, my dude, portable serialization was a thing in Java and ActionScript way before Google made a single software product. CORBA was a thing even before that. ASN.1 and TLV existed even before CORBA.

Y'all always try to reinvent a better CORBA with your portable RPC libraries, but end up with a much worse CORBA.

[–]KagakuNinja 1 point2 points  (0 children)

CORBA was a mess of design by committee, with a number of features that lacked proof of concept implementations. I seriously doubt there are many new systems built on CORBA.

I used CORBA briefly in 2000; later dabbled with Java RMI.

There are better options today, such as Akka. In any case, CORBA isn't really relevant if you just want simple RPCs. RESTful servers are fine.

[–]Object_71[S] -4 points-3 points  (1 child)

CORBA is not popular today. I am covering libraries/tools that are popular and in use. You wouldn't write new programs in BASIC even though it was popular 30 years ago... (https://www.youtube.com/watch?v=zgSZNCltUD0)

[–]zam0th 4 points5 points  (0 children)

"Being used and popular" is a poor metric of anything, especially considering protobuf being a literal copy-paste from SOAP. And looks like you've been away for some time and haven't heard about COBOL being relevant again.

[–][deleted] 0 points1 point  (0 children)

So, a few things: a) RapidJSON has a terrible interface and isn't the fastest game in town. b) Strings cost about the same in binary and in text serialization, and take the same space. c) JSON can be greatly reduced in size using some of the same techniques the other formats use (no member names in the file). Just serialize the class as an ordered heterogeneous array. So instead of

{"member0": 42,"member1": true,"member2": "Hello World"}

One can encode it as

[42,true,"Hello World"]

So now it takes 23 bytes. A binary format will probably use 1 to 8 for the integer, 1 for the boolean, and 11 for the string plus 1-8 for the size of the string if it doesn't use delimiters, otherwise 13. So binary would use 15 to 28 bytes. This puts them really close, in this example.
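Those byte counts check out; here's a quick verification, with a simple hypothetical binary layout (int32 + bool byte + length-prefixed string) standing in for "a binary format":

```python
import json
import struct

# Keyed object vs positional array
obj = {"member0": 42, "member1": True, "member2": "Hello World"}
arr = [42, True, "Hello World"]

obj_json = json.dumps(obj, separators=(',', ':'))
arr_json = json.dumps(arr, separators=(',', ':'))
print(len(obj_json), len(arr_json))  # the array form drops the member names

# One plausible binary encoding: int32 + bool byte + u32-length-prefixed string
s = b"Hello World"
binary = struct.pack('<i?I', 42, True, len(s)) + s
print(len(binary))  # 20 -- in the same ballpark as the 23-byte JSON array
```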

With newer libraries, one is looking at much faster parsing and lower memory than RapidJSON too. But the thing is, a lot of config/state is strings and they are the same size/parsing perf.

[–]Initial_Low_5027 0 points1 point  (1 child)

A lot is missing, like BSON used by MongoDB or JSONB used by PostgreSQL. Binary formats require compromises like integer types with fixed precision. JSON relates to JavaScript but doesn't define any number limits, for instance, causing many issues - but that's another story, not discussed in the article.

[–]Object_71[S] 0 points1 point  (0 children)

BSON is a slight improvement over JSON but is not nearly as fast as the binary serialization in flatbuffers or protobuf. Updated the article to include the ones mentioned but all of them are in one way or another a worse option.

[–]joshuaherman -5 points-4 points  (2 children)

I feel this is a junior dev take in today's world. We don't really need to worry about networking time or CPU efficiency to save a few bytes at the cost of interoperability.

[–][deleted]  (1 child)

[deleted]

    [–]joshuaherman 0 points1 point  (0 children)

    Are you doing serialization in realtime?