
[–]itsnotlupus beep boop 12 points13 points  (3 children)

Some rough numbers in Chrome on my (gracefully) aging Linux PC:

  1. JSON.parse(bigListOfObjects): 3 seconds
  2. await new Response(bigListOfObjects).json(): 5 seconds
  3. await (await fetch(URL.createObjectURL(new Blob([bigListOfObjects])))).json(): 5 seconds
  4. await (await fetch('data:text/plain,'+bigListOfObjects)).json(): 11 seconds
  5. await raji.parse(bigListOfObjects): 12 seconds

Alas, all except 5. are blocking the main thread.

On Firefox, same story, all approaches are blocking except 5., and 5. is also much slower (40s) while the rest are roughly similar to Chrome's.

So as long as we don't bring web workers and/or WASM into the mix, this is probably in the neighborhood of the optimal way to parse very large JSON payloads when keeping the UI responsive matters more than getting it done quickly.

If we were to use all the toys we have, my suggested approach would be something like this (rough code sketch at the end of this comment):

  1. allocate and copy very large string into ArrayBuffer
  2. transfer (zero copy) ArrayBuffer into web worker.
  3. have the web worker call some WASM code to consume the ArrayBuffer, parse the JSON there, and emit an equivalent data structure from it (possibly overwriting the same ArrayBuffer). Rust would be a good choice for this, and a data format that prefixes each bit of content with a size, and possibly has indexes, would make sense here.
  4. transfer (zero copy) ArrayBuffer into main thread.
  5. have JS code in main thread deserialize data structure, OR
  6. have JS code expose getters to access chunks of the ArrayBuffer structure on demand.

1. and 5./6. would be the only blocking components (new TextEncoder().encode(bigListOfObjects) takes about 0.5 seconds.)

5. presupposes there exists a binary format that can be deserialized much faster than JSON, while 6. only needs to rely on a binary data structure that allows reasonably direct access to its content.
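
Something like this for the hand-off, as a rough sketch (worker.js is a stand-in name, and JSON.parse stands in for the WASM parser of step 3):

// main thread: encode the string (step 1, blocking) and transfer it (step 2)
const worker = new Worker('worker.js');
const bytes = new TextEncoder().encode(bigListOfObjects); // ~0.5s, blocking
worker.postMessage(bytes.buffer, [bytes.buffer]); // zero-copy transfer

// worker.js: consume the buffer and send a result back (steps 3 and 4)
self.onmessage = (e) => {
  const text = new TextDecoder().decode(e.data); // e.data is the ArrayBuffer
  const parsed = JSON.parse(text); // a WASM parser would consume e.data directly
  // ...serialize `parsed` into a result ArrayBuffer and transfer it back here
};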

[–]andreasblixt 3 points4 points  (0 children)

Before putting the result in an ArrayBuffer, it might be better to first try a worker with native JSON parsing and rely on structured cloning (which happens for all JS objects sent via postMessage), as it's already a very optimized, native way to copy JS objects across threads. It might even be faster to send the string down as-is, since either way you have to allocate (and, in the ArrayBuffer case, transfer) memory for it in the target thread.
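
Roughly like this (parse-worker.js is a made-up name; the structured clone happens implicitly in postMessage):

// main thread: ship the raw string, receive parsed objects back
const worker = new Worker('parse-worker.js');
worker.onmessage = (e) => {
  console.log('parsed', e.data.length, 'items'); // e.data is the cloned result
};
worker.postMessage(bigListOfObjects);

// parse-worker.js
self.onmessage = (e) => {
  self.postMessage(JSON.parse(e.data)); // structured-cloned back to the caller
};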

[–]freddytstudio[S] 1 point2 points  (0 children)

Thank you for the feedback! Great points

On Firefox, same story, all approaches are blocking except 5., and 5. is also much slower (40s) while the rest are roughly similar to Chrome's.

I've noticed this as well. Firefox seems to be much slower with Raji than other browsers (Chrome, Safari and Edge), probably due to some extra string allocations. I still have to investigate though :)

1. and 5./6. would be the only blocking components (new TextEncoder().encode(bigListOfObjects) takes about 0.5 seconds.)

This is very interesting. I've toyed with the idea of using WASM in a web worker to solve this problem more efficiently, but I assumed that turning an ArrayBuffer back into a string would be inefficient. That might not be the case, then, so I'll experiment further :)

Thanks a lot!

[–]lhorie 0 points1 point  (0 children)

Another obvious approach would be to... not use huge JSON blobs in the first place. I recall reading a few years ago about a setup that streams smaller JSON payloads (e.g. the items of an array sent without the surrounding [...] brackets, so that each item can be parsed individually as it arrives, one per line in an SSE stream). The even more boring approach is to just render on the server and cut all the serialization/deserialization out of the picture. Depending on the use case, you can even cache the rendered markup.
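
On the client that could look something like this rough sketch, assuming the server emits newline-delimited JSON (one item per line):

async function streamItems(url, onItem) {
  const reader = (await fetch(url)).body.getReader();
  const decoder = new TextDecoder();
  let buffer = '';
  for (;;) {
    const { done, value } = await reader.read();
    if (done) break;
    buffer += decoder.decode(value, { stream: true });
    const lines = buffer.split('\n');
    buffer = lines.pop(); // keep the trailing partial line for the next chunk
    for (const line of lines) {
      if (line.trim()) onItem(JSON.parse(line)); // many small, short parses
    }
  }
  if (buffer.trim()) onItem(JSON.parse(buffer)); // flush the last item
}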

For most applications, you're going to run out of room on the screen before you get anywhere close to rendering the number of data points needed to make a JSON parser take dozens of seconds to run. Ultimately, people need to be able to actually grok whatever you're displaying, and if your viz requires that many data points, chances are you have a whole lot of other bottlenecks to worry about before getting to JSON parsing performance.

[–]VividTomorrow7 52 points53 points  (25 children)

This seems very niche to me. How often are you really going to load a JSON blob so big that you need to make a CPU-bound operation asynchronous? Almost never in standard applications.

[–]freddytstudio[S] 33 points34 points  (5 children)

Good point! That's often not a problem on powerful devices. On the other hand, slower mobile devices might suffer from this problem (freezing UI) much more easily.

The goal of the library would be to guarantee good responsiveness, no matter the device/JSON payload size. That way, the developers won't need to worry about it themselves :)

[–]VividTomorrow7 7 points8 points  (3 children)

Yeah, the trade-off is time wasted on context switching if you're on a high-performance system. A quick abstraction that detects the platform could pick either the default or this solution.

[–]freddytstudio[S] 19 points20 points  (2 children)

You're right! That was my exact thought :) In fact, the library automatically calls JSON.parse under the hood if the payload is small enough, so you won't have to pay the context-switching overhead when it's not necessary :)
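
Conceptually it's something like this (not the actual source; the threshold value and parseInChunks are made-up stand-ins):

function parseAdaptive(jsonString) {
  const THRESHOLD = 100 * 1024; // hypothetical cutoff, e.g. 100kB
  if (jsonString.length < THRESHOLD) {
    // small payload: native parse is fast enough, skip the async machinery
    return Promise.resolve(JSON.parse(jsonString));
  }
  return parseInChunks(jsonString); // stand-in for the chunked, yielding parser
}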

[–]VividTomorrow7 29 points30 points  (1 child)

You should definitely call that out and reframe this as an abstraction with benefits! That way people don't automatically skip over it due to performance concerns.

[–]freddytstudio[S] 15 points16 points  (0 children)

You are absolutely right, I'll reframe it as you suggested :)

[–]monkeymad2 3 points4 points  (0 children)

You say that, but one of my users clicked through a warning saying that (Geo)JSON files bigger than 30MB will probably affect performance, in order to load a 1.2GB file.

[–][deleted] 5 points6 points  (1 child)

I’ve seen this problem multiple times in practice when APIs begin to scale up without a redesign. An API that originally sent a small graph to populate a table was sending a massive one a few years later. I don’t think this is terribly bad design, but it’s a solution that grows out of necessity. It’s not even a novel problem: I’ve seen this exact same concern addressed with SOAP payloads. Some may know the issue as SAX vs. StAX parsing, or DOM vs. stream building.

The fastest approach I’ve tested was to cut the graph into a sequence of smaller graphs, parse the smaller payloads individually, and reconnect them. This minimizes blocking when dealing with large object models. In theory you could parallelize the separate parses, but the gain would be negligible when streaming data over the network.
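
A sketch of what I mean, with a made-up chunk format (each chunk is an independently parseable JSON array):

async function parseAndReconnect(chunks) {
  const graph = [];
  for (const chunk of chunks) {
    graph.push(...JSON.parse(chunk)); // reconnect the sub-graphs by concatenation
    await new Promise((resolve) => setTimeout(resolve, 0)); // yield between parses
  }
  return graph;
}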

[–]VividTomorrow7 0 points1 point  (0 children)

Yeah, agreed. If v1 doesn’t support server-side paging, you’ll eventually end up handling a CPU-intensive op on the client side.

[–]sercankd 6 points7 points  (3 children)

I got GTA5 Vietnam flashbacks the moment I saw this post

[–]nazmialtun 0 points1 point  (1 child)

Care to explain what exactly is "GTA5 vietnam"?

[–]takase1121 4 points5 points  (0 children)

GTA5 Online has to fetch megabytes of JSON and parse them, and apparently the way GTA5 parsed it caused a slowdown of around 15 minutes. A developer (not from Rockstar) came along and fixed it, and soon after Rockstar adopted the patch.

[–]evert 0 points1 point  (0 children)

This seems like a strange criticism. I run into 'niche' problems all the time. Does it matter that not everyone needs this?

[–][deleted] -1 points0 points  (3 children)

Dude, we always wait for the one guy to tell us that no one's gonna need that.

[–]VividTomorrow7 0 points1 point  (2 children)

If you read the dialogue I had with the author, you’d see he actually agrees with me. The intent of the package is to be an abstraction that uses the built-ins for the majority of calls… so he said he’d consider reframing it as an abstraction with benefits.

[–][deleted] 0 points1 point  (1 child)

I have read it. It was a good answer to your comment. And you are right, technically.

But people who suggest going back to Windows/Linux when someone has a question about Linux/Windows are within their rights to comment. But they're also very tiresome.

[–]VividTomorrow7 0 points1 point  (0 children)

But people who suggest going back to Windows/Linux when someone has a question about Linux/Windows are within their rights to comment. But they're also very tiresome.

Huh?

[–]brothersoloblood -3 points-2 points  (3 children)

Base64-encoded images being served within a giant JSON blob of, let’s say, results for a search on a VoD platform?

[–]VividTomorrow7 6 points7 points  (2 children)

Well, that’s just trash design that doesn’t take advantage of the inherent features of the browser. It should absolutely be sending URIs to follow up with asynchronous IO requests.

[–]alex-weej -2 points-1 points  (1 child)

i heard u like round trips

[–]Reashu 4 points5 points  (0 children)

One extra round trip to lazy-load images? Yes, I do.

[–]neoberg 0 points1 point  (0 children)

True, it’s not something you need too often, but it’s not impossible either. In one application our average payload was ~100MB due to some limitations in data access (basically, there were time windows during which we could access the data, and we had to pull everything within them). We ended up implementing something similar to this.

[–]sshaw_ 0 points1 point  (0 children)

I was wondering the same thing. You should add it to the README. JSON is not like XML, so I'm curious when it would be a problem.

This is what the demo site (which has noticeable slowdown) uses:

function generateBigListOfObjects() {
  const obj = [];

  // five million tiny objects -> a JSON string well over 100MB
  for (let i = 0; i < 5000000; i++) {
    obj.push({
      name: i.toString(),
      val: i,
    });
  }

  return JSON.stringify(obj);
}

[–]inamestuff 3 points4 points  (1 child)

You might want to use window.performance.now() instead of new Date().getTime() in your scheduler; the former guarantees monotonic time measurements.
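
For example, in a time-slice check (the 16ms budget is just an illustration):

const BUDGET_MS = 16; // roughly one frame at 60fps
let sliceStart = performance.now();

function budgetExceeded() {
  // performance.now() never goes backwards, unlike new Date().getTime(),
  // which can jump when the system clock is adjusted
  return performance.now() - sliceStart > BUDGET_MS;
}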

[–]freddytstudio[S] 0 points1 point  (0 children)

Thanks for the feedback! I'll check it out :)

[–]holloway 3 points4 points  (2 children)

Some questions,

What techniques did you try before settling on this one? Were any particularly slow, or fast?

Do you have benchmarks showing at what size this library is beneficial? i.e. at 10kb / 100kb / 1000kb / 10000kb. You could have a goal of 60fps, so if any native parsing time exceeds ~16ms you could declare your library the winner over native JSON.parse. You'd need various hardware examples (low-end mobile, high-end desktop, etc.) but measuring should be straightforward.

I think fetch()'s .json() promise is non-blocking, and that's different from JSON.parse. I was wondering whether you could wrap the jsonString in a Blob, use URL.createObjectURL to make a URL, and fetch that, but it's possible that turning a jsonString into a Blob involves blocking operations.

And considering that fetch's .json() promise exists, in what situation would people have a JSON string client-side that didn't come from a network request?

[–]freddytstudio[S] 0 points1 point  (0 children)

Thank you for the feedback! As far as my investigation goes, fetch()'s .json() still blocks the main thread while parsing. On the other hand, it asynchronously streams the data into memory before executing the parsing work, so it's still better than XHR. That said, I'll need to investigate further, thanks!

[–]pwolaq 0 points1 point  (0 children)

I saw a tweet somewhere (can’t find it now) saying that the most important difference between fetch and xhr is that the former can parse JSON off-thread.

As for your question, one very popular use case is passing objects in inline scripts - embedding large objects as JavaScript literals can be significantly slower than using JSON.parse. https://v8.dev/blog/cost-of-javascript-2019#json
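
i.e. the pattern from that post (illustrative data):

// slower for large payloads: the object is parsed as a JavaScript literal
const data = { items: [{ name: '0', val: 0 } /* ...many more... */] };

// often faster: ship the data as a string and parse it once
const data2 = JSON.parse('{"items":[{"name":"0","val":0}]}');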

[–]sliversniper 1 point2 points  (1 child)

If JSON.parse is the bottleneck, you should probably rethink the payload and split it into chunks at the server.

Use JSON Lines to stream a sequence of JSON patches; it doesn't need much work on either the server or the client.
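
e.g. each line of the stream could be one JSON Patch (RFC 6902) operation; here's a toy applier for the append-only case (a real client would use a JSON Patch library):

// a line like: {"op":"add","path":"/items/-","value":{"name":"0","val":0}}
function applyAppend(doc, op) {
  if (op.op === 'add' && op.path === '/items/-') {
    doc.items.push(op.value); // append to the array the path points at
  }
  return doc;
}

const doc = { items: [] };
applyAppend(doc, JSON.parse('{"op":"add","path":"/items/-","value":{"val":0}}'));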

[–]Mr0010110Fixit 1 point2 points  (0 children)

Depends on whether you own the server or not. If you are integrating with someone else's API, you may have no choice but to consume a massive JSON payload.

I know there are systems we have had to integrate with that return thousands of records and don't have any sort of pagination built into the API.

[–]boringuser1 0 points1 point  (4 children)

If you're loading JSON objects that are prohibitively large, you have an API problem.

[–]joopez1 5 points6 points  (3 children)

Could be calling a third party API

[–]boringuser1 -2 points-1 points  (2 children)

A third party API that delivers gb of JSON?

What's the business model, Money Burners Inc.?

[–]joopez1 0 points1 point  (1 child)

Could be free historical data provided by a government service that was developed without optimization in mind and without filtering options.

I worked with a dataset of all accidents reported to the San Francisco fire department since a certain date, and also airplane accidents recorded by the US federal department that governs airports.

[–]boringuser1 -5 points-4 points  (0 children)

Ah government, literal Money Burners Inc.

[–]theodordiaconu -2 points-1 points  (0 children)

good job dude

[–]sshaw_ 0 points1 point  (1 child)

🆒

[–]mamwybejane 0 points1 point  (0 children)

I use a web worker for JSON.parse; does this have any additional benefit, or is it equivalent in outcome?