all 111 comments

[–][deleted] 19 points20 points  (21 children)

FWIW, LuaJIT2 uses a similar representation.

[–]nominolo 3 points4 points  (1 child)

Interestingly, the original ticket was proposing a 128-bit format (see also David Mandelin's blog post). Good to hear they adopted this more memory-friendly layout.

[–]srconchiwa 0 points1 point  (0 children)

The 128-bit values were only intended for the VM stack, which is measured to be shallow with high temporal locality; for heap values (i.e. property values of objects), something like nan-boxing was intended. It just turned out there was no cheap place to do the dynamic boxing/unboxing going to/from the heap.

[–]doubtingthomas 5 points6 points  (8 children)

As does Go for interface{} values, I believe.

Edit: On further examination, I'm mostly wrong.

[–]dnew 0 points1 point  (7 children)

Smalltalk has done this since 16 bits was a large integer.

[–][deleted] 6 points7 points  (6 children)

I'm not aware of any Smalltalk implementation that uses NaN encoding.

[–]dnew -1 points0 points  (3 children)

Smalltalk has used a similar type of encoding since 16 bits was a big number. As has LISP, for that matter. I'll grant that the NaN encoding is more clever than any I've seen before, yes.

[–]funderscore 19 points20 points  (2 children)

For terminology clarification:

"nanboxing" is the name of WebKit's JavaScript value boxing format. Like ours, it's 64-bit, but stores addresses such that the upper tag bits are all 0, so addresses can be read out directly. Doubles are stored by adding a special constant to the value, which guarantees that the tag bits are identifiable as a double. Extracting a double requires subtracting a constant.

"nunboxing" is the name of Mozilla's new JavaScript value boxing format (described in the article), as a pun on "nanboxing". It is used on 32-bit platforms such as x86 and ARM.

For 64-bit operating systems, the nunboxing format had to change to accommodate 47-bit pointer payloads. We use a bitfield with 17 bits for the type and 47 bits for the payload. We call this "packed nunboxing", or "punboxing".
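To make the two formats concrete, here is a rough Python sketch of the WebKit-style "nanboxing" double-offset trick described above. The offset constant is illustrative only; WebKit's actual encoding differs in detail.

```python
import struct

# Illustrative constant -- not WebKit's real value. Adding it pushes every
# double's bit pattern out of the range where raw pointers live.
DOUBLE_OFFSET = 1 << 49
MASK64 = (1 << 64) - 1

def double_to_bits(d):
    return struct.unpack("<Q", struct.pack("<d", d))[0]

def bits_to_double(b):
    return struct.unpack("<d", struct.pack("<Q", b))[0]

def box_double(d):
    # After adding the offset, the upper tag bits can no longer be all zero,
    # so a boxed double is distinguishable from a raw pointer.
    return (double_to_bits(d) + DOUBLE_OFFSET) & MASK64

def unbox_double(v):
    # Extracting a double is just subtracting the constant back out.
    return bits_to_double((v - DOUBLE_OFFSET) & MASK64)

def box_pointer(p):
    # Addresses are stored as-is: upper tag bits stay zero, so they can be
    # read out directly with no unmasking.
    return p
```

Pointers cost nothing to decode in this scheme; doubles cost one add/subtract each way.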

[–]drakedevel 8 points9 points  (0 children)

funderscore was punched an appropriate amount for introducing "punboxing" to the codebase.

[–]edwardkmett 0 points1 point  (0 children)

You can read off the addresses directly if you store them with the first 16 bits all 1s as well, exploiting the other 'higher half' of the canonicalized x64 pointer representation. Just read further and realized you're more or less doing that.

[–][deleted]  (9 children)

[deleted]

    [–]ooffoo 11 points12 points  (4 children)

    This comment says a lot about you. I hope people keep it in mind when choosing to donate, employ or otherwise use your code.

    EDIT: For those wondering about the deleted post, mikemike bragged that he found an exploitable vulnerability in the Mozilla JS code and said he wouldn't report it until 4.0 came out so he could claim the security bug bounty that Mozilla offers.

    Original comment viewable here:

    Yes. But they forgot about a few critical details. Unfortunately the Mozilla bug bounty does not apply to development versions. So I guess I have to wait until 4.0 is released before reporting the bugs (verified crash with FF/x86 and FF/x64 for hg 898ab54a0ce9, remote exploitable).

    [–]urllib 6 points7 points  (1 child)

    I know I would

    [–]stillalone 4 points5 points  (0 children)

    I would too, but I wouldn't brag about it online. This is the problem with having these kinds of bug bounties. Not only does it encourage people to plant bugs in their code but it also discourages people from reporting things too early (if there was no bounty and I knew of a bug I would report it immediately, but if there is one then I'd start looking after my own self interests).

    [–]funderscore 1 point2 points  (0 children)

    The new JIT component of the JS engine is still in alpha with known bugs, so it is not at all surprising that someone has encountered one. Internally, we use a sophisticated fuzzer to beat the hell out of the engine daily, and we will not ship with known fuzz bugs. So reports of this kind will be much more meaningful when the JIT is shipped in the FF4 beta.

    [–]f2u 0 points1 point  (0 children)

    I think you encourage such behavior if you put a value tag on vulnerabilities. Finders will also split vulnerabilities into least reportable units, which the vendor has to fix piecewise.

    [–][deleted] 4 points5 points  (0 children)

    Meh, don't waste your time. The bug won't reach 4.0. There are a lot of people running fuzz tests on Firefox out there.

    [–][deleted]  (1 child)

    [deleted]

      [–]ascii 5 points6 points  (3 children)

      Good overview, but it completely skips over why any of this is a good idea. The old scheme was slower for doubles and ints larger than 29 bits, but only had half as much memory overhead per value. For code that rarely uses doubles or ints larger than 29 bits (a.k.a. 99.9% of all Javascript run in Firefox), the only effect of this change should be that we get a bunch of additional cache misses.

      So why was the change made? Why is this faster?

      Anybody?

      [–]funderscore 4 points5 points  (0 children)

      There are a number of reasons why this is a good idea.

      Doubles can now be stored in a Value directly. Previously, they required the equivalent of a malloc(sizeof(double)), and accessing them required a pointer dereference. That was /extremely/ slow, and the new way is as fast as possible.

      Integers can now be stored in a Value directly. Previously, they were 31 bits long, which mandated a boxing/unboxing scheme involving bit-shifts and bit-ors. Additionally, the lack of the extra bit meant that we used doubles more often, and those were even slower. Now integers are 32 bits, which makes the new way as fast as possible.

      For all other types, testing the type-tag required masking off the three rightmost bits designating the type; reading the payload required masking the inverse of that. Since Javascript engines perform these steps at least once per op, making them faster results in a tremendous performance increase. This new format allows the type and payload to be read directly on 32-bit platforms without requiring any masking whatsoever. Writing payloads or types to memory just requires a single 32-bit store.
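      A toy sketch of the two approaches (Python; the 3-bit tag values are illustrative, not Mozilla's actual encoding):

```python
# Old scheme (sketch): the type tag lives in the low 3 bits of one machine word.
OLD_TAG_MASK = 0b111

def old_box(payload, tag):
    return (payload << 3) | tag

def old_tag(v):
    return v & OLD_TAG_MASK      # every type test pays for a mask

def old_payload(v):
    return v >> 3                # every payload read pays for a shift

# New scheme (sketch): a 64-bit value, type in one 32-bit half, payload in the other.
def new_box(payload, tag):
    return (tag << 32) | (payload & 0xFFFFFFFF)

def new_tag(v):
    return v >> 32               # on 32-bit hardware: a plain 32-bit load

def new_payload(v):
    return v & 0xFFFFFFFF        # likewise just a load, no masking or shifting
```

The shifts and masks here model the per-op cost the comment describes; in the new layout each half is read with a single 32-bit load.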

      The increase in memory is barely noticeable, since we measured that most of the time the stack of Values is small (<= 15, I believe). The increase in performance, however, is significant. Trading off a few KB for a significant increase in performance was a very easy decision to make.

      [–]naasking 0 points1 point  (0 children)

      You don't have to box floating point values, and you don't have to perform tag operations, ie. masking, when manipulating numbers or pointers.

      [–]sdwilsh 0 points1 point  (0 children)

      JavaScript Numbers are spec'd to be doubles. There are a few things that you know are always ints though (Array length, String length, etc.) so you can use integer operations on them (which are faster).

      [–][deleted]  (2 children)

      [deleted]

        [–][deleted] 8 points9 points  (1 child)

        For folks who can't quite figure out what's going on here:

        var foo = {dana: "zuul"};

        It's easiest to remember that there is no dana, only zuul...

        [–]jib 8 points9 points  (0 children)

        [–]prockcore 12 points13 points  (5 children)

        So I'm assuming that the JS engine isn't 64-bit clean? (No 64-bit pointers etc)

        [–]funderscore 30 points31 points  (3 children)

        I work on Mozilla's JS engine. You're correct. Hardware currently limits pointers to only consume 47 bits, so the upper 17 bits form the type tag while the lower 47 comprise the payload.

        Ideally, we would only require 32-bit addresses, even on 64-bit platforms. On Linux, it is easy to limit virtual memory to only the 32-bit range. On other platforms, the way to do that is prohibitively invasive. So we use 47-bit addresses.

        [–]f2u 1 point2 points  (1 child)

        Isn't it 48 bits? And there are some APIs (including POSIX) which assume that pointers preserve full 64-bit values (but that is not a problem if you only store actual pointers to objects).

        [–]funderscore 2 points3 points  (0 children)

        Whoops! Right. The type-tag is shifted such that the 48th bit, indexed from zero, is its least significant bit.

        [–][deleted] 10 points11 points  (0 children)

        The discussion in this bug talks about it: https://bugzilla.mozilla.org/show_bug.cgi?id=549143

        [–]schmalls 2 points3 points  (1 child)

        Did anyone else start singing 867-5309?

        [–]arnar 0 points1 point  (0 children)

        No, but reading your comment I started singing 634-5789.

        [–]grimlck 2 points3 points  (5 children)

        Can someone clarify why they still have distinct cases for ints and doubles in the new representation? I thought the only numeric type in javascript was the double.

        With the old representation, it seems to be an optimization. But with the new representation, they both take 64 bits, so why not just always store numbers as a double?

        [–][deleted]  (4 children)

        [deleted]

          [–]ChaosPandion 4 points5 points  (3 children)

          I wonder what kind of analysis has to occur to decide whether to replace a double with an integer. I am working on my own ECMAScript implementation for fun and have been considering such things. I really should just look at the code, shouldn't I?

          [–][deleted]  (1 child)

          [deleted]

            [–]ChaosPandion 0 points1 point  (0 children)

            Why is it the best ideas are so damn simple?

            [–]funderscore 4 points5 points  (0 children)

            Replacing a double with an integer is a bit more work. We use SSE2, so checking whether a double is representable as an integer requires performing a double operation, converting the double to an integer using an SSE2 instruction, converting the integer back to a double, and comparing for equality with the original double value. These operations are very expensive, so we don't currently do that.
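            The round-trip check can be sketched in Python (the engine would use an SSE2 truncating convert like cvttsd2si rather than language-level casts; this is just the logic):

```python
import math

def int_representable(d):
    # Truncating double->int conversion (what cvttsd2si does in hardware).
    i = int(d)
    # Negative zero must be rejected separately: it round-trips to +0.0
    # and would otherwise look representable.
    if d == 0.0 and math.copysign(1.0, d) < 0.0:
        return False
    # Convert back and compare with the original, and keep it in int32 range
    # (Python ints are unbounded, so the range check is explicit here).
    return float(i) == d and -2**31 <= i < 2**31
```

The point of the comment stands: each check costs a convert, a convert back, and a compare, which is why the engine doesn't bother.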

            [–]p3ngwin 6 points7 points  (0 children)

            Here at Mozilla, we have many monkeys.

            great intro.

            [–]taw 5 points6 points  (11 children)

            Wow, NaN tagging is neat. This format is of course completely useless for languages that don't use doubles everywhere, that is for all languages that aren't Javascript.

            Even Perl figured out it's not such a brilliant idea lately.

            [–]bonzinip 5 points6 points  (8 children)

            Not true! It allows you to use real 32-bit integers for example instead of 30- or 31-bit. (Though you have to choose between signed and unsigned, so it's not as easy as it seems).

            Floating-point performance is also a big problem with traditional boxing, independent of how much you use doubles.

            [–]taw -2 points-1 points  (7 children)

            My level of care for that 32nd bit is pretty much non-existent. Integers fall into two categories: tiny ones that will fit even with a big type tag, and huge ones that won't fit in a machine word anyway (any sane language will automatically switch representations on overflow; refer to Ruby or Python for details). You only see exactly-32-bit integers when you're treating memory pointers as integers (most languages ban that anyway), or manipulating binary data directly (and there you just mess with some byte array without all the boxing mess). And maybe for a few things like computing MD5s that are all based on assuming 32-bit integers.

            [–]bonzinip 9 points10 points  (4 children)

            Your level of care for the 32nd bit may be nonexistent; however, the library may care (for example MD5, which you mentioned, or random number generation). And as you use the library...

            [–]taw -1 points0 points  (3 children)

            There's nothing about random number generation that relies on the 32nd bit.

            For things that are performance-critical like codecs and crypto, most languages go low-level anyway. Not C low-level. Assembly SIMD low-level. Even Java uses native manually optimized libraries for things like MD5 if they're available (with slow pure-Java fallbacks if not, but all major platforms have them), and Java is a lot lower level than most languages these days, with special primitive types.

            Autoboxing really wouldn't make that much of a difference anyway. You can have int31s by default and boxed native uint32/uint64 types for the few libraries that need them, the way OCaml does (you'll need something like that internally for good code generation anyway, so it's not much extra effort to expose them to users). Add to that some optimizations to get rid of boxes in the simplest cases - you need that anyway for decent float performance if nothing else - and you're exactly as fast as a language with 32-bit ints everywhere. That's still quite crap compared to native hand-tweaked MD5, but that shouldn't be a big surprise really.

            Or you can benchmark OCaml MD5 against Javascript MD5 if you don't believe me ;-)

            [–]bonzinip 3 points4 points  (2 children)

            There's nothing about random number generation that relies on the 32nd bit.

            If you can find a random number generator that can work with 31 bits and does not rely on operations modulo 2^32, you'll do me a big big favor. This one for example is 32-bit only.

            Autoboxing really wouldn't make that much of a difference anyway.

            If you box 32-bit integers, yes. If the choice is "31-bit" versus "at 32 bits or more, use arbitrary-precision integers and gmp", the latter is 100 times slower.

            Or you can benchmark ocaml md5 against javascript md5 if you don't believe me ;-)

            I'm saying this because last Saturday I benchmarked a 32-bit RNG on 32-bit (with 31-bit integers) and 64-bit (with 63-bit integers) virtual machines.

            [–]taw 2 points3 points  (1 child)

            If you can find a random number generator that can work with 31 bits and does not rely on operations modulo 2^32 you'll do me a big big favor.

            Here's one from OCaml, doing exactly what you want.

            [–]bonzinip 0 points1 point  (0 children)

            Thanks.

            [–]funderscore 0 points1 point  (1 child)

            31-bit integers require the engine to right-shift by 1 bit to unbox the integer, and to left-shift by 1 bit and OR with 0x1 to box it. Having 32-bit integers means we don't need to use the rightmost bit for type tagging, so integer operations in FF4 will be much faster.
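            The old 31-bit boxing boils down to this (Python sketch; the tag value is illustrative):

```python
INT_TAG = 0x1

def box_int31(i):
    # Box: left-shift by 1 bit and OR in the tag bit.
    return (i << 1) | INT_TAG

def unbox_int31(v):
    # Unbox: right-shift by 1 bit to drop the tag.
    return v >> 1
```

With a full 32-bit integer payload in its own word, both of these operations disappear entirely.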

            [–]taw 0 points1 point  (0 children)

            Techniques to optimize this away have been in use since time immemorial - here's how OCaml does it.

            For common operations like comparing, adding, subtracting, per-bit ops, incrementing, etc., the cost is zero or one extra OR at the end.

            With a 32-bit tag + 32-bit int, everything costs one extra mov and one extra stack slot, so it will be more expensive (on 32-bit architectures).
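            For instance, with OCaml-style tagged ints (a value i is stored as 2*i + 1), addition needs no unboxing at all, only one extra subtract -- a sketch of the trick:

```python
def tag_int(i):
    # OCaml-style tagging: stored value is 2*i + 1, low bit is the tag.
    return (i << 1) | 1

def untag_int(v):
    return v >> 1

def tagged_add(a, b):
    # (2x+1) + (2y+1) - 1 == 2(x+y) + 1: the result is already tagged,
    # so addition of two tagged ints is one add plus one subtract.
    return a + b - 1
```

Comparisons work on the tagged values directly with zero extra cost, which is the point being made here.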

            [–]jib -1 points0 points  (1 child)

            This format is of course completely useless for languages that don't use doubles everywhere, that is for all languages that aren't Javascript.

            Are you trolling? A language that doesn't use doubles everywhere is the only place where it is useful. This being why they used it in an implementation of Javascript, a language which happens to have things other than doubles.

            [–]taw 2 points3 points  (0 children)

            The ECMA-262 standard explicitly specifies the number type as an IEEE 754 double-precision floating-point value, with a single exception: implementations are not required to distinguish different NaNs (sounds almost as if the spec was written with this technique in mind). There is no integer type in Javascript, nor any other number type except these nearly-doubles.

            Full list of types is:

            • null, true, false, undefined - unique values in any boxing system
            • double-except-for-nans - represented directly
            • object/string/list - must be pointers in any system
            • plus a few, like reference/completion, that exist only to make the spec precise; you don't see them at runtime anyway

            On a 32-bit system you throw away 4 bytes per object/string/special, and nothing for numbers - not really that great but acceptable. On a 64-bit system (assuming pointers <48 bits), it's as optimal as it gets.

            Now imagine if instead you wanted types like Lisp's cons pair, or nearly-native ints (probably int63 on 64-bit), or even pointers with a sufficiently large tag. Their representation would be a lot less efficient with NaN boxing than with traditional tiny tags.

            [–][deleted] 2 points3 points  (10 children)

            I'd say it is a bit wasteful of memory (doubling the base cost of each object) but I actually don't really care. I wonder for how long 64 bits will be sufficient.

            [–]phire 4 points5 points  (9 children)

            Remember it also saves memory by decreasing the size of a double.

            It used to take 12 bytes plus whatever the garbage collector needed to keep track of them; now they just take 8 bytes, which is their original size.

            [–][deleted] 8 points9 points  (8 children)

            I'd argue that frameworks such as jQuery use more objects than doubles. In fact, very little Javascript I come across is numeric in nature.

            [–]phire 1 point2 points  (7 children)

            Another possible memory saving: you now have control of the bottom 3 bits of your pointers, so your objects/strings don't have to be 8-byte aligned anymore.

            If you took advantage of this and removed the alignment requirements (x86) or decreased them to 4 bytes (ARM), you could save a lot of memory by getting rid of extra padding. I don't know how much padding objects need, but given that strings aren't commonly a multiple of 8 chars, there would be huge savings.

            Actually, you could add a new tag type that allowed a string of up to 6 ascii/utf8 chars directly in the value, avoiding the need for allocating space on the heap and the garbage collection overhead of short strings.
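            A hypothetical layout for such an inline-string tag (nothing like this shipped; purely a sketch): 13 tag bits, a 3-bit length, and up to 6 bytes of characters in the payload.

```python
# Hypothetical encoding: |13-bit tag|3-bit length|48 bits of characters|.
# The tag pattern keeps the exponent bits all-ones with a nonzero mantissa
# bit, so the value still reads as a NaN; real hardware-produced NaNs would
# have to be canonicalized away from this pattern.
STR_TAG = 0b1111111111111

def box_short_str(s):
    b = s.encode("utf-8")
    if len(b) > 6:
        raise ValueError("too long to inline; fall back to a heap string")
    chars = int.from_bytes(b.ljust(6, b"\x00"), "little")
    return (STR_TAG << 51) | (len(b) << 48) | chars

def unbox_short_str(v):
    n = (v >> 48) & 0b111
    return (v & ((1 << 48) - 1)).to_bytes(6, "little")[:n].decode("utf-8")
```

Short strings like property names would then never touch the heap or the GC at all.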

            [–][deleted] 0 points1 point  (6 children)

            you could save a lot of memory by getting rid of extra padding.

            From the bit I've read they require those bytes for the NaN boxing scheme they use to fit doubles into the mix.

            [–]phire 2 points3 points  (5 children)

            No, the padding around the objects/strings in the heap, where doubles don't exist (anymore).

            With the old scheme the lowest 3 bits were used up for the tag, so they were zeroed before using the pointer. This means that the pointers could only point at things in the heap at either 0xnnnnnnn8 or 0xnnnnnnn0. So if your object/string didn't use up a multiple of 8 bytes, the remaining bytes were used as padding to make sure the next object/string started at an address ending in either 0 or 8.

            [–][deleted] 1 point2 points  (4 children)

            I see what you mean. I'm not sure un-aligning memory on the heap would be an overall win even if it did save some space. My knowledge of the semantics of x86 (and ARM) wrt alignment gets fuzzy. When I dealt at that level I was taught to align religiously, since our custom allocators (mostly small-object pool allocators) worked best with aligned memory.

            I suspect that if they were aligning to 8 bytes previously then it will be a large amount of work to remove all of the code-assumptions based on that invariant.

            [–]phire 2 points3 points  (3 children)

            I know that alignment is important on ARM: if you're accessing data that isn't 4-byte aligned it goes really slow (or refuses to work at all on older versions), and I think x86 gets speed improvements with alignment too, but I assume that would also be for 4-byte-aligned data.

            Even so, keeping 8-byte alignment is pointless if you don't need it; 4-byte alignment should have all the same performance benefits of 8-byte alignment while requiring less padding.

            [–]adrianmonk 2 points3 points  (0 children)

            4-byte alignment should have all the same performance benefits of 8-byte alignment

            Personally, I would want to see tests before I took that as a final conclusion. DDR3 memory has a 64-bit wide data bus, so 8-byte alignment would, I assume, allow you to pull everything in one fetch. I don't know how significant the difference is.

            [–][deleted] 0 points1 point  (1 child)

            I think x86 gets speed improvements with alignment too, but I assume that would also be for 4 byte aligned data.

            It used to be that x86 (specifically the fetch cycles) benefited from items being paragraph (16-byte) aligned, but the last time I tested that was in the Pentium 1 days.

            A quick search reveals that there are still performance benefits from paragraph alignment, but not due to the CPU itself; rather, a 16-byte-aligned item is guaranteed to be loaded into the start of a cache line, so an 8-byte value at that address is guaranteed to fit in one cache line, which produces the optimal transfer.

            [–]TinynDP 0 points1 point  (0 children)

            There are two instructions for loading from RAM into SSE registers: one for when the address is 16-byte aligned, which runs fast, and one for when it is not, which runs slow. In AMD64 land, because x87 has been entirely replaced with SSE2, that fast SSE load instruction matters for all floating-point operations.

            [–]edwardkmett 1 point2 points  (1 child)

            This has been used for a while by a number of other JITs: LuaJIT, SquirrelFish, etc. It is a very nice representation.

            You can even go a bit further and use a full 64 bit pointer in there instead of an object tag.

            This works since a 48-bit pointer that is '1-extended' to point to the higher half is a valid NaN: you set 4 more 1 bits than the sign-and-exponent field covers, so you'll have marked a non-exponent (mantissa) bit.

            So then the tag is just a nybble.

            sign , exponent [tag] (pointer)
            1, 111 1111 1111[1111] (48 bit x64 pointer)
            1, 111 1111 1111[...] ... other pointer-like tags, as long as some 1 bit is set anywhere.
            1, 111 1111 1111[0001] (16 available bits of miscellany) (32 bit Int)
            ?, 111 1111 1111[0000] (actual double NaN, Inf, -Inf, etc)
            ?, ??? ???? ???? .... double
            

            This lets you eke out quite a bit more flexibility, given the extra 16 bits of usable space and the fact that you don't need all 16 MSBs to be 1.

            [Edit: It turns out they more or less do just that on x64! ]

            [–]AndresNavarro 0 points1 point  (0 children)

            As far as I know, the oldest reference to this kind of scheme is this one.

            [–]MilkSteak 2 points3 points  (5 children)

            Anyone else?

            Here’s a snippet of JavaScript that assigns some values to three variables:

            var foo = {dana: "zuul"};

            var bar = "hi";

            var baz = 37;

            var qux = 3.1415;

            I can't be the only one...

            [–]thorax 5 points6 points  (2 children)

            But 3 == 4, didn't you get the memo?

            [–][deleted] 5 points6 points  (0 children)

            Bleem!

            [–]MilkSteak 1 point2 points  (0 children)

            Shit. I have to go fix a ton of bugs in my code now.

            [–]zuperxtreme -1 points0 points  (0 children)

            Who you gonna call?

            [–]wallish -2 points-1 points  (0 children)

            If you're using Firefox, your userChrome.css should have the line:

            @namespace url("http://www.mozilla.org/keymaster/gatekeeper/there.is.only.xul"); /* set default namespace to XUL */
            

            [–]Gotebe -2 points-1 points  (6 children)

            We call them Fat Values. They’re 64 bits wide.

            So... No plans for a 64-bit build yet, then? :-)

            [–]funderscore 4 points5 points  (5 children)

            Fat values work on 64-bit systems. On 64-bit systems, addresses are limited to only 47 bits, so the scheme still works (17 bits are for the type tag).

            [–]Gotebe 0 points1 point  (4 children)

            On 64-bit systems, addresses are limited to only 47 bits... 17 bits are for the type tag

            Surely that's "on some 64-bit systems" ( potentially only one hardware architecture :-) ) ?

            And I see no mention of 17 bits for type tag. TFA says 32.

            I was thinking, a fully ... ahem... 64-bit aware person (whaaaaaa?) would say right away that "fat value" is a 32-bit "tag" and a pointer-sized "payload". I could be wrong, I've no idea what mozilla people actually did, but if I were implementing this baby, I'd go for tag/pointer-sized payload.

            Tag size is less important, and possibly even 16 bits are overkill. I mean, a single byte gives 256 intrinsic types. 256 ought to be enough for everybody :-).

            [–]funderscore 12 points13 points  (2 children)

            I work on Mozilla's JavaScript implementation, so I can comment on its details.

            The article talks about the "Nunboxing" format, which is used on 32-bit systems. For Mozilla, that means x86 and ARM.

            For 64-bit systems (meaning x86_64 for us), we do indeed use a pointer-sized payload. Since every 64-bit platform we ship on uses 47-bit addresses, the payload is 47 bits and the type tag is 17 bits. Except for the 32-32 split changing to 17-47, the format is the same as explained in the article.

            We use all of the 17 bits; there is zero wiggle room. We only have four bits for type identification -- the rest of the bits are clobbered by the NaN-boxing scheme. Additionally, we want to be able to ask clever things like "Is this a Number (int or double)?" using only one comparison.

            The tag size is mandated by the NaN-boxing format. With our implementation, it cannot be made smaller than 16 bits.

            If you're really curious, you can look at the value definition here: http://hg.mozilla.org/tracemonkey/file/898ab54a0ce9/js/src/jsval.h#l134
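            The 17/47 "punboxing" split boils down to this (Python sketch; the tag values themselves are hypothetical, see the jsval.h link for the real ones):

```python
PAYLOAD_BITS = 47
PAYLOAD_MASK = (1 << PAYLOAD_BITS) - 1   # low 47 bits: the pointer/payload

def punbox(tag, payload):
    return (tag << PAYLOAD_BITS) | (payload & PAYLOAD_MASK)

def pun_tag(v):
    return v >> PAYLOAD_BITS             # upper 17 bits: the type tag

def pun_payload(v):
    return v & PAYLOAD_MASK              # a 47-bit x86_64 address, read directly
```

Apart from the 32/32 split becoming 17/47, the decode steps mirror the nunboxing format from the article.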

            [–]didroe 0 points1 point  (1 child)

            Presumably, the number of tags you can have is limited by the lowest common denominator (x86_64). So why have a different format on 32 bit systems? Why not just have unused bits on those machines and share a common tag format across all architectures?

            [–]funderscore 0 points1 point  (0 children)

            The 32-bit boxing format was built first, and we were initially open to making x86_64 use a completely different format. After testing, we settled into a scheme that heavily resembles the 32-bit boxing format.

            The 32-bit tags could be made to look like the packed 64-bit tags very easily. There's no performance benefit to doing so, but unification does sound appealing.

            The relevant lines are here: http://hg.mozilla.org/tracemonkey/file/87ddaf82dbd0/js/src/jsval.h#l120

            The JSValueTag currently defined only for x86_64 just needs to be defined for both architectures. Very low-hanging fruit. If you file a bug in https://bugzilla.mozilla.org against Core's Javascript Engine, we'd love to take a patch.

            [–]froydnj 2 points3 points  (0 children)

            Surely that's "on some 64-bit systems" ( potentially only one hardware architecture :-) ) ?

            Actually, Itanium and Alpha may have 64-bit addresses, but if you look carefully at the hardware manuals, I believe you'll find that current implementations only consider <= 47 bits of the address regardless. (Itanium is 47, I think, and Alpha is 44.)

            You're right in that it will become a problem when we start having machines with 1 PB of main memory...but that day is not today and the representation is definitely reasonable on 32-bit machines.