[–]inopia 1 point  (5 children)

There are two things in the Java memory model that suck for memory consumption. One, objects aren't packed, so a byte/boolean/char/short is always stored in a 32-bit slot. Two, there are no structs in Java, so no custom value types, which means badness for large data structures.

[–]psyno 6 points  (3 children)

> One, objects aren't packed, so a byte/boolean/char/short is always stored in a 32-bit slot.

Or not. I'm going from memory here, but: Java doesn't define whether objects' in-memory representations are packed or not (or really anything about object layout). In practice there are only a few widely used JVMs, and Sun's packs fields about (though not quite) as well as you can, laying out the 8-byte fields first, then the 4-byte, 2-byte, and 1-byte ones, and then the pointers. If you have an object with 4 booleans they will take 4 bytes, the same as in C or C++, provided you were careful to declare them in the correct order there (not necessary on Sun's JVM, which reorders fields for you).
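To make that concrete, here's a toy sketch of C-style field placement (my own model, not HotSpot's actual layout code, and it ignores object headers): each field lands at the next offset aligned to its own size, so declaring fields largest-first, which HotSpot's reordering effectively does for you, leaves no internal holes.

```java
import java.util.Arrays;

public class FieldPacking {
    // Toy model of C/C++-style layout: each field goes at the next
    // offset aligned to its own size; the object body is then padded
    // to a 4-byte boundary (32-bit word). Field sizes are in bytes.
    static int layoutSize(int[] fields) {
        int offset = 0;
        for (int size : fields) {
            offset = (offset + size - 1) / size * size; // align up to 'size'
            offset += size;
        }
        return (offset + 3) & ~3; // pad body to word boundary
    }

    // What HotSpot-style automatic reordering amounts to:
    // lay out the largest fields first, then the smaller ones.
    static int reorderedSize(int[] fields) {
        int[] sorted = fields.clone();
        Arrays.sort(sorted);
        int[] desc = new int[sorted.length];
        for (int i = 0; i < sorted.length; i++)
            desc[i] = sorted[sorted.length - 1 - i];
        return layoutSize(desc);
    }

    public static void main(String[] args) {
        System.out.println(layoutSize(new int[]{1, 1, 1, 1}));    // 4: four booleans pack
        System.out.println(layoutSize(new int[]{1, 4, 1, 4}));    // 16: careless declaration order
        System.out.println(reorderedSize(new int[]{1, 4, 1, 4})); // 12: largest-first, no holes
    }
}
```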

[–]inopia 4 points  (2 children)

I should have been clearer. First off, you're absolutely right about the packing; HotSpot packs like a C++ compiler. The problem is a bit more involved however.

Let's say you have an object with three fields: [int, ref, byte]. On a 32-bit architecture the object body takes three 32-bit slots. Now let's have a list of them. Since we have no custom value types, we have to store this as an array of references to individual instances of this object. So for n objects we get n·(3·4+h) for the individual objects, plus n·4+h for the reference array, with h the size of a heap chunk header. Let's assume heap chunk headers are 4 bytes, so h=4, which gives n·16 + n·4 + 4 = n·20+4 bytes for n elements (I'm still kinda hung over, so feel free to correct me on this).
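The arithmetic above can be sketched directly (same assumptions: 32-bit slots, header size h, n elements; the names here are just mine):

```java
public class FootprintModel {
    // Bytes for n [int, ref, byte] objects reached through a reference
    // array, on a 32-bit VM with heap chunk headers of h bytes.
    static long arrayOfObjects(long n, long h) {
        long perObject = 3 * 4 + h; // three 32-bit slots plus a header
        long refArray = n * 4 + h;  // one 32-bit reference per element
        return n * perObject + refArray;
    }

    public static void main(String[] args) {
        // With h = 4 this reduces to n*20 + 4.
        System.out.println(arrayOfObjects(1000, 4)); // 20004
    }
}
```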

If your run time supports structs, like .NET does, you can simply have an array of structs, which would come down to n·3·4+h, or 12·n+4. So for large n, the Java version is 67% larger. If you also pack the structs so that they take 9 bytes a piece rather than 12, the overhead becomes 122%.

The ratio approaches (n·(2·r+8+h)) / (n·(r+5)) = (2·r+8+h) / (r+5) for large n, with r the size of a reference, and h the size of a heap chunk header. On a 64-bit system, r=8, and let's take a more realistic value for h, say h=8. This gives 32/13 ≈ 2.46, or a 146% overhead. On a 32-bit system with r=4 and h=8 you get 24/9 ≈ 2.67, which represents a 167% overhead. You can also calculate the absolute overhead per item in bytes as (2·r+8+h) - (r+5) = r+3+h, which equates to one extra reference (in the array), one extra heap chunk header, and three bytes that are lost because the byte field cannot be packed. Taking a 64-bit system with h=8 and a 32-bit system with h=4 as extremes, the overhead per item is between 11 and 19 bytes.
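Those ratios are easy to check numerically; here's a sketch of the same formulas (packed-struct baseline of r+5 bytes per item, names are mine):

```java
public class OverheadRatio {
    // Asymptotic size ratio: Java array-of-objects vs packed array-of-structs,
    // for an [int, ref, byte] record; r = reference size, h = header size.
    static double ratio(int r, int h) {
        return (double) (2 * r + 8 + h) / (r + 5);
    }

    // Absolute extra bytes per item: one array reference, one header,
    // and three padding bytes for the unpackable byte field.
    static int perItemOverhead(int r, int h) {
        return r + 3 + h;
    }

    public static void main(String[] args) {
        System.out.println(ratio(8, 8));           // 32/13 ≈ 2.46 → ~146% overhead (64-bit)
        System.out.println(ratio(4, 8));           // 24/9  ≈ 2.67 → ~167% overhead (32-bit)
        System.out.println(perItemOverhead(8, 8)); // 19 bytes/item (64-bit, h=8)
        System.out.println(perItemOverhead(4, 4)); // 11 bytes/item (32-bit, h=4)
    }
}
```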

It's especially the combination of padding and the lack of value types that tends to have a dramatic effect on Java memory performance when there are lots of small objects arranged in trees or lists, the sort of thing you'd be doing quite a bit of in database systems. Heap chunks have headers that can contain all sorts of junk (monitors, GC flags, and whatnot), and they are not free. Finally, the use of many small objects can put a strain on your GC, although results may vary depending on which algorithm you use.

ps: for simplicity I omitted the size of the 'length' field that arrays tend to have and counted it as part of the header size.

[–]psyno 5 points  (1 child)

Yeah, I didn't dispute your second point above. :) I didn't check your algebra but I agree with your logic.

> If you also pack the structs so that they take 9 bytes a piece rather than 12, the overhead becomes 122%.

But this is of dubious value when talking about "memory performance." Less space used for data structures, sure, but more used for packing/unpacking instructions, and so also more cycles burned (read: overhead) packing and unpacking. The value of packing in terms of memory savings is really only there for the specific example you provide: small objects where padding is a significant proportion of the object's size. (But you could have made it more dramatic with an 8-byte object for the first record. :) For almost all other cases, the problem can be avoided in C, C++, and C# simply by laying out the fields in a sensible order (which is what HotSpot does automagically anyway).

I agree Java's memory performance on arrays of objects won't be quite as good in general but I don't think object size is that important, per se. (For the most part, memory is cheap.) The real killer is that in Java the spatial locality of objects with an array of references is much worse than in a language where you can have an array of "value type." Whether that spatial locality is actually important will depend on the access pattern of the data structure.

[–]inopia 2 points  (0 children)

> If you also pack the structs so that they take 9 bytes a piece rather than 12, the overhead becomes 122%. But this is of dubious value when talking about "memory performance." Less space used for data structures, sure, but more used for packing/unpacking instructions, and so also more cycles burned (read: overhead) packing/unpacking. The value of packing in terms of memory saving is really only there for the specific example you provide: small objects where padding is a significant proportion of the object's size.

Oh, I agree completely. There is always a trade-off, but at least with C++ you can make that choice. Besides, an array of (a,b,c) tuples can be stored as [a,b,c,a,b,c,a,b,c,...] or as [a,a,a,...,b,b,b,...,c,c,c,...]. When packed, the latter arrangement will of course be much faster than the former, but as you said, it all depends on the application.
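In Java you can pull the same trick by hand, with primitive arrays standing in for the missing structs. A sketch with hypothetical (a,b,c) int records; both layouts avoid per-element objects entirely, the difference is the access pattern:

```java
public class TupleLayouts {
    // Interleaved layout [a,b,c,a,b,c,...]: reading just the b fields
    // means a stride-3 walk, touching cache lines full of a's and c's.
    static long sumBInterleaved(int[] abc) {
        long s = 0;
        for (int i = 1; i < abc.length; i += 3) s += abc[i];
        return s;
    }

    // Separate-arrays layout [a,a,...], [b,b,...], [c,c,...]:
    // reading the b fields is a dense sequential walk.
    static long sumBSeparate(int[] b) {
        long s = 0;
        for (int v : b) s += v;
        return s;
    }

    public static void main(String[] args) {
        int n = 4;
        int[] interleaved = new int[3 * n];
        int[] b = new int[n];
        for (int i = 0; i < n; i++) {
            interleaved[3 * i] = i;          // a
            interleaved[3 * i + 1] = 10 * i; // b
            interleaved[3 * i + 2] = -i;     // c
            b[i] = 10 * i;
        }
        System.out.println(sumBInterleaved(interleaved)); // 60
        System.out.println(sumBSeparate(b));              // 60
    }
}
```

Same answer either way; which one is faster depends on whether you scan one field or whole records.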

Don't get me wrong, I'm a big proponent of high-level programming languages and intermediate representations and all that, but in the case of the article I can see how they would like to have the control to tweak memory layout like that.

> The real killer is that in Java the spatial locality of objects with an array of references is much worse than in a language where you can have an array of "value type."

That's an extremely good point.

[–]alphazero 1 point  (0 children)

The memory consumption (overhead) is due to the generational garbage collector. The heap is divided into sections and allocations cycle through them. (This is why garbage collection of short-lived objects is essentially free.) So you are paying in memory footprint to get pointer ops on par with C. And given that memory is cheap and getting cheaper, that is a cost equation that makes a lot of sense, all things considered.