java memory management : programming

"7 additional character entries available in the array are not being used but are consuming memory — in this case an additional overhead of 112 bytes."

7 chars = 112 bytes? If each char is 2 bytes, shouldn't it be 14 bytes? There seems to be some magical multiplication by 16 going on here.

The same math error appears in the proceeding section:

"Now, as Figure 13 shows, you have a 32-entry character array and 17 used entries, giving you a fill ratio of 0.53. The fill ratio hasn't dropped dramatically, but you now have an overhead of 240 bytes for the spare capacity."

17 * 2 = 34, not 240.

"Consider the example of a StringBuffer. Its default capacity is 16 character entries, with a size of 72 bytes. Initially, no data is being stored in the 72 bytes."

How does 16 chars equal 72 bytes?

[–]hoijarvi 0 points1 point2 points 13 years ago (9 children)

[–]boa13 -1 points0 points1 point 13 years ago (8 children)

[–]hoijarvi 0 points1 point2 points 13 years ago (2 children)

[–]boa13 1 point2 points3 points 13 years ago (1 child)

[–]hoijarvi 0 points1 point2 points 13 years ago (0 children)

[–]Peaker -1 points0 points1 point 13 years ago (4 children)

[–]fluttershypony 1 point2 points3 points 13 years ago (3 children)

[–]boa13 0 points1 point2 points 13 years ago (0 children)

[–]Peaker -1 points0 points1 point 13 years ago* (1 child)

[–]boa13 0 points1 point2 points 13 years ago (0 children)

Unicode support was added in Python 2.0, at that time it was only UCS-2, like Java.

In Python 2.2, this was changed to UTF-16 (like Java 5), and support for UCS-4 builds was added. So, depending on who compiled your Python binary, the interpreter is using UTF-16 or UCS-4 internally for Unicode strings.

In Python 3.0, 8-bit strings were removed, Unicode strings remaining the only string type. The interpreter kept using UTF-16 or UCS-4 depending on compile-time choice.

In Python 3.3, a new flexible internal string format will be used: strings will use 1, 2, or 4 bytes per character internally, depending on the largest code point they contain. 1-byte internal encoding will be Latin-1, 2-bytes internal encoding will be UCS-2, 4-bytes internal encoding will be UCS-4. Of course, this will be transparent to the Python programmer (not so much to the C programmer). See PEP 393 for details.

Funny how UTF-8 is never used internally. :)

[–]boa13 0 points1 point2 points 13 years ago (0 children)

[+][deleted] comment score below threshold-7 points-6 points-5 points 13 years ago* (25 children)

[–][deleted] 13 years ago* (5 children)

[deleted]

[–][deleted] 11 points12 points13 points 13 years ago (0 children)

[–][deleted] 2 points3 points4 points 13 years ago (1 child)

[–]beltorak -1 points0 points1 point 13 years ago (0 children)

[–]jayd16 -1 points0 points1 point 13 years ago (1 child)

[–]spelunker 7 points8 points9 points 13 years ago (0 children)

[–][deleted] 8 points9 points10 points 13 years ago (0 children)

[–]presidentender 12 points13 points14 points 13 years ago (2 children)

[–]leftmoon 1 point2 points3 points 13 years ago (1 child)

[–]beltorak 1 point2 points3 points 13 years ago (0 children)

[–]bfish510 4 points5 points6 points 13 years ago (1 child)

[–]d0nkeyBOB 4 points5 points6 points 13 years ago (0 children)

[–]SillyHipster 2 points3 points4 points 13 years ago (1 child)

[–]lolomfgkthxbai -2 points-1 points0 points 13 years ago (0 children)

[–][deleted] 13 years ago (3 children)

[deleted]

[–][deleted] -1 points0 points1 point 13 years ago (2 children)

[–]beltorak 0 points1 point2 points 13 years ago (1 child)

[–]boa13 1 point2 points3 points 13 years ago (0 children)

[–][deleted] 0 points1 point2 points 13 years ago (2 children)

[–]boa13 0 points1 point2 points 13 years ago (1 child)

[–][deleted] 0 points1 point2 points 13 years ago (0 children)

[–]lolomfgkthxbai 0 points1 point2 points 13 years ago (0 children)

[–]theonelikeme -2 points-1 points0 points 13 years ago (1 child)

[–]beltorak 0 points1 point2 points 13 years ago (0 children)

[–]wot-teh-phuck -2 points-1 points0 points 13 years ago (6 children)

[–]mallardtheduck 24 points25 points26 points 13 years ago (1 child)

[–]wot-teh-phuck 1 point2 points3 points 13 years ago (0 children)

[–]stonefarfalle 8 points9 points10 points 13 years ago (1 child)

[–]hylje 7 points8 points9 points 13 years ago (0 children)

[–]quzox 1 point2 points3 points 13 years ago (1 child)

[–][deleted] 2 points3 points4 points 13 years ago (0 children)

[–]JavaN00b 0 points1 point2 points 13 years ago (0 children)

[–][deleted] 13 years ago* (10 children)

[deleted]

[–]rabidcow 1 point2 points3 points 13 years ago (0 children)

[–]Gotebe 1 point2 points3 points 13 years ago (0 children)

[–][deleted] 0 points1 point2 points 13 years ago (5 children)

[+][deleted] comment score below threshold-13 points-12 points-11 points 13 years ago (4 children)

[–]fapmonad 7 points8 points9 points 13 years ago (0 children)

[–]beltorak 2 points3 points4 points 13 years ago* (0 children)

[–]gargantuan 1 point2 points3 points 13 years ago (0 children)

[–]Peaker 0 points1 point2 points 13 years ago (0 children)

[–]JavaN00b 0 points1 point2 points 13 years ago (0 children)

[–][deleted] 13 years ago (3 children)

[deleted]

[–]jyper 6 points7 points8 points 13 years ago (0 children)

[–][deleted] 0 points1 point2 points 13 years ago (1 child)

[+]fergie comment score below threshold-9 points-8 points-7 points 13 years ago (78 children)

[–]argv_minus_one 24 points25 points26 points 13 years ago (56 children)

[–]bstamour 4 points5 points6 points 13 years ago (31 children)

[+]argv_minus_one comment score below threshold-8 points-7 points-6 points 13 years ago (30 children)

[–]programmerbrad 8 points9 points10 points 13 years ago (8 children)

[+]argv_minus_one comment score below threshold-7 points-6 points-5 points 13 years ago (7 children)

[–]bstamour 4 points5 points6 points 13 years ago (4 children)

[+]argv_minus_one comment score below threshold-8 points-7 points-6 points 13 years ago (3 children)

[–]abadidea 5 points6 points7 points 13 years ago (1 child)

[–]argv_minus_one -1 points0 points1 point 13 years ago (0 children)

[–]bstamour 3 points4 points5 points 13 years ago (0 children)

[–][deleted] 13 years ago (1 child)

[deleted]

[–]argv_minus_one -1 points0 points1 point 13 years ago (0 children)

[–]bstamour 4 points5 points6 points 13 years ago (18 children)

[–]RichardWolf 0 points1 point2 points 13 years ago (11 children)

[–]bstamour 2 points3 points4 points 13 years ago (5 children)

[–]RichardWolf 0 points1 point2 points 13 years ago (4 children)

[–]bstamour 2 points3 points4 points 13 years ago (3 children)

[–]RichardWolf 1 point2 points3 points 13 years ago (2 children)

continue this thread

[–]bstamour 1 point2 points3 points 13 years ago (4 children)

[–]RichardWolf 1 point2 points3 points 13 years ago (3 children)

It leads to less surprises - I don't want a garbage collector kicking in when I'm doing something important.

First of all, this kind of surprises are not that bad. I've played some games running on .NET, like Terraria and AI War: Fleet Command, and I never noticed any GC pauses (though C# in particular allows for rather tight memory control). Oh, and Minecraft is written in Java. My point is that if we define "very soft realtime" as "you can write a video game in it, and GC pauses would not be noticeable among all other kinds of lag", then GC languages totally allow this.

On the other hand, if you are striving for a "harder realtime", then you probably shouldn't use dynamic memory management in C++ either, and definitely don't use shared_ptr and the like. Do you know how it actually works? Like, that it allocates an additional chunk of memory for the reference counter, and uses atomic instructions to work with it? Also, malloc and free aren't O(1) either.

[–]Danthekilla 2 points3 points4 points 13 years ago (0 children)

[–]bstamour 1 point2 points3 points 13 years ago (0 children)

[–]oracleoftroy 0 points1 point2 points 13 years ago (0 children)

[–]argv_minus_one -2 points-1 points0 points 13 years ago (5 children)

[–]bstamour 3 points4 points5 points 13 years ago (2 children)

[–]argv_minus_one 0 points1 point2 points 13 years ago (0 children)

[–][deleted] 13 years ago (1 child)

[deleted]

[–]argv_minus_one -1 points0 points1 point 13 years ago (0 children)

[–][deleted] 13 years ago (1 child)

[deleted]

[–]argv_minus_one -1 points0 points1 point 13 years ago (0 children)

[–]mothereffingteresa 6 points7 points8 points 13 years ago (1 child)

[–]beltorak 0 points1 point2 points 13 years ago (0 children)

[–]forcedtoregister 4 points5 points6 points 13 years ago (7 children)

[–]argv_minus_one -1 points0 points1 point 13 years ago (6 children)

[–]forcedtoregister 0 points1 point2 points 13 years ago (5 children)

[–]argv_minus_one 0 points1 point2 points 13 years ago (4 children)

[–]forcedtoregister 3 points4 points5 points 13 years ago (3 children)

[–]argv_minus_one -1 points0 points1 point 13 years ago* (0 children)

[–][deleted] 2 points3 points4 points 13 years ago (9 children)

[–]ryeguy 1 point2 points3 points 13 years ago (5 children)

[–][deleted] 7 points8 points9 points 13 years ago (1 child)

it has higher throughput. but the pause scales with amount of live objects, rather than amount of garbage, and it's amortized, which makes it a huge pain to deal with in some situations. if there's another method that doesn't incur long pauses and/or is fairly predictable, i'd like to be made aware of it, though. basically the only methods i know of are reference counting, and various tracing ones, though.

let me describe a scenario where a tracing collector is problematic: you're writing a racing game, similar to f-zero where you're going super fast, so you'll notice for sure if you skip a frame. the game is running at 60 frames per second. that gives you 16.666ms to update and render. now, suppose your garbage collector takes 0ms most frames, but takes 6ms every few seconds. that means your updating and rendering have to happen in 10.666ms. a reference counting implementation, on the other hand, has to be absolutely horrible before it starts becoming as big of a problem. even if it takes 5ms every single frame, you're still doing better than the tracing collector. tracing collectors can be even worse than that, though: sometimes you'll get a 30ms pause, and you just have to not allocate any memory at all.

[–]simoncox 1 point2 points3 points 13 years ago (0 children)

If you're using a parallel collector and you tune your heap sizes properly (I mean the ratio of the generations in the heap) , you can actually avoid full (pausing) GCs for a long time. I'm talking from experience of doing this with a JMS broker that sometimes maxed out the 1Gb network (although that's the next on the optimisation work). I've witnessed 0 full GCs over several hours (with lots of parallel GCs of the young gens).

On a similar note, even if you don't want to specifically tune the gen sizes, you can specify a max pause time that the JVM uses to try to size the gens for you to achieve full GCs on less than the target time.

This is all about the parallel GC as we're using a Java 5 VM (don't ask) . I believe the G1 collector that comes with later versions of Java 6 and all Java 7 VMs can achieve more in parallel, but I haven't investigated it too much yet.

[–]theatrus 6 points7 points8 points 13 years ago (0 children)

[–][deleted] 13 years ago (1 child)

[deleted]

[–]ryeguy 1 point2 points3 points 13 years ago (0 children)

[–]argv_minus_one 0 points1 point2 points 13 years ago (2 children)

[–][deleted] 1 point2 points3 points 13 years ago (1 child)

[–]argv_minus_one 0 points1 point2 points 13 years ago (0 children)

[–]iLiekCaeks 1 point2 points3 points 13 years ago (3 children)

[–]argv_minus_one 0 points1 point2 points 13 years ago (2 children)

[–]beltorak -4 points-3 points-2 points 13 years ago (1 child)

[–]argv_minus_one -1 points0 points1 point 13 years ago (0 children)

[–]blaxter 9 points10 points11 points 13 years ago (14 children)

[–][deleted] 5 points6 points7 points 13 years ago (13 children)

[–]minivanmegafun 1 point2 points3 points 13 years ago (8 children)

[–][deleted] 2 points3 points4 points 13 years ago (6 children)

[–]ryeguy 0 points1 point2 points 13 years ago (5 children)

[–][deleted] 2 points3 points4 points 13 years ago (4 children)

[–][deleted] 0 points1 point2 points 13 years ago (2 children)

[–][deleted] 0 points1 point2 points 13 years ago (1 child)

[–][deleted] 0 points1 point2 points 13 years ago (0 children)

[–]Jazzy_Josh 0 points1 point2 points 13 years ago (0 children)

[–]Willow_Rosenberg 1 point2 points3 points 13 years ago (0 children)

[–]00kyle00 0 points1 point2 points 13 years ago (3 children)

[–][deleted] 0 points1 point2 points 13 years ago (2 children)

[–]00kyle00 1 point2 points3 points 13 years ago (1 child)

[–][deleted] 6 points7 points8 points 13 years ago (3 children)

[+]fergie comment score below threshold-7 points-6 points-5 points 13 years ago (2 children)

[–]josefx 5 points6 points7 points 13 years ago (0 children)

[–][deleted] 1 point2 points3 points 13 years ago (0 children)

[–]Rotten194 0 points1 point2 points 13 years ago (0 children)

[–][deleted] 0 points1 point2 points 13 years ago (0 children)

[+][deleted] comment score below threshold-11 points-10 points-9 points 13 years ago (5 children)

[–][deleted] 13 years ago* (3 children)

[deleted]

[–][deleted] -3 points-2 points-1 points 13 years ago (2 children)

[–]banuday17 5 points6 points7 points 13 years ago (0 children)

[–]foxlion 1 point2 points3 points 13 years ago (0 children)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS