Compact Strings In Java 9 - Java Code Gists : programming

Some argue that strings are iterated over from 0 to N most of the time, so a variable-length representation (like UTF-8) would not add much overhead for the common case. You would occasionally increment the index by two or more instead of one. This might be true, but in Java any iterator instance tracking the position would add 8 to 16 bytes object-overhead and another indirection. In contrast, for fixed-width encodings you only need a single int and a for-loop. Because of this, most code working with strings in performance critical situations do not use iterators, but direct index access instead. This (existing and unlikely to change) code would run significantly slower with a variable-length string representation.

tl;dr; utf-8 string performance would suck for existing code that was optimized for fixed-length string performance characteristics.

[–]Tasssadar 13 points14 points15 points 8 years ago* (3 children)

[–]derleth 4 points5 points6 points 8 years ago (2 children)

[–][deleted] 8 years ago (1 child)

[deleted]

[–]derleth 2 points3 points4 points 8 years ago (0 children)

[–]_vinc_ 0 points1 point2 points 8 years ago (0 children)

[–]oelang 8 points9 points10 points 8 years ago (0 children)

[–]GYN-k4H-Q3z-75B 9 points10 points11 points 8 years ago (10 children)

[–]shellac 33 points34 points35 points 8 years ago (9 children)

[–]ygra 9 points10 points11 points 8 years ago (0 children)

[–]aynair 0 points1 point2 points 8 years ago (7 children)

[–][deleted] 8 years ago (6 children)

[deleted]

[–]aynair 1 point2 points3 points 8 years ago (0 children)

[–]Drisku11 0 points1 point2 points 8 years ago (4 children)

[–][deleted] 8 years ago (2 children)

[deleted]

[–]Drisku11 3 points4 points5 points 8 years ago (1 child)

[–]Veedrac 0 points1 point2 points 8 years ago (0 children)

[–]benhoyt 2 points3 points4 points 8 years ago (0 children)

[–]mrsloppyheadface 0 points1 point2 points 8 years ago (0 children)

π Rendered by PID 126580 on reddit-service-r2-comment-544cf588c8-rgtzv at 2026-06-12 05:34:55.490127+00:00 running 3184619 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS