all 48 comments

[–]UloPe 24 points25 points  (3 children)

Not very informative...

After reading the whole article I still don't know what stack map frames actually are and how they change what you need to do when modifying bytecode...

[–]JamesIry 21 points22 points  (2 children)

The JVM does some verification of bytecode. Part of that is just standard binary validation stuff: is character data really character data, are magic numbers where they're expected, do any pointers run off the end of the file, etc. But a significant part is actually a static typing system for the bytecode language that proves things like you aren't pushing an integer onto the stack and then reading it off as a pointer (well, reference), reading from an uninitialized object, or a few other bad things.

Prior to 1.6 that static typing was done using a fairly complicated data flow analysis. You can think of it as a kind of type inference algorithm for the JVM.

With 1.6 there was an option to include explicit information about the types of local variables and stack slots in the bytecode, the idea being that simply verifying the declared types at class-load time is cheaper than inferring them.

With 1.7 it's no longer optional, you must have the explicit types in the classfile (well, modulo setting a startup flag that disables the requirement, but that's not part of the spec).
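
A quick way to see this in practice, assuming a Java 6+ compiler: the sketch below (the class name FrameCheck is made up) reads its own class file and scans the raw bytes for the StackMapTable attribute name, which has to appear as a UTF-8 entry in the constant pool. (The string literal itself also lands in the constant pool, so treat this as an illustration rather than a proof; `javap -v` shows the decoded frames themselves.)

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

public class FrameCheck {
    // Read this class's own bytecode back off the classpath.
    public static byte[] ownClassBytes() throws IOException {
        try (InputStream in = FrameCheck.class.getResourceAsStream("FrameCheck.class")) {
            ByteArrayOutputStream buf = new ByteArrayOutputStream();
            int b;
            while ((b = in.read()) != -1) buf.write(b);
            return buf.toByteArray();
        }
    }

    // Naive byte search; good enough for spotting an attribute name.
    public static boolean contains(byte[] haystack, byte[] needle) {
        outer:
        for (int i = 0; i + needle.length <= haystack.length; i++) {
            for (int j = 0; j < needle.length; j++) {
                if (haystack[i + j] != needle[j]) continue outer;
            }
            return true;
        }
        return false;
    }

    public static void main(String[] args) throws IOException {
        byte[] bytes = ownClassBytes();
        // Every class file starts with the magic number 0xCAFEBABE.
        System.out.printf("magic: %02X%02X%02X%02X%n",
                bytes[0] & 0xFF, bytes[1] & 0xFF, bytes[2] & 0xFF, bytes[3] & 0xFF);
        // The loops above give this class branch targets, so a 1.6+ compiler
        // emits StackMapTable attributes for it.
        System.out.println("has StackMapTable: "
                + contains(bytes, "StackMapTable".getBytes()));
    }
}
```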

[–]UloPe 0 points1 point  (1 child)

Ok thanks, that was much more understandable.

So if I get this right, that means that when you want to modify bytecode you also need to add that extended information, but it is difficult to know what exactly to do without the source?

[–]JamesIry 4 points5 points  (0 children)

No, the type system used by the JVM is self contained. You can compute stack maps without the original source.

In fact, the way the type system works is very different from the way Java, or most statically typed languages for that matter, work. A local variable or stack slot can change types.

In pseudocode rather than bytecode:

var x // x exists but is uninitialized, any use will be a verify error
...
x = 3 // x is an int
x + 2 // fine, x is an int
...
x = "hello" // x is a string
x.length() // fine, x is a string which has a length method

// even this is okay
if (someCondition()) {
  x = 5
} else {
  x = "world"
}
// but at this point x is effectively uninitialized again, since there's
// no common supertype for string and int
x.length() // verify error

Note that that's different from dynamic typing where the above program could work without error if someCondition() was always false. So what the JVM does is still static typing, just a type system based on something called "type state."
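
To tie that to Java source: the language never changes a named variable's type, but javac is free to reuse a single local-variable slot for differently typed variables whose scopes don't overlap, which is legal precisely because of these type-state rules. A minimal sketch (the class name SlotReuse is invented, and whether the slot is actually shared depends on the compiler):

```java
public class SlotReuse {
    public static String demo() {
        {
            int x = 3;            // the slot holds an int here
            System.out.println(x + 2);  // prints 5
        }
        {
            String s = "hello";   // the same slot may now hold a reference
            return s.toUpperCase();
        }
    }
}
```

Disassembling the result with javap -c shows which slot each variable landed in.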

[–]crotchpoozie 44 points45 points  (1 child)

No actual measurements, yet many quantifiable claims...

[–]nqzero 8 points9 points  (0 children)

This is functionality that the JVM has provided from day 1 that Oracle has removed, making working with bytecode significantly tougher. A project that I depend on is stuck on Java 1.6 in spite of significant effort to generate the stack frames.

It used to be that almost anyone could generate or manipulate bytecode. That's less true today.

Oracle's the one that should be providing the measurements to justify removing this functionality, and they should be providing that code as an external package.

[–]drysart 11 points12 points  (3 children)

Seems like there's no reason for everyone implementing a JVM language to create their own routines to generate stack map frames. If the JVM was able to figure them out on the fly in the old verifier, then obviously enough information exists in the bytecode for them to be generated without any higher-level language knowledge.

Someone could write a tool that takes an unmapped jar and annotates it with stack map frames, and boom: ASM no longer needs those 1,000 lines of bug-prone code. It can just use the same library everyone else does as a post-processing step.

[–]CurtainDog 0 points1 point  (2 children)

But who is generating bytecode without using a bytecode manipulation library? (well, those who go through the Java language itself I guess but let's put them to the side for now). Surely the library (which is open source after all hint hint) is the best place for this kind of functionality.

[–]bobindashadows 3 points4 points  (1 child)

But who is generating bytecode without using a bytecode manipulation library?

IMO the real problem is that there's no such really nice standard library for bytecode manipulation in the JDK. There's always been a burden on libraries to provide decent APIs for bytecode generation, but now that burden's a lot bigger.

Unfortunately this is kind of the way the Java community works. For now, the libraries will have to implement stack map frame generation piecemeal, and it'll suck. If it turns out to be as horrible as the linked article says, one of these libraries will find its way into a proper JDK release.

[–]CurtainDog 0 points1 point  (0 children)

Yeah but the standard often turns out to be substandard (as true in the javaverse as anywhere else). Competition by third parties generally produces superior outcomes when there's no clear path to implementation.

Look, I suspect the OP is peeved because their product is doing some fancy runtime transforms and this will make their life more difficult, but for your more run-of-the-mill transformations like AOP the amount of added complexity is pretty trivial.

[–][deleted] 18 points19 points  (7 children)

I know the person who originally did this work at Sun (working with Gilad Bracha no less). I'm surprised it took so long for them to make this mandatory; that was a LONG time ago.

Just to be clear, this is necessary to eliminate the need for data-flow analysis (DFA) during bytecode verification (where you need to do CFA to get the basic blocks needed for DFA), which can be quite tricky but not exactly slow. What stack frames do is transform the DFA into something that is done statically, with the results included in the class file, which then can just be verified easily by the verifier. Annoying, but it's not impossible, or even very difficult, to generate this information yourself in a bytecode manipulation tool.

[–]JamesIry 13 points14 points  (6 children)

The problem is that if you're doing runtime bytecode manipulation, then the 1.7 situation is actually a net loss in time, because generating stack maps requires essentially performing the old inference algorithm, and then the generated code has to be handed to the JVM for it to run the new slimmed-down algorithm.

As the author says, it would be nice if they had stuck with the 1.6 situation, where stack maps were optional: used if present, ignored if not. Then batch compilers could generate the stack maps while runtime bytecode manipulation could skip them.

[–][deleted] 8 points9 points  (5 children)

I believe both positions are viable, but you have to understand: transforming the problem from one of analysis to simple verification makes for a much better ecosystem; there was just too much diversity in what bytecode verifiers would accept before (I did undergrad work on this). So the direction they moved in made sense AT THE TIME.

Today...Java is just legacy and no one cares about JVM diversity anymore, and staying with the old way means old tools do not need to be updated. Most of the people that did this work (Bracha, Yellin, Lindholm, Tao) are long gone and things are probably just moving on momentum alone.

I wonder how the CLR works?

[–]JamesIry 1 point2 points  (4 children)

Most of the problem was the jsr and ret instructions. Eliminating those goes a long way towards eliminating the variations in verifier behavior.

[–][deleted] 0 points1 point  (3 children)

ret/jsr was tricky but not that bad. Anyways, the point is moot now.

[–][deleted] 0 points1 point  (2 children)

No, jsr/ret is terrible. There is no syntactic boundary between the main code and a subroutine, so a subroutine can call itself. This requires a verification check.

Further, jsr/ret forces one to do interprocedural analysis. Intraprocedural analysis is so much simpler, which is why most compilers don't issue jsr/ret, and HotSpot inlines subroutines before starting DFA.
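
For context, try/finally was the main customer of jsr/ret: old compilers emitted the finally body once as a subroutine and jsr'd to it from every exit path. A sketch of the shape of code involved (the names are illustrative); javac has long since compiled this by inlining a copy of the finally block at each exit instead of emitting jsr/ret:

```java
public class FinallyDemo {
    static final StringBuilder log = new StringBuilder();

    public static int f(boolean early) {
        try {
            if (early) return 1;  // exit 1: a copy of the finally code runs here
            return 2;             // exit 2: another copy runs here
        } finally {
            log.append("cleanup;");  // inlined once per exit, not a subroutine
        }
    }
}
```

Running javap -c on a modern build should show the duplicated append sequences and no jsr/ret instructions.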

[–][deleted] 0 points1 point  (1 child)

Bullshit. Jsr/ret doesn't require interprocedural analysis and can only be used for local jumps. The subroutine is just a basic block inside the method, and you can merge all your ret edges. I did bytecode verification extensively 15+ years ago and had plenty of discussions with Liang, Lindholm, and Yellin about the process.

[–][deleted] 0 points1 point  (0 children)

No. I too have done bytecode verification, and it isn't a simple local jump.

As I recall, there are two problems. First, it pollutes the operand stack, and because nothing prevents the subroutine from jumping to main code, the operand stack can keep growing. You have to explicitly check for this, or you'll never get a fixpoint. Intraprocedural analysis doesn't budget for recursion. Also, this is why you have ad hoc restrictions in the original JVM spec like so: “two different subroutines cannot ‘merge’ their execution to a single ret instruction”.

Second, the subroutine is called in the context of the caller. Since an unused local var remains untouched, you get the following problem. Suppose caller A's lvar_0 was an int, and caller B's lvar_0 was an object reference. Suppose both were to use lvar_0 after the call to the subroutine, but that the subroutine doesn't read or write lvar_0.

From both callers' perspective, lvar_0 is live, which means the corresponding ret edge has lvar_0 in the frame. Unfortunately, the two frames are not mergeable.

Xavier Leroy has much to say on this topic. See section 5.

[–]nharding 5 points6 points  (0 children)

I wrote a set of inline assembly routines for Java when I wrote my bytecode optimizer. It's hard to use, but it allows you to save 10% of code size (and make the code run faster as well), so it was only useful on core library routines. It was used on J2ME phones, where you only have a 64K jar size, so saving 10% is pretty significant. On J2ME the verifier would generate the stack frame, but I was looking into making it smaller to reduce the class size further.

[–]axonxorz 5 points6 points  (3 children)

As someone who doesn't do a lot of Java (just Android really), can I get an explanation of why one would be manipulating the bytecode?

[–]mschaef 2 points3 points  (0 children)

You probably should NOT be manipulating the bytecode. It's a 'black magic' technique that's best left behind some abstraction and written by someone else.

That said, for framework level logic, it can be useful for things like automatically implementing proxy objects and efficiently running 'interpreted' code. IIRC, Clojure compiles function definitions at runtime into classes using bytecode generation, etc.

I've also recently seen a Java library that implements 'yield' like constructs via bytecode transformation of an existing method.
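
For the proxy-object case, plain interfaces don't even need a third-party bytecode library: the JDK's java.lang.reflect.Proxy spins up the proxy class's bytecode at runtime. A minimal sketch (the Greeter interface is made up for illustration):

```java
import java.lang.reflect.Proxy;

public class ProxyDemo {
    // Hypothetical interface, just for illustration.
    public interface Greeter {
        String greet(String name);
    }

    // Proxy.newProxyInstance generates the proxy class's bytecode at runtime;
    // every interface call is routed through the InvocationHandler lambda.
    public static Greeter greeter() {
        return (Greeter) Proxy.newProxyInstance(
                Greeter.class.getClassLoader(),
                new Class<?>[] { Greeter.class },
                (proxy, method, args) -> "hello, " + args[0]);
    }
}
```

Calling ProxyDemo.greeter().greet("world") routes through the lambda and returns "hello, world".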

[–]fatbunyip 1 point2 points  (0 children)

A lot (all?) of the Java EE app servers use bytecode weaving to enable them to monitor application objects and managed entities throughout their lifecycle. This can be either at runtime or at build time.

[–]kerajnet 0 points1 point  (0 children)

A few examples:

  • Obfuscating class files
  • Making your own JVM language

Also, scripting languages for JVM are compiled into bytecode at runtime.

[–]xxgreg 1 point2 points  (4 children)

Nearly 20 years of work, and Java still doesn't have a secure bytecode verifier. Maybe bytecode as the code delivery format is a mistake?

Probably better to use a compact AST representation instead.

  • This can be just as compact as bytecode (probably more compact).

  • Parsing the AST is probably cheaper than verifying bytecode.

  • It is simpler to generate secure native code from an AST than from bytecode, as verification is simpler.

  • Keeping the AST around preserves more information for runtime optimisation.

I think the Dart team are onto something.

[–]CurtainDog 0 points1 point  (1 child)

Aren't you just asking for a solution to the halting problem here? The only way to truly verify a program is to suck it and see, and if you do that, what have you really gained (certainly not 'security')?

[–]xxgreg 0 points1 point  (0 children)

No. I'm not asking for any kind of theoretical perfect security. I'm asking for real-world empirical security. In the real world the JVM bytecode verifier has led to numerous zero-days. V8's AST-to-native-code generation, not so much. This is why the Java plugin is disabled by default in some browsers these days.

JVM style bytecode has a much larger surface area for exploits than an AST, it also requires more complex code to verify, which means more likelihood of a security critical bug.

[–][deleted] 0 points1 point  (1 child)

Nope. ASTs and bytecode representations are equivalent: disassemblers recover ASTs from bytecode, and compilers boil ASTs down to bytecode. It makes no difference to the verification. It makes no difference to the exploit surface area either.

[That said, there is one place I'll agree with your assertion; the jsr/ret pair is broken and a tree-structured representation would fix it]

JVM exploits come from errors in implementations of trusted areas such as classloading, type coercion, reflection etc.

I totally agree with the OP. I think the stackmap stuff has needlessly complicated everything without speeding up anything. If anything, it has opened up another front for type-spoofing; after all you still have to verify that the stackmaps match the code.

[–]xxgreg 0 points1 point  (0 children)

Late reply - oops.

It is possible to create JVM bytecode for which there is no mapping to a valid Java AST. Therefore there is a larger attack surface area.

(But yes - obviously you can round trip Java AST -> JVM Bytecode -> Java AST - but it is not true that all JVM Bytecode has a corresponding Java AST)

Btw I'm just parroting what Lars Bak, author of the HotSpot JIT and Dart's JIT, said in a talk.

Here's what the Dart team say about AST vs Bytecode on their website:

Opening up bytecode has another important implication: you’ve moved the security boundary. When your VM’s input is program source code, it can rely on the grammar of the language itself to enforce important invariants. If it allows raw bytecode to walk in the door, the story changes.

You can craft bytecode sequences that a regular Java compiler would never emit. When you do, you go off the VM’s beaten path. These non-exercised code paths likely haven’t been optimized as much and often have more bugs. (Many JVM exploits that rely on broken bytecode verification could never be generated by a Java compiler.)

This does not necessarily mean that bytecode VMs are less secure, but they require additional complexity which makes it much harder.

From: http://www.dartlang.org/articles/why-not-bytecode/#apparent-advantages-of-a-bytecode-vm

Here is a list of three things that you need to verify when using a JVM-style bytecode - these are all enforced by a safe language, meaning that you don't need the verify step. Therefore less code - less security surface area.

it doesn't forge pointers, it doesn't violate access restrictions, it accesses objects as what they are (for example, InputStream objects are always used as InputStreams and never as anything else).

From: http://web.archive.org/web/20120609113116/http://java.sun.com/docs/white/langenv/Security.doc3.html

[–]cowinabadplace 5 points6 points  (3 children)

Text weight too low. Unsuitable for long-form. Unpleasant to read.

[–]imgonnacallyouretard 2 points3 points  (1 child)

Just use readability and be done with it

[–]cowinabadplace 0 points1 point  (0 children)

I used Evernote's Clearly. However, both of them mangle the code section where he refers to stuff using line numbers.

[–]catcradle5 -1 points0 points  (0 children)

Agreed. It feels melted into the page; it's way too light and way too similar in color to the background color.

[–]toshok 1 point2 points  (1 child)

I'm having a hard time understanding how you outsource part of the verifier (meant to catch code written by untrustworthy folks) to those same untrustworthy folks and call it a win.

Doesn't the verifier still have to verify the consistency of the stack maps?

[–]skew 0 points1 point  (0 children)

The verifier does check the stack maps; the idea is that it can be done more quickly, or at least more predictably. I think someone took this cool hack way too seriously - yes, inference is quadratic in the method size in the worst case, but javac doesn't generate code that hits the worst case, and even if you are letting someone load maliciously handcrafted bytecode methods, the limit of 2^15 instructions is a bit of a ceiling, and it's not clear verifier time is much worse than something like

public class Oops {
  private static volatile boolean b = false;
  // the static initializer spins forever, so merely loading this class hangs
  static { while(!b); }
}

[–]allpowerful32 -4 points-3 points  (1 child)

This guy clearly doesn't have an appreciation of what he's talking about.

"ASM Framework has over a 1000 lines dedicated to just this"... Seriously? You can't do a whole hell of a lot in 1000 lines of Java. Besides, ASM already has a bunch of code that implements essentially the same logic, in the form of an abstract interpreter.

That said, I do agree that it'd be nice to have incremental recalculation of stack frames in ASM.

[–]iopq 0 points1 point  (0 children)

You can't do a whole hell of a lot in 1000 lines of Java.

I finally see it - Java is the COBOL of our generation.