ansaurus

Question

Types in Bytecode

Answer 1

+10 A:

These instructions are typed to ensure the program is typesafe. When loading a class the virtual machine performs verification on the bytecodes to ensure that, for example, a float isn't passed as an argument to a method expecting an integer. This static verification requires that the verifier can determine the types and number of values on the stack for any given execution path. The load and store instructions need the type tag because the local variables in the stack frames are not typed (i.e. you can istore to a local variable and later fstore to the same position). The type tags on the instructions allow the verifier to know what type of value is stored in each local variable.

The verifier looks at each opcode in the method and keeps track of what types will be on the stack and in the local variables after executing each one. You are right that this is another form of type checking and does duplicate some of the checks done by the java compiler. The verification step prevents loading of any code that would cause the VM to execute an illegal instruction and ensures the safety properties of the Java platform without incurring the large runtime penalty of checking types before each operation. Runtime type checking for each opcode would be a performance hit each time the method is executed, but the static verification is done only once when the class is loaded.

Case 1:

Instruction             Verification    Stack Types            Local Variable Types 
----------------------- --------------- ---------------------- ----------------------- 
<method entry>          OK              []                     1: none
iconst_1                OK              [int]                  1: none
istore_1                OK              []                     1: int
return                  OK              []                     1: int

Case 2:

Instruction             Verification    Stack Types            Local Variable Types 
----------------------- --------------- ---------------------- ----------------------- 
<method entry>          OK              []                     1: none
iconst_1                OK              [int]                  1: none
fstore_1                Error: Expecting to find float on stack

The error is given because the verifier knows that fstore_1 expects a float on the stack but the result of executing the previous instructions leaves an int on the stack.

This verification is done without executing the opcodes, rather it is done by looking at the types of the instruction, just like the java compiler gives an error when you write (Integer)"abcd". The compiler doesn't have to run the program to know that "abcd" is a string and can't be cast to Integer.

Geoff Reedy 2010-04-14 14:23:56

Thanks for the answer. So the verifier does some data-flow analysis before executing the class to detect this types of error. Interesting to learn :)

HH 2010-04-15 07:38:24

Answer 2

+2 A:

To answer your first question with my best guess: these bytecodes are different because they may require different implementations. For example, a particular architecture may keep integer operands on the main stack, but floating-point operands in hardware registers.

To answer your second question, VerifyError is thrown when the class is loaded, not when it's executed. The verification process is described here; note pass #3.

Anon 2010-04-14 14:50:05

+1 for the link. And thanks for reminding me VerifyError is thrown at load-time and not at run-time.

HH 2010-04-15 07:34:40

Answer 3

+1 A:

Geoff Reedy explained in his answer what the verifier does when a class is loaded. I just want to add that you can disable the verifier using a JVM parameter. This is not recommended!

For your example program (with iconst and fstore), the result of running with verification disabled is a VM error that halts the JVM with the following message:

=============== DEBUG MESSAGE: illegal bytecode sequence - method not verified ================

#
# An unexpected error has been detected by HotSpot Virtual Machine:
#
#  EXCEPTION_PRIV_INSTRUCTION (0xc0000096) at pc=0x00a82571, pid=2496, tid=3408
#
# Java VM: Java HotSpot(TM) Client VM (1.5.0_15-b04 mixed mode, sharing)
# Problematic frame:
# j  BytecodeMismatch.main([Ljava/lang/String;)V+0
#
...

Christian Semrau 2010-04-21 11:19:03

ansaurus

tags:

views:

answers:

Types in Bytecode

related questions