tags:

views:

505

answers:

2

What is the size of each asm instruction? Every instruction takes how many bytes? 8 bytes? Four for the opcode and Four for the argument? What happens when you have one opcode and 2 arguments, in mov, for example? Do they have a fixed size in memory or do they vary? Does EIP have anything to do with this, of its value is always incremented by one, being totally independent of what kind of instruction it is passing by?

I ask this as when I was reading http://en.wikibooks.org/wiki/X86%5FDisassembly/Functions%5Fand%5FStack%5FFrames , I stumbled across the fact that it seems a call instruction is equivalent to a push and jmp instruction.

call MYFUNCTION
mov my_var, eax

being the same as

push [eip + 2];
jmp MYFUNCTION;
mov my_var, eax

When we're pushing [eip + 2] on the stack, to what value are we pointing then? To the line right next to "jmp MYFUNCTION", mov my_var eax, right?

ps: MSVC++ flags an error on the first line, as it says eip is undefined. It works for eax, esp, ebp, etc. What am I doing wrong?

+4  A: 

Machine code size depends on the processor's architecture.

For example, on IA-32, the instruction size varies from 1 to 6 bytes (or more).

Nick D
When you say the instruction size varies from 1 to 6 bytes on different computers or that on the same computer, some instructions are bigger than others?
devoured elysium
on the same computer of course. Yes, an instruction can be composed from many bytes.
Nick D
+5  A: 

The size of a machine instruction depends on the processor architecture - there are architectures with fixed size instruction, but you are obviously refering to the IA-32 and Intel 64 and they have strongly varing instruction lengths. The instruction pointer is of course always incremented by the length of the processed instruction.

You can download the IA-32 and Intel 64 manuals from Intel - they contain almost everything you can know about the architecture. You can find an opcode map and instruction set format in Intel® 64 and IA-32 Architectures Software Developer’s Manual Volume 2B: Instruction Set Reference, N-Z on pages 623 to 768.

Daniel Brückner
If each instruction has a different length, how can we, or even the computer, know in memory what is an instruction and what is an argument? If every instruction had the same size, the computer would know that, let's say, each 4 bytes corresponded to a new instruction. But if their size varies, that is not true. At the end (or beginning!) of each instruction is there any special byte or something? Thanks
devoured elysium
Absolutly. The first byte includes enough information to determine if a second byte must be read and the second byte allows determining if the instruction has a third byte and so on. Just have a look at the instruction code tables.
Daniel Brückner
@devoured elysium, the are *special* bytes (opcodes) that indicate extended instructions.
Nick D
Incidentally IA-64 is actually Itanium; you may be thinking of the x86-64/x64/amd64/em64t extensions to IA-32. IE-64 is a fixed instruction width architecture (and a honking big one at 128 bits per group-of-instructions).
bobince
Thanks, I always write IA-64 instead of Intel 64. Fixed it.
Daniel Brückner