ansaurus

Question

Assembly: Y86 Stack and call, pushl/popl and ret instructions

Answer 1

+6 A:

Let's look at some of the code:

main:
  call ini

This will push the value of the instruction pointer to the stack (so that you can later return to this position in the code), and jump to the address of the ini label. The 'ret' instruction uses the value stored on the stack to return from the subroutine.

The following is the initialisation sequence of a subroutine. It saves the values of some registers on the stack and sets up a stack frame by copying the stack pointer (esp) to the base pointer register (ebp). If the subroutine has local variables, the stack pointer is decremented to make room for the variables on the stack, and the base pointer is used to access the local variables in the stack frame. In the example the only local variable is the (unused) return value.

The push instruction decrements the stack pointer (esp) with the data size of what's going to be pushed, then stores the value at that address. The pop instruction does the opposite, first getting the value, then increments the stack pointer. (Note that the stack grows downwards, so the stack pointer address gets lower when the stack grows.)

ini:
  pushl %ebp             // save ebp on the stack
  rrmovl %esp, %ebp      // ebp = esp (create stack frame)
  pushl %ebx             // save ebx on the stack
  pushl %eax             // push eax on the stack (only to decrement stack pointer)
  irmovl $0, %eax        // eax = 0
  rmmovl %eax, -8(%ebp)  // store eax at ebp-8 (clear return value)

The code follows a standard pattern, so it looks a bit awkward when there are no local variables, and there is an unused return value. If there are local variables a subtraction would be used to decrement the stack pointer instead of pushing eax.

The following is the exit sequence of a subroutine. It restores the stack to the position before the stack frame was created, then returns to the code that called the subroutine.

ini_finish:
   irmovl $4, %ebx   // ebx = 4
   addl %ebx, %esp   // esp += ebx (remove stack frame)
   popl %ebx         // restore ebx from stack
   popl %ebp         // restore ebp from stack
   ret               // get return address from stack and jump there

In response to your comments:

The ebx register is pushed and popped to preserve it's value. The compiler apparently always puts this code there, probably because the register is very commonly used, just not in this code. Likewise a stack frame is always created by copying esp to ebp even if it's not really needed.

The instruction that pushes eax is only there to decrement the stack pointer. It's done that way for small decrements as it's shorter and faster than subtracting the stack pointer. The space that it reserves is for the return value, again the compiler apparently always does this even if the return value is not used.

In your diagram the esp register is consistently pointing four bytes too high in memory. Remember that the stack pointer is decremented after pushing a value, so it will point to the value pushed, not to the next value. (The memory addresses are way off also, it's something like 0x600 rather than 0x20, as that's where the Stack label is declared.)

Guffa 2009-06-20 17:06:23

Reading your post and at the same time making some stack drawings, I think I somewhat understand this, there's only a few things I still don't get: a) Why do we push and pop %ebx? If I got it right, it's -4(%ebp), which we never access it and there's no value in %ebx to save at the start and get back at the end. b) Why do we push %eax if there's no value in %eax to save? Couldn't we just decrement the %esp to save the necessary space? And why do we even need that space? We are saving the %eax value in the stack for what? We never read back from it.

Nazgulled 2009-06-20 17:36:19

I've tried to do a diagram to see if I understood everything correctly, hopefully, everything's right and I got it, but I have a feeling I didn't, here it is: http://images.nazgulled.net/tmp/stack.jpg -- Still, the a) and b) questions in the first comment, still apply. And I have new one c) Why do we even need to push/pop %ebp and do "rrmovl %esp, %ebp"? It doesn't seem that it's changing anything...

Nazgulled 2009-06-20 19:02:18

I added a response in the answer.

Guffa 2009-06-21 11:30:35

I'm still not sure about that "return value that's not used". Let's go from the line where we create the stack frame, BP = SP: The first push will decrement SP and save the value of %ebx on the stack. The second one will do the same but for %eax, the one you say it's a return value not used. Now remember that SP is 8 bytes below BP because of the 2 pushs. Then, I use "irmovl $0, %eax" and "rmmovl %eax, -8(%ebp)" which will actually overwrite the previous %eax saved value (which doesn't matter cause we just pushed it to save space for it) with 0. Why do you say the value is not used then?

Nazgulled 2009-06-21 15:03:23

The code in the init sequence allocated space for the return value, then it clears it, but after that point the value is never used. The code in the method (between the init and exit sequences) doesn't contain anything that sets the return value, and the exit sequence doesn't do anything to return the value from the subroutine.

Guffa 2009-06-21 15:23:33

ansaurus

tags:

views:

answers:

Assembly: Y86 Stack and call, pushl/popl and ret instructions

related questions