ansaurus

Question

Answer 1

+1 A:

Is there a problem with adding your own xchg instruction prior to the call? Since you have an align just prior to stub, the alignment should be consistent.

Mark Ransom 2009-12-03 06:57:27

Only insofar as a) I'd like to get the assembler to do that, because b) there's a layer of translation between the assembly code I write and the instructions that get generated, and I don't want to make too many assumptions along the lines of "call * always generates exactly 6 bytes".

Edmund 2009-12-03 07:02:04

"call * always generates exactly 6 bytes" - that level of control is why you go to assembly in the first place. I don't think there are any assembler directives to align the end of an instruction rather than the beginning.

Mark Ransom 2009-12-03 07:19:10

Answer 2

+1 A:

Unfortunately, most assemblers are simple one-pass translators, which limits the flexibility of the alignment directives they can offer. Even among the alignment options that multi-pass assemblers could offer, many are neglected because they are too specific. Yours is one of those, I am afraid. It could work in a one-pass assembler as long as it's only one instruction you intend to move, but it's very specific.

I have seen the manual of a sophisticated multi-pass assembler that let you subtract the addresses of two labels to get the length of a sequence of instructions, and would let you insert a directive to emit a sequence of NOPs of, say, (4 - this length modulo 4) bytes in the place of your choice (as long as it remained possible to converge on a definite position for each instruction). I can't remember what assembler it was. Definitely not gas, which is one-pass as far as I know. It may have been the venerable A386.

Pascal Cuoq 2009-12-03 13:40:35
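
The padding arithmetic described above can be sketched in C. This is only an illustration of the calculation, not any assembler's actual directive: the function name `pad_for_aligned_end` and its parameters are hypothetical. Note that the final mask makes the pad zero when the end is already aligned, whereas the literal "(4 - length modulo 4)" would give 4 in that case.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: number of filler bytes (e.g. NOPs) to emit before
 * an instruction of `insn_len` bytes that would otherwise start at `addr`,
 * so that the instruction ENDS on a multiple of `align`.
 * `align` must be a power of two. */
uint32_t pad_for_aligned_end(uint32_t addr, uint32_t insn_len, uint32_t align)
{
    assert(align != 0 && (align & (align - 1)) == 0);
    uint32_t end = addr + insn_len;            /* unpadded end address   */
    uint32_t misalign = end & (align - 1);     /* end modulo align       */
    return (align - misalign) & (align - 1);   /* 0 when already aligned */
}
```

For example, a 6-byte instruction that would start at 0x1000 ends at 0x1006, so two bytes of padding in front of it move its end to 0x1008.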

I suspected as much -- I guess the assembler doesn't know how much padding to put before instruction #1 until it's figured out where to put instruction #n. It's understandable, and I can manage without it. (The manual adjustment turns out to be as simple as "addl $3, %eax; andl $0xFFFFFFFC, %eax"...)

Edmund 2009-12-03 21:19:48
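
For readers less used to AT&T syntax, the two-instruction adjustment quoted in the comment above rounds a 32-bit value up to the next multiple of 4. A minimal C equivalent follows; the function name `round_up_to_4` is an assumption made for illustration, and what the value in `%eax` represents in the original code is not shown in this thread.

```c
#include <stdint.h>

/* Round `addr` up to the next multiple of 4; a value that is already a
 * multiple of 4 is returned unchanged.
 * Mirrors: addl $3, %eax ; andl $0xFFFFFFFC, %eax */
uint32_t round_up_to_4(uint32_t addr)
{
    return (addr + 3u) & 0xFFFFFFFCu;
}
```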

Answer 3

+1 A:

Have you considered putting the data before the code?

This way it is only a subtraction (of the length of the stub code plus some constant offset) to get to the address of the data, so it's one instruction instead of the two you were ready to accept. And I believe that gas will give you the length of the stub code (as the difference of two labels) without problem, since in this case the labels are used after they have been defined.

Assuming the data is made of 32-bit words, there is also less padding involved compared to your initial solution (although I am not sure why there are so many .align directives in your initial solution, probably some orthogonal constraint that you didn't get into).

Pascal Cuoq 2009-12-03 23:07:21

That means the pointer to this block of (data, code) won't be directly callable, which is a pity because it's nice to be able to pass it into C code as a callback. I could return block+sizeof(data), but then I suspect my GC (as yet unwritten) will not recognise that pointer as being dependent on the allocated block.

Edmund 2009-12-04 01:27:43

Infix pointers are difficult to manage in general, but if you are going to write the GC yourself, this particular case of infix pointer can be taken into account. Whereas you typically have (header, data) blocks with the GC accessing the header with `pointer[-1]`, you would have here (data, header, code) blocks with the pointer pointing to the code, and the header indicating how far the block reaches in both directions. You could pass it C code because it would be recognizable that static C code is not in the heap, so the GC would not look for a header in this case.

Pascal Cuoq 2009-12-04 08:39:50
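
One way to picture the (data, header, code) layout described above is the following C sketch. Everything here is hypothetical: the `block_header` fields, the helper names, and the assumption that the data consists of 32-bit words placed directly before the header. It only shows how a GC could recover both ends of the block from the callable code pointer.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical header stored immediately before the stub code. */
typedef struct {
    uint32_t data_words;  /* number of 32-bit data words BEFORE the header */
    uint32_t code_bytes;  /* number of bytes of stub code AFTER the header */
} block_header;

/* The header is at pointer[-1] relative to the code pointer. */
block_header *header_of(void *code)
{
    return (block_header *)code - 1;
}

/* Start of the block: the data sits just below the header. */
uint32_t *data_of(void *code)
{
    block_header *h = header_of(code);
    return (uint32_t *)h - h->data_words;
}

/* One past the end of the block: the code extends above the pointer. */
void *block_end(void *code)
{
    return (unsigned char *)code + header_of(code)->code_bytes;
}
```

With this arrangement the pointer handed to C code is still directly callable, and the collector reaches the data with a single subtraction from it.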

Pad instruction so end is aligned
