ansaurus

Question

Working with double-precision numbers in inline assembly (GCC, IA-32)

Answer 1

+1 A:

Here's what I've got. It's not tested, but hopefully would be less gnarly for you to work with. :-)

double
roundd(double n, short mode)
{
    short cw, newcw;

    __asm__("fstcw %w0" : "=m" (cw));
    newcw = cw & 0xf3ff | mode;
    __asm__("fldcw %w0" : : "m" (newcw));
    __asm__("frndint" : "+t" (n));
    __asm__("fldcw %w0" : : "m" (cw));
    return n;
}

Although, if you're not required to use assembly to achieve your rounding mode, think about using the functions in <fenv.h> instead. :-)

Chris Jester-Young 2010-02-12 06:38:10

I am required to use assembly :)

jtbandes 2010-02-12 06:48:14

@jtbandes: Cool. In that case, feel free to test out my version and let me know what needs fixing. :-)

Chris Jester-Young 2010-02-12 06:49:55

How does the `+t` constraint work? I can't find information about it where I found the others.

jtbandes 2010-02-12 15:35:36

@jtbandes: `+` means in/out (used when the instruction reads and writes to the same register), and `t` means `st(0)`. Read the "Machine Constraints" section of the GCC manual: http://gcc.gnu.org/onlinedocs/gccint/Machine-Constraints.html :-)

Chris Jester-Young 2010-02-12 16:51:21

Answer 2

A:

As the sign changes, it means that the sign bit (which is the most significant, the first one) is not correct. That suppose to me that the pointer %1 is wrongly aligned. If you have one byte, it can begin on 0,1,2... but if you access two bytes, the address must be 0,2,4.... and in case of double the address must be even dividable by 8: 0,8,16

So check if the address which you use to load the value is dividable by 8. Assembly has the align keyword to guarantee that your data is correctly aligned.

Thorsten S. 2010-02-12 06:43:58

That alignment is NOT needed by x86 but only advisable for performance.

Ritsaert Hornstra 2010-02-12 06:52:52

Answer 3

+2 A:

At least one issue with your current code is it is using the single precision floating point versions of fld and fstp. If you replace them with fldl and fstpl it will probably work.

tyranid 2010-02-12 06:49:02

Thorsten S. 2010-02-12 07:13:59

+1 Yes, it seems the *l versions will work. The snippet I posted, when disassembled, have the *l suffix too. (Obviously, in my case I just chose to let gcc do all the hard work, by using constraints rather than manually coding in the load/store instructions.)

Chris Jester-Young 2010-02-12 07:17:29

I tried to maintain the spirit of the question but your answer was far better :)

tyranid 2010-02-12 18:01:15

ansaurus

tags:

views:

answers:

Working with double-precision numbers in inline assembly (GCC, IA-32)

related questions