ansaurus

Question

Answer 1

A:

If you do adjust the rounding so that does include both ends of the range, will those extreme values not be only half as likely as any of the non-extreme ones?

Pete Kirkham 2009-03-13 21:35:29

It seems to me if I just use truncation, the answer is yes, but if I increment the max significand, the answer would be no.

Not Sure 2009-03-13 21:44:38

Answer 2

A:

With truncation, you are never going to be inclusive of the max.

Are you sure you really need the max? There is literally an almost 0 chance that you will land on exactly the maximum.

That said, you can exploit the fact that you are giving up precision and do something like this:

float MersenneFloat( float min, float max )
{
    double random = 100000.0; // just a dummy value
    while ((float)random > 65535.0)
    {
        //genrand returns a double in [1,2)
        double random = genrand_close1_open2() - 1.0; // now it's [0,1)
        random *= 65536.0; // now it's [0,65536). We try again if it's > 65535.0
    }
    //return in desired range
    return min + float(random/65535.0) * (max - min);
}

Note that, now, it has a slight chance of multiple calls to genrand each time you call MersenneFloat. So you have given up possible performance for a closed interval. Since you are downcasting from double to float, you end up sacrificing no precision.

Edit: improved algorithm

rlbond 2009-03-13 21:44:02

Yes, I need the max to be inclusive (it's a library function contract). Would there be any advantage to doing it your way, as opposed to incrementing the significand of my max value before the multiplication?

Not Sure 2009-03-13 22:02:13

That may work as well. Somewhere, however, you are either going to need to do a rejection test, or have a not-perfect distribution of values.An analogue of this problem is, say, generating an integer 0-256 from a random int 0-65535. It just doesn't map evenly.

rlbond 2009-03-13 22:24:15

Actually, I just tried Crashworks test suggestion, and the truncation does in fact round up.

Not Sure 2009-03-13 22:58:39

Answer 3

+4 A:

The SSE ops will subtly change the behavior of this algorithm because they don't have the intermediate 80-bit representation -- the math truly is done in 32 or 64 bits. The good news is that you can easily test it and see if it changes your results by simply specifying the /ARCH:SSE2 command line option to MSVC, which will cause it to use the SSE scalar ops instead of x87 FPU instructions for ordinary floating point math.

I'm not sure offhand of what the exact rounding behavior is around the integer boundaries, but you can test to see what'll happen when 1.999.. gets rounded from 64 to 32 bits by eg

static uint64 OnePointNineRepeating = 0x3FF FFFFF FFFF FFFF // exponent 0 (biased to 1023), all 1 bits in mantissa
double asDouble = *(double *)(&OnePointNineRepeating);
float asFloat = asDouble;
return asFloat;

Edit, result: original poster ran this test and found that with truncation, the 1.99999 will round up to 2 both with and without /arch:SSE2.

Crashworks 2009-03-13 22:03:39

Now why didn't I think of running that test among the others I ran :) I did discover that with truncation, the 1.99999 will round up to 2 both with and without /arch:SSE2. Thanks!

Not Sure 2009-03-13 23:05:35

Glad to help -- I was curious what the result of the test would be myself.

Crashworks 2009-03-13 23:11:04

ansaurus

tags:

views:

answers:

Floating point rounding when truncating

related questions