ansaurus

Question

Performance: float to int cast and clipping result to range

Answer 1

+1 A:

When you cast int to short you never get clipping functionality, bits are truncated and then are read as short. E.g. (short)-40000 becomes 25536, and not -32768 as you expected.

Probably you have to edit you question, I am sure you know it if you disassembled bytecode. Also, there is a JIT compiler which might optimize this code (because it is called often) to platform dependent instructions.

Please convert this answer to comment.

leopoldkot 2010-05-27 17:00:32

Answer 2

A:

float to int conversions is one of the slowest operations you can do on an x86 processor, as it requires modifying the x87 rounding modes (twice), which serializes and flushes the processor. You can get a sizeable speedup if you can use SSE instructions instead of x87 instructions, but I have no idea if there's a way to do that in java. Perhaps try using an x86_64 JVM?

Chris Dodd 2010-05-27 19:28:56

Answer 3

+1 A:

You could turn two comparisons into one for values which are in range. This could halve the cost. Currently you perform only one comparison if the value is too negative. (which might not be your typical case)

if (sample + 0x7fff8000 < 0x7fff0000)
    sample = sample < 0 ? -32768 : 32767;

Peter Lawrey 2010-05-27 21:03:04

Your assumption is correct, in the overwhelming majority of cases sample is inside range (>99.9%). So one less branch helps, although only a bit. I love how your method uses integer overflow to check the upper bound implicitly. I'll definetly remember this trick.

Durandal 2010-05-27 21:50:14

Answer 4

A:

int sample = ((int)floatval) & 0xffff;

EJP 2010-05-28 01:02:34

This takes the remainder of floatval/0xFFFF, which is *not* what the OP wants.

Wallacoloo 2010-05-28 01:17:02

No it doesn't, it takes the remainder of floatval/0x10000 actually, but it has exactly the same effect as (short), which he was using, so it presumably is exactly what he wants. So kindly remove the downvote. But probably I2S is quicker.

EJP 2010-05-28 05:33:45

Answer 5

A:

This is Python, but should be easy to convert. I don't know how costly the floating point operations are, but if you can keep it in integer registers you might have some boost; this assumes that you can reinterpret the IEEE754 bits as an int. (That's what my poorly named float2hex is doing.)

import struct

def float2hex(v):
    s = struct.pack('f', v)
    h = struct.unpack('I', s)[0]
    return h

def ToInt(f):
    h = float2hex(f)
    s = h >> 31
    exp = h >> 23 & 0xFF
    mantissa = h & 0x7FFFFF
    exp = exp - 126
    if exp >= 16:
        if s:
            v = -32768
        else:
            v = 32767
    elif exp < 0:
        v = 0
    else:
        v = mantissa | (1 << 23)
        exp -= 24
        if exp > 0:
            v = v << exp
        elif exp < 0:
            v = v >> -exp

        if s:
            v = -v

    print v

This branching may kill you, but maybe this provides something useful anyway? This rounds toward zero.

dash-tom-bang 2010-05-28 01:10:54

dash-tom-bang 2010-05-28 01:29:06

(by cunning operation, you could check `(float_bits if that check passes, you know you're out-of-range and can do a possibly-slower set of checks to determine the sign of the overflow, since if the exponent is greater than this value, regardless of the sign bit, you're out of range.)

dash-tom-bang 2010-05-28 01:32:57

The thought of performing the cast in integer by hand crossed my mind, but I didn't know how to do it. Converting the float raw bit to int requires to call a native method in Java (Float.floatToRawIntBits), and in conjunction with the extensive checks needed, is a lot slower than the cast + compare of the accepted answer. BTW: The sign can be tested simply using h < 0 in Java, since int is signed. This eliminates the need for variable s (might not be an option in Phyton)

Durandal 2010-05-28 09:00:04

ansaurus

tags:

views:

answers:

Performance: float to int cast and clipping result to range

related questions