ansaurus

Question

Answer 1

+2 A:

You should just check if you're above or equal to 0.5 above the running_value. I.e:

int16 round = (running_value >> 7) & 1; // =1 if decimal part >= 0.5, =0 otherwise.
spot_value = running_value >> 8 + round;  //downscale again

Jonas Byström 2009-09-05 21:56:09

Answer 2

+8 A:

You could add 1/2 to the value. In your scaled system that's 128.

int8 spot_value = 21;  //arbitrary. just need a starting point
int16 running_value;

running_value = spot_value << 8;  //multiply by 256 which is 5376
running_value += 154;  //my upscaled value is now 5530
spot_value = running_value + 128;  //add an additional 1/2 for rounding
spot_value = spot_value >> 8;  //downscale again

or...

scale_shift = 8;
scale = 1 << scale_shift
round = scale >> 1;

int8 spot_value = 21;  //arbitrary. just need a starting point
int16 running_value;

running_value = spot_value << scale_shift;  //multiply by 256 which is 5376
running_value += 154;  //my upscaled value is now 5530
spot_value = (running_value + round) >> scale_shift;  //downscale again

or a bit trick based on Dav's answer...

running_value = spot_value << scale_shift;  //multiply by 256 which is 5376
running_value += 154;  //my upscaled value is now 5530
spot_value = (running_value >> 8) + ((running_value >> 7) & 1)

.

Nosredna 2009-09-05 21:56:24

Right, so then my next question is this: "how does one get as smart as you?" :) (and that applies to all the answers - thanks everyone!) cheers.

Steven 2009-09-05 22:12:42

I'm not that smart. I'm just old enough to have had my livelihood depend on writing entire games before computers had floating point hardware. :-)

Nosredna 2009-09-05 22:17:36

If you would have given me 32 bits to work with, I may have suggested multiplying by 129 then shifting down. :-)

Nosredna 2009-09-05 23:20:17

Your answer is useful for positive values but misleading for negative values. First, doing bit shifts of signed values is risky because the effects are platform dependent. Do multiplies/divides instead, and rely on the compiler to do the right optimisation. Secondly, when downscaling, on many platforms negative numbers are truncated towards zero. So, for negative values you need to pre-add the **negative** of half the divisor. -128 in the above example.

Craig McQueen 2009-09-06 03:24:03

Yep. You're absolutely right.

Nosredna 2009-09-06 04:46:37

Your first code snippet has a typo on the last two lines: the first assignment to spot_value is not used in the second.

bk1e 2009-09-06 20:11:28

@bk1e, ah thanks. Editing error.

Nosredna 2009-09-06 21:06:47

Answer 3

+3 A:

Simply add something to the value before you downscale it. That is, when downshifting by 8, add 128 and it will result in a rounded value.

For the general case, when scaling down by N bits, add 1 << (N-1) before downscaling to get a rounded value.

Jason 2009-09-05 21:57:17

Answer 4

+3 A:

running_value = spot_value << 8;  //multiply by 256 which is 5376
running_value += 154;  //my upscaled value is now 5530
if(spot_value & (1 << 7)) {
    spot_value = (running_value >> 8) + 1;  //downscale again, but add 1 since the top bit of removed was set, i.e. ">= 1/2"
} else {
    spot_value = running_value >> 8; //downscale normally
}

Amber 2009-09-05 21:58:31

Answer 5

A:

One thing I have been burned by

int8 spot_value = 21;  //arbitrary. just need a starting point
int16 running_value;

running_value = spot_value << 8;  //multiply by 256 which is 5376

You are doing a leftshift by 8 on the 8-bit variable spot_value. You are putting the value in a 16-bit variable, so it should work fine. However, if you port the code to a new micro, or switch compilers, or change optimizations, or refactor the code to an inline, etc you could inadvertently shift out your entire value.

Basically, it comes down to not trusting the compiler. On an embedded system, you don't have the robustness of gcc. I usually use an accumulator convention to compensate for this:

int8 spot_value = 21;  //arbitrary. just need a starting point
int16 running_value;

running_value = spot_value;  //load accumulator for multiply
running_value <<= 8;         //multiply accumulator (which is 21) by 256

The same principle applies for multiplication and division, especially when change bit widths.

Ben Gartner 2009-09-07 18:47:14

Answer 6

A:

Note that is is only right-shifting of signed integers that is undefined. Left shift is defined. There are three likely behaviours:

1) the compiler will use 'sign-extension' and fill the vacated MSBs with the value of the original MSB (the behaviour you need for the code given to work).

2) Fill vacated bits with zero

3) Fill vacated bits with one

It is possible I suppose but unlikely that the vacated bits are random. Personally I have never observed behaviour other than (1), but the possibility is there, and should be considered if portability and robustness is desired.

As already commented, using * and / is a solution. I have observed the generated code of a number of compilers and invariably a multiply or divide by a literal constant power-of-two will be implemented by a shift operation if on the architecture it is more efficient; even without explicitly setting an optimisation option.

Clifford 2009-09-07 20:26:35

-1 for a pretty weak answer.

JonH 2010-03-07 19:24:35

@JonH: The answer was in response to Craig McQueen's comment to the original question, but the explanation was too large for a comment. Sorry if that was not clear. I wonder why you chose to comment on this 6 months after the event? Nothing to do with revenge for an earlier down-vote of course ;-)

Clifford 2010-03-07 20:56:01

ansaurus

tags:

views:

answers:

"rounding" with integers

related questions