views: 441
answers: 8
I have a bunch of floating point numbers (Java doubles), most of which are very close to 1, and I need to multiply them together as part of a larger calculation. I need to do this a lot.

The problem is that while Java doubles have no problem with a number like:

0.0000000000000000000000000000000001 (1.0E-34)

they can't represent something like:

1.0000000000000000000000000000000001

As a result I lose precision rapidly (the limit seems to be around 1.000000000000001 for Java's doubles).

I've considered just storing the numbers with 1 subtracted, so for example 1.0001 would be stored as 0.0001 - but the problem is that to multiply them together I have to add the 1 back, and at that point I lose precision.

To address this I could use BigDecimals to perform the calculation (convert to BigDecimal, add 1.0, then multiply), and then convert back to doubles afterwards, but I have serious concerns about the performance implications of this.
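
For reference, here is roughly what I mean by the BigDecimal approach (a sketch only; the offsets and precision are illustrative, not my real data):

import java.math.BigDecimal;
import java.math.MathContext;

// Values are stored as their offset x from 1, i.e. the real factor is (1 + x).
double[] offsets = {1e-4, 2e-8, -3e-6};
BigDecimal product = BigDecimal.ONE;
for (double x : offsets) {
    BigDecimal factor = BigDecimal.ONE.add(new BigDecimal(x));
    product = product.multiply(factor, MathContext.DECIMAL128);
}
double result = product.doubleValue();   // precision is lost again on the conversion back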

Can anyone see a way to do this that avoids using BigDecimal?

Edit for clarity: This is for a large-scale collaborative filter, which employs a gradient descent optimization algorithm. Accuracy is an issue because often the collaborative filter is dealing with very small numbers (such as the probability of a person clicking on an ad for a product, which may be 1 in 1000, or 1 in 10000).

Speed is an issue because the collaborative filter must be trained on tens of millions of data points, if not more.

+11  A: 

Yep: because

(1 + x) * (1 + y) = 1 + x + y + x*y

In your case, x and y are very small, so x*y is going to be far smaller - way too small to influence the results of your computation. So as far as you're concerned,

(1 + x) * (1 + y) ≈ 1 + x + y

This means you can store the numbers with 1 subtracted, and instead of multiplying, just add them up. As long as the results are always much less than 1, they'll be close enough to the mathematically precise results that you won't care about the difference.

EDIT: Just noticed: you say most of them are very close to 1. Obviously this technique won't work for numbers that are not close to 1 - that is, if x and y are large. But if one is large and one is small, it might still work; you only care about the magnitude of the product x*y. (And if both numbers are not close to 1, you can just use regular Java double multiplication...)
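
In Java, a rough sketch of this store-the-offsets-and-add idea might look like the following (the offsets are purely illustrative):

// Store each factor as its offset from 1 and add the offsets instead of multiplying.
double[] offsets = {1e-4, -2.5e-5, 3e-6};   // represents 1.0001, 0.999975, 1.000003
double sum = 0.0;
for (double off : offsets) {
    sum += off;   // (1+a)(1+b)(1+c) is approximately 1 + a + b + c for tiny offsets
}
// 'sum' is the product minus 1; only form 1 + sum when you actually need the full value.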

David Zaslavsky
Thanks David, this has certainly provided food for thought - it may be the answer but I'll leave it a bit longer to see what others suggest.
sanity
+1 - faster than logs :D
v3
It would be better to simply use the first equation in any case. Whether x*y is close to 0 or not, it still works...
Pool
Dropping the x*y saves you a multiplication, which makes the whole computation significantly faster. And since doubles only store about 15 digits of precision (IIRC), if (x*y)/(x + y) is smaller than 10^-15 it'd get truncated off anyway.
David Zaslavsky
+3  A: 

Isn't this sort of situation exactly what BigDecimal is for?

Edited to add:

"Per the second-last paragraph, I would prefer to avoid BigDecimals if possible for performance reasons." – sanity

"Premature optimization is the root of all evil" - Knuth

There is a simple solution practically made to order for your problem. You are concerned it might not be fast enough, so you want to do something complicated that you think will be faster. The Knuth quote gets overused sometimes, but this is exactly the situation he was warning against. Write it the simple way. Test it. Profile it. See if it's too slow. If it is then start thinking about ways to make it faster. Don't add all this additional complex, bug-prone code until you know it's necessary.

Chris Upchurch
Per the second-last paragraph, I would prefer to avoid BigDecimals if possible for performance reasons.
sanity
This isn't a premature optimization. Performance with plain doubles is already a concern, and I've done some benchmarking: BigDecimal seems several orders of magnitude slower. It may be the solution I go with, but I want to consider alternatives.
sanity
Hmm :/ you didn't cite the complete Knuth quote: "We should forget about small efficiencies, say about 97% of the time: premature optimization is the root of all evil" http://en.wikipedia.org/wiki/Optimization_%28computer_science%29
Jason S
+10  A: 

Perhaps you could use logarithms?

Logarithms conveniently reduce multiplication to addition.

Also, to take care of the initial precision loss, there is the function log1p (at least, it exists in C/C++), which returns log(1+x) without any precision loss. (e.g. log1p(1e-30) returns 1e-30 for me)

Then you can use expm1 (which computes e^x - 1) to recover the final result minus 1.
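
In Java, Math.log1p and Math.expm1 (available since Java 5) play the same role. A rough sketch, with illustrative values:

// Sum the logs of (1 + x_i) without ever forming 1 + x_i, then recover product - 1.
double[] offsets = {1e-8, 2e-6, 3e-7, 4e-5};    // each factor is (1 + x)
double logSum = 0.0;
for (double x : offsets) {
    logSum += Math.log1p(x);                    // log(1 + x), accurate for tiny x
}
double productMinusOne = Math.expm1(logSum);    // (product of all factors) - 1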

v3
Kind of the same idea as my answer, since log(1+x) ≈ x for very small x... anyway +1 for using the math to optimize ;-)
David Zaslavsky
ouch my head hurts
ojblass
A: 

If you really need the precision, you will have to use something like BigDecimal, even if it's slower than double.

If you don't really need the precision, you could perhaps go with David's answer. But even if you do a lot of multiplications, avoiding BigDecimal might be premature optimization, so it might be the way to go anyway.

Caotic
+1  A: 

Depending on where the numbers are coming from and how you are using them, you may want to use rationals instead of floats. Not the right answer for all cases, but when it is the right answer there's really no other.

If rationals don't fit, I'd endorse the logarithms answer.

Edit in response to your edit:

If you are dealing with numbers representing low response rates, do what scientists do:

  • Represent them as the excess / deficit (normalize out the 1.0 part)
  • Scale them. Think in terms of "parts per million" or whatever is appropriate.

This will leave you dealing with reasonable numbers for calculations.
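
As a rough Java illustration of that normalise-and-scale idea (the values and scale here are made up):

// Keep only the deviation from 1 for each factor, scaled to parts per million,
// and never materialise the 1.000... value itself.
final double PPM = 1e6;
double[] deviationsPpm = {100.0, -37.5, 0.004};   // e.g. +100 ppm, -37.5 ppm, +0.004 ppm
double totalPpm = 0.0;
for (double d : deviationsPpm) {
    totalPpm += d;   // (1+x)(1+y) is approximately 1 + x + y for tiny x, y
}
// Convert back only at the very end, if a plain double is really needed:
double approxProduct = 1.0 + totalPpm / PPM;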

MarkusQ
A: 

When you say "most of which are very close to 1", how many, exactly?

Maybe you could have an implicit offset of 1 in all your numbers and just work with the fractional parts.

Jimmy J
+1  A: 

It's worth noting that you are testing the limits of your hardware rather than of Java: Java uses the 64-bit floating-point unit in your CPU.

I suggest you test the performance of BigDecimal before you assume it won't be fast enough for you. You can still do tens of thousands of calculations per second with BigDecimal.
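
For example, a rough (non-rigorous) timing sketch along those lines might look like the code below; a serious measurement would need warm-up and repeated runs, and the sizes and values here are arbitrary:

import java.math.BigDecimal;
import java.math.MathContext;

public class RoughBench {
    public static void main(String[] args) {
        int n = 1000000;
        double[] xs = new double[n];
        for (int i = 0; i < n; i++) xs[i] = 1.0 + 1e-9 * i;

        // Plain double multiplication.
        long t0 = System.nanoTime();
        double p = 1.0;
        for (double x : xs) p *= x;
        long t1 = System.nanoTime();

        // The same work in BigDecimal.
        BigDecimal bp = BigDecimal.ONE;
        for (double x : xs) bp = bp.multiply(new BigDecimal(x), MathContext.DECIMAL64);
        long t2 = System.nanoTime();

        System.out.printf("double: %d ms, BigDecimal: %d ms%n",
                (t1 - t0) / 1000000, (t2 - t1) / 1000000);
        System.out.println(p + " vs " + bp.doubleValue());
    }
}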

Peter Lawrey
+1  A: 

As David points out, you can just add the offsets up.

(1+x) * (1+y) = 1 + x + y + x*y

However, it can be risky to simply drop that last term. Don't. For example, try this:

x = 1e-8
y = 2e-6
z = 3e-7
w = 4e-5

What is (1+x)*(1+y)*(1+z)*(1+w)? In double precision, I get:

(1+x)*(1+y)*(1+z)*(1+w)

ans =

      1.00004231009302

However, see what happens if we just do the simple additive approximation.

1 + (x+y+z+w)

ans =

            1.00004231

We lost the low order bits that may have been important. This is only an issue if some of the differences from 1 in the product are at least sqrt(eps), where eps is the precision you are working in.

Try this instead:

f = @(u,v) u + v + u*v;
result = f(x,y);
result = f(result,z);
result = f(result,w);
1+result

ans =

      1.00004231009302

As you can see, this gets us back to the double precision result. In fact, it is a bit more accurate, since the internal value of result is 4.23100930230249e-05.
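
For the Java side, a rough translation of the same idea (a sketch, not the snippet above verbatim) might be:

// Accumulate the product of (1 + x_i) as its offset from 1, keeping the
// cross term so the low-order bits are not dropped.
public class OffsetProduct {
    static double combine(double u, double v) {
        return u + v + u * v;   // (1+u)*(1+v) = 1 + (u + v + u*v)
    }

    public static void main(String[] args) {
        double x = 1e-8, y = 2e-6, z = 3e-7, w = 4e-5;
        double result = combine(combine(combine(x, y), z), w);
        System.out.println(result);         // ~4.23100930230249e-05
        System.out.println(1.0 + result);   // ~1.00004231009302
    }
}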

woodchips