ansaurus

Question

Multiplication algorithm for abritrary precision (bignum) integers.

Answer 1

+1 A:

I wrote a multiplication algorithm a while ago, and I have this comment at the top. If you have two numbers x and y of the same size (same n_digits) then you would multiply like this to get n, which would have twice the digits. Part of the complexity of the algorithm comes from working out which bits not to multiply if n_digits is not the same for both inputs.

Starting from the right, n0 is x0*y0 and you save off the overflow. Now n1 is the sum of x1*y0 and y1*x0 and the previous overflow shifted by your digit size. If you are using 32 bit digits in 64 bit math, that means n0 = low32(x0*y0) and you carry high32(x0*y0) as the overflow. You can see that if you actually used 32 bit digits you could not add the center columns up without exceeding 64 bits, so you probably use 30 or 31 bit digits.

If you have 30 bits per digit, that means you can multiple two 8 digit numbers together. First write this algorithm to accept two small buffers with n_digits up to 8 and use native math for the arithmetic. Then implement it again, taking arbitrary sized n_digits and using the first version, along with your shift and add method, to multiply 8x8 chunks of digits at a time.

/*
    X*Y = N

                          x0     y3
                            \   /  
                             \ /   
                              X    
                      x1     /|\     y2
                        \   / | \   /  
                         \ /  |  \ /   
                          X   |   X    
                  x2     /|\  |  /|\     y1
                    \   / | \ | / | \   /  
                     \ /  |  \|/  |  \ /   
                      X   |   X   |   X    
              x3     /|\  |  /|\  |  /|\     y0
                \   / | \ | / | \ | / | \   /
                 \ /  |  \|/  |  \|/  |  \ /
                  V   |   X   |   X   |   V
                  |\  |  /|\  |  /|\  |  /|
                  | \ | / | \ | / | \ | / |
                  |  \|/  |  \|/  |  \|/  |
                  |   V   |   X   |   V   |
                  |   |\  |  /|\  |  /|   |
                  |   | \ | / | \ | / |   |
                  |   |  \|/  |  \|/  |   |
                  |   |   V   |   V   |   |
                  |   |   |\  |  /|   |   |
                  |   |   | \ | / |   |   |
                  |   |   |  \|/  |   |   |
                  |   |   |   V   |   |   |
                  |   |   |   |   |   |   |
              n7  n6  n5  n4  n3  n2  n1  n0
*/

drawnonward 2010-05-02 21:51:34

I am not sure what you mean by exceeding 64bits with 32-bit digits? As you can see I have 32-bit digits, and was planning to use the 64-bit word size to manage the full range.

nn 2010-05-02 22:22:00

Btw - cool diagram, it's really handy with the multiplication.

nn 2010-05-02 22:23:38

The basecase multiply avoids adding columns by going one diagonal at a time and building the result as it goes, so you can ignore the issue. If you add the products of two 32 bit multiplies you get a 65 bit value. You either have to handle that overflow (easy in asm, hassle in C) or you have to use fewer bits in each digit.

drawnonward 2010-05-03 01:42:07

Incidentally, it is cheaper to do c+=a*b then to just do c=a*b because in the latter you must initialize c to zero.

drawnonward 2010-05-03 01:46:31

don't you mean just the inverse ? ie "in the former" you must initialize c to zero, since in the later ('=') you just override c altogether.

Matthieu M. 2010-05-03 09:27:42

In this case c is an 8k buffer. The multiplication algorithm adds interim results to arbitrary digits as it runs, so all the digits of c must be initialized. In practice c=a*b is calculated as c=0; c+=a*b; so if c already has a value the add is free.

drawnonward 2010-05-03 18:22:13

Answer 2

A:

To do A*b_j, you need to do the grade school multiplication of a bignum with a single digit. You end up having to add a bunch of two-digit products together:

bn *R = ZERO;
for(int i = 0; i < n; i++) {
  bn S = {0, 2};
  S.digits[0] = a[i] * b_j;
  S.digits[1] = (((u_w)a[i]) * b_j) >> 32;  // order depends on endianness
  bn_lshift(S, i);
  R = bn_add(R, S);
}

Of course, this is very inefficient.

Keith Randall 2010-05-02 21:58:37

ansaurus

tags:

views:

answers:

Multiplication algorithm for abritrary precision (bignum) integers.

related questions