ansaurus

Question

Better ways to implement a modulo operation (algorithm question)

Answer 1

A:

For modulo itself, I'm not sure. For modulo as part of the larger modular exponential operation, did you look up Montgomery multiplication as mentioned in the wikipedia page on modular exponentiation? It's been a while since I've looked into this type of algorithm, but from what I recall, it's commonly used in fast modular exponentiation.

edit: for what it's worth, your modulo algorithm seems ok at first glance. You're basically doing division which is a repeated subtraction algorithm.

Jason S 2010-05-05 13:56:48

Answer 2

A:

I'm not sure what you're calculating there to be honest. You talk about modulo operation, but usually a modulo operation is between two numbers a and b, and its result is the remainder of dividing a by b. Where is the a and b in your pseudocode...?

Anyway, maybe this'll help: a mod b = a - floor(a / b) * b.

I don't know if this is faster or not, it depends on whether or not you can do division and multiplication faster than a lot of subtractions.

Another way to speed up the subtraction approach is to use binary search. If you want a mod b, you need to subtract b from a until a is smaller than b. So basically you need to find k such that:

a - k*b < b, k is min

One way to find this k is a linear search:

k = 0;
while ( a - k*b >= b )
    ++k;

return a - k*b;

But you can also binary search it (only ran a few tests but it worked on all of them):

k = 0;
left = 0, right = a
while ( left < right )
{
    m = (left + right) / 2;
    if ( a - m*b >= b )
       left = m + 1;
    else
       right = m;
}

return a - left*b;

I'm guessing the binary search solution will be the fastest when dealing with big numbers.

If you want to calculate a mod b and only a is a big number (you can store b on a primitive data type), you can do it even faster:

for each digit p of a do
    mod = (mod * 10 + p) % b
return mod

This works because we can write a as a_n*10^n + a_(n-1)*10^(n-1) + ... + a_1*10^0 = (((a_n * 10 + a_(n-1)) * 10 + a_(n-2)) * 10 + ...

I think the binary search is what you're looking for though.

IVlad 2010-05-05 14:13:37

OP is basically doing the division algorithm (by repeated subtraction, which is how you do division at a low-level). Binary search won't speed it up when there's a multiplication step (which takes just as long as division when you do it at a low level).

Jason S 2010-05-05 14:23:45

@Jason S - I'm not really sure what the OP is doing, but it looks to me like his `while` loop could be replaced with a binary search.

IVlad 2010-05-05 14:28:23

This is in very low level gate logic. Shifting is easy, fast, and simple. Binary searches are not.

Jason S 2010-05-05 16:06:02

Answer 3

+1 A:

phkahler 2010-05-05 14:56:11

this has an extra benefit: if each number, before you shift it left by one bit, is less than the modulus, then the number shifted left by one bit (which is twice the number) cannot be more than two times the modulus, which means you will only ever need to subtract the modulus once in these steps.

Noah Lavine 2010-05-08 20:52:38

Yes, I've clarified that with some working python code :-)

phkahler 2010-05-10 01:15:32

Answer 4

A:

That test (modulus(n-1) != 1) //a bit test?

-seems redundant combined with (modulus<result).

Designing for hardware implementation i would be conscious of the smaller/greater than tests implying more logic (subtraction) than bitwise operations and branching on zero.

If we can do bitwise tests easily, this could be quick:

m=msb_of(modulus)

while( result>0 ) 
{
  r=msb_of(result) //countdown from prev msb onto result
  shift=r-m        //countdown from r onto modulus or 
                   //unroll the small subtraction 

  takeoff=(modulus<<(shift))  //or integrate this into count of shift

  result=result-takeoff;  //necessary subtraction

  if(shift!=0 && result<0)
  { result=result+(takeoff>>1); }

  } //endwhile

if(result==0) { return result }
else          { return result+takeoff }

(code untested may contain gotchas)

result is repetively decremented by modulus shifted to match at most significant bits.

After each subtraction: result has a ~50/50 chance of loosing more than 1 msb. It also has ~50/50 chance of going negative, addition of half what was subtracted will always put it into positive again. > it should be put back in positive if shift was not=0

The working loop exits when result is underrun and 'shift' was 0.

strainer 2010-05-08 20:27:08

ansaurus

tags:

views:

answers:

Better ways to implement a modulo operation (algorithm question)

related questions