ansaurus

Question

Python: how so fast?

Answer 1

+5 A:

Hate to rain on your parade, but the reason it's so fast is because the math module is actually not implemented in Python.

Python supports loading shared libraries that export Python APIs, but are implemented in other languages. math.so, which provides the module you get from import math, happens to be one of those (and so is _random.so).

clee 2010-01-23 23:27:00

I think it begs the question: how is it done in C?

telliott99 2010-01-23 23:42:08

Find out yourself! http://svn.python.org/view/python/trunk/Modules/_math.c?view=markup :)

clee 2010-01-23 23:51:00

Yow! That's like reading it in the original Greek. But I'll take a stab, thank you.

telliott99 2010-01-23 23:59:12

@telliott99 Go on, ask us how the dictionary is so fast. :D

wisty 2010-01-24 00:46:46

@wisty I just ordered a copy of "Beautiful Code", so I'm sure I'll find out. (Chapter 18) is by Andrew Kuchling on just this topic ;)

telliott99 2010-02-03 22:35:02

Answer 2

A:

I know little to nothing about how this is actually implemented in Python, but given that this is basically primitive multiplication and logarithms, I'm not terribly surprised that it's reasonably fast even on quite large numbers.

There are arbitrary precision math libraries such as GMP that implement a wide variety of operations in a very effective manner, optimized in assembly, for just this purpose.

calmh 2010-01-23 23:28:48

Answer 3

+2 A:

Python converts it to decimal pretty darned fast.

I don't know Python, but no, it needn't to do that. 2^19937 don't need computations, it is simply a binary shift ("<<") with 19937, so it is very fast. Only if you print that in decimal the actual conversion is necessary and that is much slower.

EDIT: Exponentiation can be the same as shifting (=moving the point) if the number base is identical to the exponent base.

10^-1 = 0.1 10^0 = 1
10^1 = 10
10^2 = 100
10^3 = 1000
10^n = 1 (n zeroes)

You see that exponentiation of 10 with the exponent n simply shifts the number. Now computers mostly use the internal base 2 (bits) , so calculating 2^19937 is setting a bit in position 19937 (counting bit positions from zero).
As additional information: The conversion to decimal is normally implemented by a conquer-and-divide algorithm which successively divides the number by powers of ten. As you see, the actual conversion is slower by a factor of half a million.

The second example is more interesting: As you are computing m^n with large integers m,n the fastest way is multiplying it in succession and store the temporary results.

Example: 10^345

a = 10^2
b = a*a = 10^4
c = b*b = 10^16
d = c*c = 10^256

result = d*c*c*c*c*c*c*c*c*b*b*10

(Can be further optimized: see Knuth, Seminumerical Algorithms)

So you only need long multiplications and they can be computed pretty effectively.

EDIT: The exact implementation of multiplication depends: Besides the normal school multiplication Karatsuba, Tom-Cooke and Schoenhagen-Strasse (FFT) multiplication is used.

Thorsten S. 2010-01-23 23:48:54

@Thorsten S. I'm obviously out of my depth but I'm not so sure you're right. *Multiplication* is a simple binary shift, once you figure out how many places to shift, but how can exponentiation be the same? It's a second step of multiplication to get the number of places, right?

telliott99 2010-01-24 21:12:24

See comment above

Thorsten S. 2010-01-24 21:38:25

Answer 4

+3 A:

When compiling to byte code, constant expressions such as 2**19937 will optimized down to a single constant:

>>> def foo(): return 2**10
... 
>>> import dis
>>> dis.dis(foo)
  1           0 LOAD_CONST               3 (1024)
              3 RETURN_VALUE        
>>>

Consider instead:

[~] python -m timeit 'x=2' 'x**19937'
1000 loops, best of 3: 210 usec per loop

truppo 2010-01-23 23:49:35

Thanks. Excellent point.

telliott99 2010-01-23 23:53:20

ansaurus

tags:

views:

answers:

Python: how so fast?

related questions