ansaurus

Question

Confusion while counting elapsed time in python?

Answer 1

+3 A:

There may be many explanations, such as a different set of directories (and zipfiles) on sys.path, automatically loaded/executed code at initialization, other processes running on the platform -- your code is not at all isolated nor repeatable, therefore its results are of very little value. Use python -mtimeit to measure things much, much better.

Edit: some numbers...:

$ py26 -mtimeit 'q=2**1000000000; q+=3<<1000000000'
10 loops, best of 3: 466 msec per loop
$ py31 -mtimeit 'q=2**1000000000; q+=3<<1000000000'
10 loops, best of 3: 444 msec per loop

these accurately measure the += time (slightly better / more optimized in 3.1, repeatably). If you want to measure the shifting or raising to power times, then you can't use literals, of course (since literal expressions get computed at compile time, not at run time: part of why you want all your significant code to be in functions in modules, not in top-level code nor in your main script... so the compiler can do as much of the work as easily feasible, despite its lack of any serious optimizations;-). E.g.:

$ py31 -mtimeit -s't=2' 'q=t**1000000'
10 loops, best of 3: 19.4 msec per loop
$ py31 -mtimeit -s't=3' 'q=t<<1000000'
10000 loops, best of 3: 150 usec per loop

(just takes too long to do them with the larger RHS operand you're using and I'm getting impatient;-). Mixing the ops would of course be a sad disaster in terms of measuring, since the relatively fast ones would essentially disappear in the mix!-) Fortunately, there is no good reason for such mixing -- timeit, after all, is for micro benchmarking!-)

Alex Martelli 2010-08-08 22:26:15

still, it wont take as much as 35 secs for loading these sets (it wont take that long while running other programs). I was thinking of the possibility that python might have some done some optimizations procedures in the background that computed the value in `q` before reaching computing the `start` variable. Does python perform such optimizations by discarding the sequential order flow?

banx 2010-08-08 22:30:26

@banx, nope, the Python optimizer is deliberately designed to be extremely simple, predictable, fast, bug-free, and therefore simple-minded -- it doesn't do even many "obvious" optimizations, much less any subtle ones. Still, the __literals__ that you're using **do** indeed get computed at compilation time, of course (that's not even an optimization, it's a no-brainer part of compilation itself).

Alex Martelli 2010-08-08 22:49:48

@Alex_Martelli: Thanks for the lengthy explanation, THC4k pointed out to my mistake of using literals. Your explanation helps me with the original task to compare the 2 versions of python by learning about the `--mtimeit` option.

banx 2010-08-08 23:13:19

Answer 2

+2 A:

Heh, iteresting problem, took me a while to figure it out:

from time import time
start = time()
q = 2 ** 1000000000 # number literal
q += 3 << 1000000000 # still a literal
print(q.__sizeof__(), time() - start)

Python's compiler (!) computes q. When the script runs, the interpreter takes the time, loads the already computed value and takes the time again. Now unsurprisingly, the two times are pretty much the same.

time on the other hand measures how long the full run (compile+run) takes.

THC4k 2010-08-08 22:44:52

Indeed, if I use a variable `x=1000000000` and replace that number with x in the other 2 instructions, then the results would almost match. The calculations of literals are done prior to the execution of the code. Thank you!

banx 2010-08-08 22:50:15

ansaurus

tags:

views:

answers:

Confusion while counting elapsed time in python?

related questions