ansaurus

Question

Answer 1

A:

The only change that comes to mind is to move some operations out of the inner loop:

for i in xrange(width):
    if i%(width/10)==0:
        print i,    
    if i%20==0:
        print '.',
    arri = arr[i]
    is1x = i - s1x
    is2x = i - s2x
    for j in xrange(height):
        d1 = hy(is1x,j-s1y)
        d2 = hy(is2x,j-s2y)
        arri[j] = abs(d1-d2)

The improvement, if any, will probably be minor though.

Jacek Konieczny 2010-05-15 20:14:16

Answer 2

+1 A:

List comprehensions are usually faster than equivalent explicit loops. For example, instead of

for j in xrange(height):
    d1 = hy(i-s1x,j-s1y)
    d2 = hy(i-s2x,j-s2y)
    arr[i][j] = abs(d1-d2)

You'd write

arr[i] = [abs(hy(i-s1x,j-s1y) - hy(i-s2x,j-s2y)) for j in xrange(height)]

On the other hand, if you're really trying to "optimize", then you might want to reimplement this algorithm in C, and use SWIG or the like to call it from python.
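As a minimal sketch of the equivalence (written for Python 3, so `range` replaces `xrange`; the grid size and source positions are made-up illustrative values, and `hy` is assumed to be `math.hypot` as in the question), the nested loops and the comprehension produce the same list of lists:

```python
import math

# Hypothetical small grid and source positions, chosen only for illustration.
width, height = 4, 5
s1x, s1y = 0.5, 0.5
s2x, s2y = 3.5, 4.5
hy = math.hypot  # assumed to match the `hy` alias used in the question


def fill_with_loops():
    # Pre-allocate a width x height list of lists, then fill it element by element.
    arr = [[0.0] * height for _ in range(width)]
    for i in range(width):
        for j in range(height):
            arr[i][j] = abs(hy(i - s1x, j - s1y) - hy(i - s2x, j - s2y))
    return arr


def fill_with_comprehension():
    # Same computation, one comprehension per row.
    return [[abs(hy(i - s1x, j - s1y) - hy(i - s2x, j - s2y))
             for j in range(height)]
            for i in range(width)]


loops_result = fill_with_loops()
comprehension_result = fill_with_comprehension()
```

How much time this saves depends on the interpreter and the grid size, so it is worth measuring with `timeit` before relying on it.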

Jonathan Feinberg 2010-05-15 20:32:21

seems like a good enough improvement for lists, however it gives me "ValueError: shape mismatch: objects cannot be broadcast to a single shape" when i try it on numpy arrays, no matter how i try it

LWolf 2010-05-15 21:31:13

See kaizer's answer; you can't treat NumPy objects like Python data structures.
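One possible cause, sketched under the assumption that `arr` was already a (width, height, 3) NumPy array (the thread does not confirm this): a flat row of `height` values cannot be broadcast into a (height, 3) slice. The sizes below are illustrative, and the exact error text varies between NumPy versions.

```python
import numpy as np

# Assumed shapes for illustration only; the asker's actual array is not shown.
width, height = 3, 4
arr = np.zeros((width, height, 3))
row = [float(j) for j in range(height)]

# Assigning a length-`height` row into a (height, 3) slice fails to broadcast.
try:
    arr[0] = row
    assignment_failed = False
except ValueError:
    assignment_failed = True

# Adding a trailing axis gives shape (height, 1), which broadcasts to (height, 3).
arr[0] = np.asarray(row)[:, np.newaxis]
```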

Jonathan Feinberg 2010-05-15 23:22:19

Answer 3

+4 A:

Interference patterns are fun, aren't they?

So, first off this is going to be minor because running this program as-is on my laptop takes a mere twelve and a half seconds.

But let's see what can be done about doing the first bit through numpy array operations, shall we? Basically, you want:

arr[i][j] = abs(hypot(i-s1x,j-s1y) - hypot(i-s2x,j-s2y))

For all i and j.

So, since numpy has a hypot function that works on numpy arrays, let's use that. Our first challenge is to get an array of the right size with every element equal to i and another with every element equal to j. But this isn't too hard; in fact, an answer below pointed me at the wonderful numpy.mgrid, which I didn't know about before and which does just this:

array_i,array_j = np.mgrid[0:width,0:height]
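To make the result concrete, here is a small sketch with a tiny, made-up grid size: `array_i` holds the row index at every position and `array_j` holds the column index.

```python
import numpy as np

width, height = 3, 4  # tiny illustrative size, not the 800x800 from the question

array_i, array_j = np.mgrid[0:width, 0:height]

# array_i[i, j] == i for every position (each row is constant)
# array_j[i, j] == j for every position (each column is constant)
```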

There is the slight matter of making your (width, height)-sized array into (width,height,3) to be compatible with your image-generation statements, but that's pretty easy to do:

arr = (arr * np.ones((3,1,1))).transpose(1,2,0)
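A small sketch of what that line does to the shape, using a made-up 2x3 array. Multiplying by `np.ones((3,1,1))` broadcasts to three stacked copies, and the transpose moves the copy axis to the end. `np.dstack` is shown as one alternative that should give the same result:

```python
import numpy as np

# Hypothetical 2x3 input standing in for the (width, height) array.
arr = np.arange(6, dtype=float).reshape(2, 3)

# Broadcast to shape (3, 2, 3), then move the layer axis last: (2, 3, 3).
rgb = (arr * np.ones((3, 1, 1))).transpose(1, 2, 0)

# Alternative: stack three copies along a new third axis.
stacked = np.dstack((arr, arr, arr))
```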

Then we plug this into your program, and let things be done by array operations:

import math, array
import numpy as np
from PIL import Image

size = (800,800)
width, height = size

s1x = width * 1./8
s1y = height * 1./8
s2x = width * 7./8
s2y = height * 7./8

r,g,b = (255,255,255)

array_i,array_j = np.mgrid[0:width,0:height]

arr = np.abs(np.hypot(array_i-s1x, array_j-s1y) -
             np.hypot(array_i-s2x, array_j-s2y))

arr = (arr * np.ones((3,1,1))).transpose(1,2,0)

arr2 = np.zeros((width,height,3),dtype="uint8")
for ld in [200,116,100,84,68,52,36,20,8,4,2]:
    print 'now computing image for ld = '+str(ld)
    # Rest as before

And the new time is... 8.2 seconds. So you save maybe four whole seconds. On the other hand, that's almost exclusively in the image generation stages now, so maybe you can tighten them up by only generating the images you want.

Daniel Martin 2010-05-15 20:54:57

well, it seems that both your answer and the one below shave a nice 30 secs off the timer (yeah, i've got a crappy pc, and the image saving part is also 10 secs for me), and while i'm uncertain whose answer to accept, i'll go for yours since it seems that you've put some more effort into it

LWolf 2010-05-15 21:57:59

Our answers do the exact same thing. You have to value explanations though, that's why we are here on stackoverflow, so well done. And @LWolf, you should upvote the answer you accept, since you find it useful.

kaizer.se 2010-05-15 22:11:41

vote up requires 15 reputation ^.^ sorry but i can't xD

LWolf 2010-05-15 22:24:26

I see :-) Welcome to stackoverflow!

kaizer.se 2010-05-18 20:17:05

Answer 4

+2 A:

If you use array operations instead of loops, it is much, much faster. For me, the image generation is now what takes most of the time. Instead of your two i,j loops, I have this:

I,J = np.mgrid[0:width,0:height]
D1 = np.hypot(I - s1x, J - s1y)
D2 = np.hypot(I - s2x, J - s2y)

arr = np.abs(D1-D2)
# triplicate into 3 layers
arr = np.array((arr, arr, arr)).transpose(1,2,0)
# .. continue program

The basic thing to remember for the future is this: it is not about optimization; using array forms in numpy is just using it the way it is supposed to be used. With experience, your future projects should not take the detour through python loops; the array forms should be the natural form.

What we did here was really simple. Instead of math.hypot we found numpy.hypot and used it. Like all such numpy functions, it accepts ndarrays as arguments, and does exactly what we want.
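As a quick sanity check, here is a sketch (Python 3, with a small made-up grid rather than 800x800) showing that the array form gives the same values as the nested `math.hypot` loops:

```python
import math
import numpy as np

# Small illustrative grid; source positions follow the 1/8 and 7/8 pattern used above.
width, height = 6, 5
s1x, s1y = width / 8.0, height / 8.0
s2x, s2y = width * 7 / 8.0, height * 7 / 8.0

# Loop version: one math.hypot call pair per element.
looped = np.zeros((width, height))
for i in range(width):
    for j in range(height):
        looped[i, j] = abs(math.hypot(i - s1x, j - s1y) -
                           math.hypot(i - s2x, j - s2y))

# Array version: np.hypot operates on whole index grids at once.
I, J = np.mgrid[0:width, 0:height]
vectorized = np.abs(np.hypot(I - s1x, J - s1y) - np.hypot(I - s2x, J - s2y))
```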

kaizer.se 2010-05-15 20:56:27

You can replace the line you call "clumsy" with `arr = (arr * np.ones((3,1,1))).transpose(1,2,0)`

Daniel Martin 2010-05-15 21:14:13

thanks. that multiplication looks costly though. I realize now that maybe `np.array((arr,arr,arr)).transpose(1,2,0)` works.

kaizer.se 2010-05-15 22:09:39

what *is* optimization? that's the thing you shouldn't do! :-) It's ridiculous things like putting your code inside a function, so that `hy= ..` becomes a local variable with fast lookup when used in the for loop. not relevant now, but it illustrates the nature of silly optimization.
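For the curious, a minimal sketch of that trick (Python 3; the function names are hypothetical). Both functions compute the same sum; the second binds `math.hypot` to a local name once, so the loop avoids a global lookup on each iteration. Any speed difference is typically small and should be measured rather than assumed.

```python
import math

hy = math.hypot  # module-level (global) name, as in the question


def total_with_global(n):
    total = 0.0
    for i in range(n):
        total += hy(i, 1.0)  # `hy` is looked up in the global namespace each time
    return total


def total_with_local(n):
    local_hy = math.hypot  # bound once; local names are resolved faster in CPython
    total = 0.0
    for i in range(n):
        total += local_hy(i, 1.0)
    return total


global_total = total_with_global(1000)
local_total = total_with_local(1000)
```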

kaizer.se 2010-05-15 22:32:54

better improvements are using better algorithms (like this answer) and doing less work (make your code use grayscale, drop the 3 duplicated layers).

kaizer.se 2010-05-15 22:33:38


Python optimization problem?
