ansaurus

Question

Calculating conditional probabilities from joint pmfs in numpy, too slow. Ideas? (python-numpy)

Answer 1

A:

Where I say "stuff in bold" I mean:

d = array(d[val])

mhourdakis 2010-02-04 13:28:38

no need to "answer" to clarify. Just edit your question.

Andrew Jaffe 2010-02-04 13:32:28

Answer 2

A:

Ok, found the answer myself after playing a little with numpy's in-place array manipulations.

Changed the last 3 lines in the loop to:

    d = conditionalize(d, dim, val)

where conditionalize is defined as:

    def conditionalize(arr, dim, val):
        arr = arr.swapaxes(dim, 0)
        shape = arr.shape[1:]       # shape of the sub-array when we omit the desired dimension.
        count = array(shape).prod() # count of elements omitted the desired dimension.
        arr = arr.reshape(array(arr.shape).prod()) # flatten the array in-place.
        arr = arr[val*count:(val+1)*count] # take the needed elements
        arr = arr.reshape((1,)+shape) # the desired sub-array shape.
        arr = arr. swapaxes(0, dim)   # fix dimensions

        return arr

That made my program's execution time reduce from 15 minutes to 6 seconds. Huge gain.

I hope this helps someone who comes across the same problem.

mhourdakis 2010-02-07 20:23:28

ansaurus

tags:

views:

answers:

Calculating conditional probabilities from joint pmfs in numpy, too slow. Ideas? (python-numpy)

related questions