ansaurus

Question

Answer 1

A:

You could create a subclass of one of the existing 2d sparse arrays

from scipy.sparse import dok_matrix

class sparse1d(dok_matrix):
    def __init__(self, v):
        dok_matrix.__init__(self, (v,))
    def dot(self, other):
        return dok_matrix.dot(self, other.transpose())[0,0]

a=sparse1d((1,2,3))
b=sparse1d((4,5,6))
print a.dot(b)

gnibbler 2010-03-29 18:16:22

Unfortunately, the issue with that is that you have to transpose the dang things on the fly, which doesn't make a lot of sense when you're doing millions of comparisons. I tried caching the dot products, but unfortunately, we don't do the same dot products very often, so that didn't help much.

spitzanator 2010-03-30 18:53:44

Answer 2

A:

I'm not sure that it is really much better or faster, but you could do this to avoid using the transpose:

Asp.multiply(Bsp).sum()

This just takes the element-by-element product of the two matrices and sums the products. You could make a subclass of whatever matrix format you are using that has the above statement as the dot product.

However, it is probably just easier to tranpose them:

Asp*Bsp.T

That doesn't seem like so much to do, but you could also make a subclass and modify the mul() method.

Justin Peel 2010-03-29 18:59:31

I also tried, for a vector [1, 2, 3], creating a matrix:[1, 2, 3][2, 0, 0][3, 0, 0]Taking two of these and multiplying (in any order) gives the desired dot product in the top left of the result matrix. Unfortunately, this severely negatively-impacted speed.

spitzanator 2010-03-30 18:55:12

ansaurus

tags:

views:

answers:

Scipy sparse... arrays?

related questions