ansaurus

Question

Return common element indices between two numpy arrays

Answer 1

A:

how about:

wanted = set(a1)
indices =[idx for (idx, value) in enumerate(a2) if value in wanted]

This should be O(len(a1)+len(a2)) instead of O(len(a1)*len(a2))

NB I don't know numpy so there may be a more 'numpythonic' way to do it, but this is how I would do it in pure python.

Dave Kirby 2010-02-25 11:38:03

should that be enumerate(a2)?

Dave 2010-02-25 11:55:51

Oops, my bad. Fixed it now.

Dave Kirby 2010-02-25 20:13:56

Answer 2

A:

How about

numpy.nonzero(numpy.setmember1d(a2, a1))[0]

This should be fast. From my basic testing, it's about 7 times faster than your second code snippet for len(a2) == 100, len(a1) == 10000, and only one common element at index 45. This assumes that both a1 and a2 have no repeating elements.

Alok 2010-02-25 11:47:30

I compared your solution to Dave Kirby's above, with this one being approx 1.35X faster for len(a2) == 12347424, len(a1) == 1338, so this solution get's my vote - thanks!

Dave 2010-02-25 11:57:37

ansaurus

tags:

views:

answers:

Return common element indices between two numpy arrays

related questions