ansaurus

Question

How do I build a numpy array from a generator?

Answer 1

A:

How about creating an empty (zero-filled) array first and then iterating through the dataset, updating the array as you go? Of course, you'll have to know how big your array is going to be.

import numpy

initial_len = 10  # must be at least the number of items gimme() yields
myarray = numpy.zeros(initial_len)
for idx, value in enumerate(gimme()):
    myarray[idx] = value

monkut 2008-12-15 06:18:10

Answer 2

+9 A:

NumPy arrays, unlike Python lists, require their length to be set explicitly at creation time. This is necessary so that space for each item can be allocated consecutively in memory. Consecutive allocation is the key feature of NumPy arrays: combined with the native-code implementation, it lets operations on them execute much faster than on regular lists.
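
To make the contiguous layout concrete, here is a minimal sketch that inspects an array's flags and strides. The length of 5 is arbitrary and chosen only for illustration.

```python
import numpy as np

arr = np.zeros(5)  # length is fixed at creation; dtype defaults to float64

# The elements live in one contiguous block of memory.
print(arr.flags["C_CONTIGUOUS"])

# Each element sits exactly `itemsize` bytes after the previous one.
print(arr.itemsize, arr.strides)

# Total buffer size is simply length * itemsize.
print(arr.nbytes)
```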

Keeping this in mind, it is technically impossible to take a generator object and turn it into an array unless you either:

(a) can predict how many elements it will yield when run:

import numpy
my_array = numpy.zeros(predict_length())
for i, el in enumerate(gimme()):
    my_array[i] = el

(b) are willing to store its elements in an intermediate list:

my_array = numpy.array(list(gimme()))

(c) can make two identical generators, run through the first one to find the total length, initialize the array, and then run through the generator again to find each element:

length = sum(1 for el in gimme())
my_array = numpy.zeros(length)
for i, el in enumerate(gimme()):
    my_array[i] = el

(a) is probably what you're looking for. (b) is space inefficient, and (c) is time inefficient (you have to go through the generator twice).
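
As a rough check that the three options agree, the sketch below runs them side by side. The gimme() defined here is a hypothetical stand-in that yields five floats; it is not the asker's generator, and the length of 5 in option (a) is assumed to be known in advance.

```python
import numpy as np

def gimme():
    """Hypothetical stand-in generator: yields five floats."""
    for i in range(5):
        yield i * 0.5

# (a) length known in advance (assumed to be 5 here)
a = np.zeros(5)
for i, el in enumerate(gimme()):
    a[i] = el

# (b) materialize an intermediate list, then convert it
b = np.array(list(gimme()))

# (c) first pass counts the elements, second pass fills the array
length = sum(1 for _ in gimme())
c = np.zeros(length)
for i, el in enumerate(gimme()):
    c[i] = el

print(np.array_equal(a, b) and np.array_equal(b, c))
```

Option (c) only works when the generator can be recreated and yields the same items both times.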

shsmurfy 2008-12-15 06:31:12

Thanks, that makes a lot of sense.

saffsd 2008-12-15 07:28:37

Answer 3

+7 A:

One search beyond this Stack Overflow result, I found numpy.fromiter(iter, dtype, count). If count is -1 (the default), it reads the iterable until it is exhausted, which is what is wanted here. A dtype is also required. In my case, this worked:

numpy.fromiter(something.generate(from_this_input), float, count=-1)
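
For a self-contained version of the call above, here is a small sketch. The gimme(n) generator is hypothetical and stands in for something.generate(from_this_input). It also shows passing count when the length happens to be known, which lets NumPy pre-allocate the output instead of resizing it as items arrive.

```python
import numpy as np

def gimme(n):
    """Hypothetical stand-in generator: yields the first n squares."""
    for i in range(n):
        yield i * i

# Length unknown: count defaults to -1, so the generator is read to exhaustion.
unknown_len = np.fromiter(gimme(5), dtype=float)

# Length known: passing count allows the output to be pre-allocated.
known_len = np.fromiter(gimme(5), dtype=float, count=5)

print(unknown_len)
print(np.array_equal(unknown_len, known_len))
```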

dhill 2009-02-24 03:53:18

Interesting, I shall try it the next time I need it.

saffsd 2009-02-28 00:22:30

Answer 4

A:

Somewhat tangential, but if your generator is really a list comprehension, you can use numpy.where to get your result more efficiently. (I discovered this in my own code after seeing this post.)
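
The answer gives no code, so the following is only one possible reading of it: a conditional list comprehension over data that is already in an array can be replaced by a vectorized numpy.where call. The values array is made up for illustration.

```python
import numpy as np

values = np.array([3, -1, 4, -1, 5])  # made-up sample data

# Element-by-element in Python: keep positive values, replace the rest with 0.
from_list = np.array([v if v > 0 else 0 for v in values])

# The same selection done in one vectorized call.
from_where = np.where(values > 0, values, 0)

print(from_where)
print(np.array_equal(from_list, from_where))
```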

Benjamin Horstman 2009-05-12 20:33:33
