ansaurus

Question

numpy : How to convert an array type quickly

Answer 1

+3 A:

When you use imgarray = imgarray.astype('B'), you get a copy of the array, cast to the specified type. This requires extra memory allocation, even though you immediately flip imgarray to point to the newly allocated array.

If you use imgarray.view('uint8'), then you get a view of the array. This uses the same data except that it is interpreted as uint8 instead of imgarray.dtype. (np.dot returns a uint32 array, so after the np.dot, imgarray is of type uint32.)

The problem with using view, however, is that a 32-bit integer becomes viewed as 4 8-bit integers, and we only care about the value in the last 8-bits. So we need to skip to every 4th 8-bit integer. We can do that with slicing:

imgarray.view('uint8')[:,::4]

IPython's %timeit command shows there is a significant speed up doing things this way:

In [37]: %timeit imgarray2 = imgarray.astype('B')
10000 loops, best of 3: 107 us per loop

In [39]: %timeit imgarray3 = imgarray.view('B')[:,::4]
100000 loops, best of 3: 3.64 us per loop

unutbu 2009-12-11 17:34:10

Can I save this view to a file

shodanex 2009-12-11 17:42:21

@shodanex: Yes, you could use np.save(). See http://docs.scipy.org/doc/numpy-1.3.x/reference/generated/numpy.save.html

unutbu 2009-12-11 18:08:40

@shodanex: For other format options, see also http://docs.scipy.org/doc/numpy-1.3.x/reference/routines.io.html

unutbu 2009-12-11 18:09:20

it is then implicitly architecture-dependent, since which slice to use depends on endianness.

kaizer.se 2009-12-13 10:59:21

@kaizer.se: Yes, that's true. Do you know a nice way to make the code non-architecture-dependent?

unutbu 2009-12-13 12:55:55

ansaurus

tags:

views:

answers:

numpy : How to convert an array type quickly

related questions