ansaurus

Question

Building an interleaved buffer for pyopengl and numpy

Answer 1

+1 A:

The reason that create_array_1 is so much faster seems to be that the items in the (python) list all point to the same object. You can see this if you test:

print (ibuffer[0] is ibuffer[1])

inside the subroutines. In create_array_1 this is true (before you create the numpy array), while in create_array_2 this is always going to be false. I guess this means that data conversion step in the array conversion only has to happen once in create_array_1, while it happens 4096 times in create_array_2.

If this is the reason, I guess the timings will be different if you make render generate random data. Create_array_5 is slowest as it makes a new array each time you add data to the end.

Andrew Walker 2010-02-28 20:55:15

Good point! My aim was to produce predictable output so I could ensure they were all working correctly. I've added a switch to force render() to produce random data and create_array_1 is still the fastest.

Nick Sonneveld 2010-03-01 00:45:43

This has me confused. Potentially two more things to check: Does anything change if you add dtype=float32 to the numpy.empty calls? Does order='C' or order='F' matter. As far as I can see these won't change anything, but I've already been surprised once.

Andrew Walker 2010-03-01 21:27:34

Well adding dtype=float32 doesn't seem to help and neither does forcing create_array_x to share the same buffer which suggests maybe its just slow to modify values within a numpy array.

Nick Sonneveld 2010-03-02 01:47:54

(I also tried playing with order='?' as well with no effect)

Nick Sonneveld 2010-03-02 01:49:16

Answer 2

A:

I know it seems strange, but have you tried fromfile?

Dmitry Kochkin 2010-03-01 20:55:08

This is potentially dynamic data. Do you mean to save static data to disk and reload when needed? Or to create some file like interface and pass it to numpy.fromfile ?

Nick Sonneveld 2010-03-02 01:31:25

I think both choices are possible. You don't need to create file like interface - StringIO are enough for that.

Dmitry Kochkin 2010-03-02 09:28:54

Answer 3

+1 A:

The benefit of numpy is not realized by simply storing the data in an array, it is achieved by performing operations across many elements in an array instead of one by one. Your example can be boiled down and optimized to this trivial solution with orders of magnitude speedup:

numpy.random.standard_normal(4096*20)

...that's not very helpful, but it does kind of hint at where the costs are.

Here is an incremental improvement that beats the list append solution (but only slightly) by eliminating the iteration over 4096 elements.

xs = numpy.arange(4096)
render2 = numpy.vectorize(render)

def create_array_7():
    ibuffer = STATIC_BUFFER
    for i, a in enumerate(render2(xs)):
        ibuffer[i::20] = a
    return ibuffer

... but not the speedup we are looking for.

The real savings will be realized by a recasting of the render routine so that you don't have to create a python object for every value that ends up being placed in the buffer. Where does tex_left, tex_right...etc. come from? Are they calculated or read?

bpowah 2010-03-02 06:00:34

Tex coords are generated based on tile index but they could be cached as numpy array. The vertices are potentially updated every frame though. Also, I suppose the vertices and tex coords don't have to be interleaved either.

Nick Sonneveld 2010-03-02 12:52:59

ooh, looking into numpy's indexing, I learned that you can use an array as an index to another. eg:>>> tiles = numpy.array( [0,1,1,1,1,2,2,2,3,3,3] )>>> texcoords = numpy.array( [ [33,20], [44,50], [55,60], [66,70], [77,80] ] )>>> texcoords[tiles]array([[33, 20], [44, 50], [44, 50], [44, 50], [44, 50], [55, 60], [55, 60], [55, 60], [66, 70], [66, 70], [66, 70]])

Nick Sonneveld 2010-03-03 03:52:56

ansaurus

tags:

views:

answers:

Building an interleaved buffer for pyopengl and numpy

related questions