ansaurus

Question

Fast 4x4 matrix multiplication in Java with NIO float buffers

Answer 1

+2 A:

Go through the FloatBuffer.array() if that operation is supported. Then just perform the necessary multiplications through that array, and return the resulting matrix.

Have a look at GameDev.net - Matrix Math for the exact computations.

~~If you want to optimize it further, you could try out Strassens Algorithm. You wouldn't even need to pad your matrices, since they are square and of a size that is a power of 2.~~

aioobe 2010-05-09 11:35:29

As the wikipedia article on Strassen says: "Practical implementations of Strassen's algorithm switch to standard methods of matrix multiplication for small enough submatrices, for which they are more efficient. The particular crossover point for which Strassen's algorithm is more efficient depends on the specific implementation and hardware. It has been estimated that Strassen's algorithm is faster for matrices with widths from 32 to 128 for optimized implementations."

janneb 2010-05-09 12:38:09

Good point. Thanks!

aioobe 2010-05-09 12:39:45

Answer 2

+5 A:

The real answer is of course to test different implementations and check which one is fastest.

My guess, without testing, would be that as the matrices are so small, expanding the loops by hand would result in the fastest code. E.g. something like

result[0][0] = this[0][0] * matrix2[0][0] + this[0][1] * matrix2[1][0] 
             + this[0][2] * matrix2[2][0] + this[0][3] * matrix2[3][0];
result[0][1] = // ... and so forth

or then maybe just unroll the innermost loop, and retain the two outermost ones to save some typing as well as I$.

janneb 2010-05-09 12:49:36

Note that the JIT compiler is quite good at unrolling loops where necessary, so you might find there's not much in it.

Neil Coffey 2010-05-09 17:35:32

ansaurus

tags:

views:

answers:

Fast 4x4 matrix multiplication in Java with NIO float buffers

related questions