views: 125
answers: 2

I'm working with a piece of hardware (the hardware itself is not important), and I need to split some block data into separate pieces in order to make the thing run faster.

So I have, for example, a contiguous block of memory X words long. For visualization, I'm arranging it into 50-word lines below:

001  002  003  004  005 006 007 ...
051  052  053  054  055 056 057 ...
101  102  103  104  105 106 107 ...
151  152  153  154  155 156 157 ...

I need a fast way of splitting these into four separate blocks:

Block1

001  003  005 007 ...
101  103  105 107 ...

Block2

002  004  006 ...
102  104  106 ...

Block3

051  053  055 057 ...
151  153  155 157 ...

Block4

052  054  056 ...
152  154  156 ...

Or, basically:

Block1   Block2   Block1   Block2 ...
Block3   Block4   Block3   Block4 ...
Block1   Block2   Block1   Block2 ...
Block3   Block4   Block3   Block4 ...

Now, doing this is as simple as using for-loops, but what is a more optimized/parallel way of doing it? (No MPI stuff; this happens in an app running on the desktop.)
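
The kind of straightforward loop I mean looks something like this (a rough sketch only; 32-bit words, the 50-word row pitch from above, and an even number of rows are all assumptions, and the names are just for illustration):

#include <stdint.h>
#include <stddef.h>

#define ROW_WORDS 50   /* words per row, as in the layout above */

/* src: the contiguous input block; rows: total number of rows;
   dst[0..3]: the four output buffers, each (rows/2)*(ROW_WORDS/2) words */
void split_blocks(const uint32_t *src, size_t rows, uint32_t *dst[4])
{
    size_t count[4] = { 0, 0, 0, 0 };

    for (size_t r = 0; r < rows; ++r)
        for (size_t c = 0; c < ROW_WORDS; ++c) {
            /* even row, even col -> Block1; even row, odd col -> Block2;
               odd row,  even col -> Block3; odd row,  odd col -> Block4 */
            int b = (int)((r & 1) * 2 + (c & 1));
            dst[b][count[b]++] = src[r * ROW_WORDS + c];
        }
}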

So summing it up, just to be clear:

  1. I have data as shown above.

  2. I'm sending this data to several devices (outside the PC). This data needs to be sent down the wire as 4 separate blocks (one to each device).

Thanks!

A: 

EDIT: It sounds like you're passing the data to an external interface. If that interface is anything as slow as gigabit Ethernet, then the bottleneck will be the wire, not how quickly you can compose the data. Just iterate over the data to build your blocks in whatever manner is convenient for your code.


Perhaps what you want to do is pass the blocks around using an offset/stride notation. Essentially, each block is described by its starting address, the offset into the block where its first element appears, the number of words between consecutive elements, and the number of words between rows. So, something like:

       Block
         1    2    3    4
base     0    0    50   50
first    0    1    0    1
offset   2    2    2    2
stride   100  100  100  100

So you could work on the data in parallel (assuming you don't have to worry about writes) with something like this:

struct Block {
    int base;    /* index of the block's first row in the buffer */
    int first;   /* index of the block's first element within that row */
    int offset;  /* words between consecutive elements of the block */
    int stride;  /* words between consecutive rows of the block */
    int cols;
    int rows;
};

/* given some reasonable block[n] and buffer */

for (int row = 0; row < block[n].rows; ++row)
    for (int col = 0; col < block[n].cols; ++col)
    {
        int cell = buffer[
                      block[n].base + 
                      block[n].first +
                      row*block[n].stride + 
                      col*block[n].offset];
        doSomething(cell);
    }
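
To actually build a contiguous buffer per block (rather than just visiting the cells), and since the four blocks never write to the same output memory, each block descriptor can be handed to its own thread. A rough sketch, using OpenMP purely as an illustration (any thread pool would do; the function and parameter names here are made up):

/* buffer: the original contiguous data; block[0..3]: descriptors as above;
   out[0..3]: one contiguous destination buffer per block.
   Compile with OpenMP enabled, e.g. -fopenmp. */
void extract_blocks(const int *buffer, const struct Block block[4], int *out[4])
{
    #pragma omp parallel for
    for (int n = 0; n < 4; ++n) {
        int k = 0;
        for (int row = 0; row < block[n].rows; ++row)
            for (int col = 0; col < block[n].cols; ++col)
                out[n][k++] = buffer[block[n].base +
                                     block[n].first +
                                     row * block[n].stride +
                                     col * block[n].offset];
    }
}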
TokenMacGuy
thanks! this is fine for manipulating the blocks inside the app, but not good when it is passed to the hardware. the hardware, having little resources, needs contiguous data for each block.
moogs
gigabit ethernet is slow?
moogs
No, it isn't very slow, but there are interconnects that are fast enough that the processor can have a hard time keeping them fed; they are all much faster than gigabit Ethernet. PCI Express, Myrinet, and some varieties of Fibre Channel are all examples of this.
TokenMacGuy
in this case, the ethernet is far from being the bottleneck. thanks.
moogs
+1  A: 

This is a prime example of where SSE can help you. It's very good at data shuffling as well as at streaming data from memory and back. On some non-x86 architectures, similar ISA extensions are available (e.g. AltiVec).
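
For example, a rough sketch of deinterleaving the even- and odd-indexed 32-bit words of one row with SSE2 intrinsics (untested; assumes the row length is a multiple of 8 words, and the function and parameter names are just for illustration):

#include <emmintrin.h>   /* SSE2 */
#include <stdint.h>
#include <stddef.h>

/* Split one row of 32-bit words into its even-indexed and odd-indexed elements. */
void deinterleave_row(const uint32_t *src, uint32_t *even, uint32_t *odd, size_t n)
{
    for (size_t i = 0; i < n; i += 8) {
        __m128i a = _mm_loadu_si128((const __m128i *)(src + i));     /* w0 w1 w2 w3 */
        __m128i b = _mm_loadu_si128((const __m128i *)(src + i + 4)); /* w4 w5 w6 w7 */

        /* move even lanes into the low half and odd lanes into the high half */
        a = _mm_shuffle_epi32(a, _MM_SHUFFLE(3, 1, 2, 0));           /* w0 w2 w1 w3 */
        b = _mm_shuffle_epi32(b, _MM_SHUFFLE(3, 1, 2, 0));           /* w4 w6 w5 w7 */

        _mm_storeu_si128((__m128i *)(even + i / 2), _mm_unpacklo_epi64(a, b)); /* w0 w2 w4 w6 */
        _mm_storeu_si128((__m128i *)(odd  + i / 2), _mm_unpackhi_epi64(a, b)); /* w1 w3 w5 w7 */
    }
}

The same pattern, applied to the even and odd rows separately, yields the four blocks from the question.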

MSalters
got resources/links/books to start on?
moogs
Intel's x86 manual Volume 1 Chapter 9 seems the logical one.
MSalters