I have an MPI implementation of IDW2-based gridding on a set of sparsely sampled points. I have divided the work up as follows:
- All nodes read all the data; the last node does not actually need to, but that's fine.
- Node 0 takes each data point and sends it round-robin to nodes 1...N-2 with the following code:
    int nodes_in_play = NNodes - 2;
    for (int i = 0; i < data_size; i++) {
        int dest = (i % nodes_in_play) + 1;
        //printf("Point %d of %d going to %d\n", i+1, data_size, dest);
        Error = MPI_Send(las_points[i], 3, MPI_DOUBLE, dest, PIPE_MSG, MPI_COMM_WORLD);
        if (Error != MPI_SUCCESS) break;
    }
- Nodes 1...N-2 perform the IDW-based estimates:
    for (int i = 0; i <= data_size - nodes_in_play; i += nodes_in_play) {
        Error = MPI_Recv(test_point, 3, MPI_DOUBLE, 0, MPI_ANY_TAG, MPI_COMM_WORLD, &status);
        if (status.MPI_TAG == END_MSG) break;
        // ... IDW2 code ...
        Error = MPI_Send(&zdiff, 1, MPI_DOUBLE, NNodes-1, PIPE_MSG, MPI_COMM_WORLD);
    }

- The last node (rank N-1) receives the results and serializes them to the output file.
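For completeness, the writer loop on the last node looks roughly like this (a simplified sketch, not my exact code; `expected_results` and `fp` are placeholders for my actual bookkeeping and output handling):

    // Last node (rank NNodes-1): collect one zdiff per processed point and write it out.
    // "expected_results" stands in for however many results it waits for,
    // "fp" for the already-opened output file.
    for (int i = 0; i < expected_results; i++) {
        double zdiff;
        Error = MPI_Recv(&zdiff, 1, MPI_DOUBLE, MPI_ANY_SOURCE, PIPE_MSG,
                         MPI_COMM_WORLD, &status);
        if (Error != MPI_SUCCESS) break;
        fprintf(fp, "%f\n", zdiff);
    }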
This works fine for 3 nodes, but with more nodes the IDW loop is off by a few iterations because of the tricky loop boundaries, and the overall run gets stuck. What would be a simple way to run the receive → process → send tasks in the in-between nodes? I am looking for a nifty one-line for loop.
What I have done:
Against my better judgement, I have added a while(1) loop in the intermediate nodes, with an exit condition when a message with the END_MSG tag is received. Node 0 sends an END_MSG message to all intermediate nodes once all the points have been sent off.
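In code, that workaround looks roughly like this (a sketch of what I mean rather than my exact code; the dummy payload is just a placeholder, only the tag matters):

    // Intermediate nodes: loop until node 0 signals completion via the END_MSG tag.
    while (1) {
        Error = MPI_Recv(test_point, 3, MPI_DOUBLE, 0, MPI_ANY_TAG,
                         MPI_COMM_WORLD, &status);
        if (status.MPI_TAG == END_MSG) break;
        // ... IDW2 code ...
        Error = MPI_Send(&zdiff, 1, MPI_DOUBLE, NNodes-1, PIPE_MSG, MPI_COMM_WORLD);
    }

    // Node 0, after the distribution loop: tell every intermediate node it is done.
    double dummy[3] = {0, 0, 0};   // payload is ignored; only END_MSG is checked
    for (int dest = 1; dest <= nodes_in_play; dest++)
        MPI_Send(dummy, 3, MPI_DOUBLE, dest, END_MSG, MPI_COMM_WORLD);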