My question is about how best to structure my (C++) code to support parallelizing a time-consuming computation. The pseudocode in question has the following structure:
for a_1(y_1) in A_1
    for a_2(y_2) in A_2(a_1)
        ...
        for a_n(y_n) in A_n(a_1, ..., a_{n-1})
            y_n = f(a_1, ..., a_n)
        y_{n-1} = g_n(Y_n)
        ...
    y_1 = g_2(Y_2)
Roughly speaking, each loop iterates over elements in a set A_i, the successive elements of which depend on feedback y_i from previous iterations. In other words, to determine the next a_i, we must have finished all computations on the current a_i. Furthermore, the interior sets depend on the outer iterations. Written in recursive form:
Iterate(A_i, a_1, ..., a_{i-1}):
    for a_i(y_i) in A_i
        Y_i += Iterate(A_{i+1}, a_1, ..., a_i)
    return g_i(Y_i)

Iterate(any, a_1, ..., a_n):
    return f(a_1, ..., a_n)

Iterate(A_1)
Assume that f(...) is a time-consuming computation and that the feedback functions g(...) are simple (fast). Now, if all the sets A_i are "large", then the problem is embarrassingly parallel. Currently, I have a thread pool and just toss the computations of the inner-most loop into the pool. The problem is that very often the inner-most loop iterates over a singleton, so the thread pool only ever has one running thread in it. I have thought about using futures to return values to the outer loops, but that would require futures of futures, etc., and it gets pretty messy (I think).
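To sketch what I mean by messy: if each level handed its results to the level above through std::future without blocking, the types would have to nest one future per loop level. Purely illustrative, for the n = 3 case:

#include <future>
#include <vector>
using namespace std;

using Fut3 = future<double>;        // one pending evaluation of f
using Fut2 = future<vector<Fut3>>;  // one pending pass of the a_3 loop
using Fut1 = future<vector<Fut2>>;  // one pending pass of the a_2 loop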
I realize that the structure I have listed above is pretty complicated, so there are a number of simplifying cases I am also interested in:

1. a_i(y_i) = a_i; independent of y_i
2. A_i(a_1, ..., a_{i-1}) = A_i; independent of a_1, ..., a_{i-1}
3. g_i = 0; the feedback y_{i-1} is independent of Y_i
4. All outer loops are "large": the number of elements in those sets is much greater than the number of cores.
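For instance, when items 1-3 all hold, the nest degenerates into independent loops over fixed sets, and every evaluation of f can be dispatched up front (a sketch only, with std::async standing in for a real pool; the sets A1, A2, A3 are hypothetical stand-ins):

#include <future>
#include <vector>
using namespace std;

double f(double a1, int a2, double a3); // the slow function

// No feedback and fixed sets: flatten the whole iteration space,
// dispatch every call to f, and join once at the end.
vector<double> run_all(const vector<double> &A1, const vector<int> &A2,
                       const vector<double> &A3){
    vector<future<double>> tasks;
    for(double a1 : A1)
        for(int a2 : A2)
            for(double a3 : A3)
                tasks.push_back(async(launch::async, f, a1, a2, a3));
    vector<double> results;
    for(auto &t : tasks){ results.push_back(t.get()); } // single join point
    return results;
}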
In practice, n <= 3; item 1 holds for all the outer loops, and items 2-4 all hold, so solutions particular to that case are sufficient. But since I am bothering to ask the question here, I am interested in ideas for handling the additional complexity of more general problems, if possible.
Edit:
Cleaned up the first pseudocode block to make it consistent with the other. Since people cannot understand my mathematical notation, here is a more concrete, simple example (the h variables play the role of the y_i feedback values):
#include <cmath>
#include <iostream>
#include <vector>
using namespace std;

double f(double a1, int a2, double a3){ // Very slow function
    cout << a1 << ", " << a2 << ", " << a3 << endl;
    return pow(a1*a3, a2) + a1 + a2 + a3; // just some contrived example
}
int g2(const vector<double> &Y3){ // average-ish feedback
    double sum = 0;
    for(size_t i = 0; i < Y3.size(); ++i){ sum += Y3[i]; }
    return int(sum / (Y3.size() + 1));
}
double g1(const vector<int> &Y2){ // return 1/(min(min(Y2), 0) + 1.0)
    int minval = 0;
    for(size_t i = 0; i < Y2.size(); ++i){
        if(Y2[i] < minval){ minval = Y2[i]; }
    }
    return 1.0/(minval + 1.0);
}
int main(){
    for(double a1 = 0.0, h1 = 10.0; a1 < 1.0; a1 += h1){ // for a1 in A1
        vector<int> Y2;
        for(int a2 = 2, h2 = 1; a2 <= (int)(5*a1+10); a2 += h2){ // for a2 in A2(a1)
            vector<double> Y3;
            for(double a3 = 6.0, h3 = 1.0; a3 >= (a1+a2); a3 -= 0.5*h3){ // for a3 in A3(a1, a2)
                h3 = f(a1, a2, a3); // slow result feeds this loop's increment
                Y3.push_back(h3);
            }
            h2 = g2(Y3); // feedback into the a2 loop's increment
            Y2.push_back(h2);
        }
        h1 = g1(Y2); // feedback into the a1 loop's increment
    }
    return 0;
}
I picked the values randomly, and it turns out f is only evaluated 3 times. Note that the above code is NOT parallelizable as written, since each loop's increment depends on results computed in its body. Assume that it is possible to query whether a loop's increment depends on the loops higher up.
I should also clarify what I am after. When I originally said "structure", I perhaps should have said "parallelization methodology" or something like that. For example, my first attempt at parallelizing was to throw the inner-most calls to f into a thread pool and join at the end of the inner-most loop, as sketched below. As mentioned above, this does not work when the inner-most loop iterates over only one element. It did not require restructuring the existing code significantly, though, and I would like to keep it that way if possible.
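Roughly, that attempt looked like this (a sketch only, with std::async standing in for my real thread pool, and assuming the inner-most set is feedback-free so the calls can be issued independently):

#include <future>
#include <vector>
using namespace std;

double f(double a1, int a2, double a3); // the slow f from the example above

// Inner-most loop only: dispatch every f, join, then apply the fast g2.
int inner(double a1, int a2){
    vector<future<double>> pending;
    for(double a3 = 6.0; a3 >= a1 + a2; a3 -= 0.5){ // fixed step, no h3 feedback
        pending.push_back(async(launch::async, f, a1, a2, a3));
    }
    vector<double> Y3;
    for(auto &p : pending){ Y3.push_back(p.get()); } // join at end of the loop
    double sum = 0;
    for(size_t i = 0; i < Y3.size(); ++i){ sum += Y3[i]; }
    return int(sum / (Y3.size() + 1)); // g2 from above
}
// If A3 is a singleton, "pending" only ever holds one task, so the pool
// runs a single thread while the outer loops wait on it.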