ansaurus

Question

How to create an `omp parallel for` with synchronization (`barrier`) of all threads in the middle with OpenMP

Answer 1

+2 A:

One problem I see with this code, is that the code does not comply with the spec :)

If you need all do_step_one()'s to end, you'll need something like the following:

#pragma omp parallel for
for(unsigned int i=0; i<N; i++){
     do_step_one(i);
}

#pragma omp parallel for
for(unsigned int i=0; i<N; i++){
     do_step_two(i);
}

The result of this would be a parallelism of the first for, and then a parallelism of the second for.

Anna 2009-11-10 18:42:28

I have forgotten most of my OMP work - does this method still maintain the threads or does it need to recreate them for the second `parallel for`?

Gavin Miller 2009-11-10 18:47:13

I'm not sure about that. It is a matter of inner implementation. Formally, it can create the threads again for the second loop, but I think that they might have optimizations avoiding that.

Anna 2009-11-10 19:00:50

Oh, I think that I understand now where you're getting at - the first for will have to end for the second one to start, since the parallelism is made per block.To be extra certain (it will probably have no effect), it is possible to place a barrier between the two loops.

Anna 2009-11-10 19:03:12

The end of a parallel for has an implicit block on it. That is, the master thread waits for all threads to complete. I'm just thinking that this version would incur the overhead of creating the threads a second time _if_ OMP doesn't maintain the threads. Whether that makes a difference depends upon the implementation of `do_step_one()` and `do_step_two()`

Gavin Miller 2009-11-10 19:31:38

You're right, but my guess is that they have a smart implementation, that doesn't just kill and re-create threads, but re-use them in the future. Anyway, its easy to check it when running the program (for example by using top on linux).

Anna 2009-11-10 19:43:39

This solution is right, thanks. By the way I need the possibilities to disable the implicit barrier at the end of the first parallel for (it's for learning purpose, I want to show to my student why they need synchronisation between threads). Thank.

Guillaume Bouchard 2009-11-11 14:55:05

You can go with a different approach - make a loop to go 2N times, doing step one for the first n, and step 2 for n+1 ... 2n.Parallelise this for like in the example above.

Anna 2009-11-11 19:50:35

ansaurus

tags:

views:

answers:

How to create an `omp parallel for` with synchronization (`barrier`) of all threads in the middle with OpenMP

related questions