ansaurus

Question

Answer 1

+3 A:

Your pattern does not seem to be correct. Consider the case where thread #1 executes until just after the first #pragma flush. Control then switches to thread #2, which goes on and creates `c`, and control is taken back just before the second #pragma flush. Now the first thread wakes up and creates the child anew.

Edit: sorry, wrong: it will be unable to take the lock.

Edit 2: no, still correct, because the value will not be flushed in thread #1.

Vlad 2010-03-03 15:42:43

I added another revision. Though I must admit the day has been long and I am slowly running out of brain-flops.

phresnel 2010-03-03 15:58:36

I think we still need a barrier between `create (c);` and `childCreated[c] = true;`, otherwise they may be reordered (and a thread #3 may start using `c` which is still not created).

Vlad 2010-03-03 16:05:44
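The reordering concern in the comment above can be sketched as follows. This is a minimal, hedged illustration in the flush-based style discussed in this thread, not the asker's actual code: the arrays `children` and `childCreated`, the counter `createCalls`, the critical-section name `childCreation`, and the helpers `getChild` and `demo` are all hypothetical stand-ins. Strictly speaking, the unlocked read of the flag is still a data race under the C++ memory model, so treat this as a sketch of the idea only.

```cpp
#include <cassert>

// Hypothetical stand-ins for the names used in the discussion.
const int kChildren = 4;
int  children[kChildren];      // lazily "created" objects
char childCreated[kChildren];  // one flag per child
int  createCalls = 0;          // how often create() actually ran

void create(int c) {
    children[c] = c * 10 + 1;  // pretend this is an expensive construction
    ++createCalls;             // only ever executed inside the critical section
}

int getChild(int c) {
    assert(c >= 0 && c < kChildren);
    #pragma omp flush                    // see the latest published flag
    if (!childCreated[c]) {              // first, unlocked check (fast path)
        #pragma omp critical(childCreation)
        {
            if (!childCreated[c]) {      // second check, now under the lock
                create(c);
                // Flush without a variable list: the writes made by create()
                // must be visible before the flag is set, otherwise another
                // thread could see the flag and use a half-built child.
                #pragma omp flush
                childCreated[c] = 1;
                #pragma omp flush        // publish the flag itself
            }
        }
    }
    #pragma omp flush                    // do not read the child before the flag
    return children[c];
}

// Requests every child many times from several threads and
// returns the number of times create() was actually executed.
int demo() {
    for (int i = 0; i < kChildren; ++i) { children[i] = 0; childCreated[i] = 0; }
    createCalls = 0;
    #pragma omp parallel for
    for (int i = 0; i < 1000; ++i) {
        (void)getChild(i % kChildren);
    }
    return createCalls;
}
```

Compiled without OpenMP support the pragmas are ignored and the code simply runs serially. Newer OpenMP versions also provide `#pragma omp atomic read` and `#pragma omp atomic write`, which may be a cleaner way to publish the flag.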

Upon adding a third revision, which is only a double-checked pattern again, I realized exactly that :) But when I look at my third revision, it somehow looks too trivial to be correct. And drinking more coffee is not an option anymore, as I am already saturated on that :/

phresnel 2010-03-03 16:16:10

Isn't your revision #3 basically the same as the code on page 12 of the PDF you referred to?

Vlad 2010-03-03 16:23:29

I think there the barrier is between entering the function and the first check, but my first barrier is after the first check.

phresnel 2010-03-03 16:29:28

The second `#pragma` must indeed be a barrier: it's not enough to just flush `childCreated[c]`; `c` must already be fully created at this point from the point of view of the other threads.

Vlad 2010-03-03 16:40:58

Aside from that, I currently can't see any problems.

Vlad 2010-03-03 16:42:17

Still, I can see one: it's not a _triple_-checked locking any more :-)

Vlad 2010-03-03 16:43:10

It still makes me nervous that this code looks too trivial, and that the authors of the PDF (and of many other places where double-checked locking is discussed) don't just introduce another primitive variable like I do. Admittedly, there are also solutions with additional thread-private booleans, but it seems nothing gets as easy as my third code ... Now I only need to find out how a proper barrier looks in OpenMP (and not one where **all** threads must reach that barrier) :) Thank you very much for your help! (edit: on the other hand, I'll open up another question on that)

phresnel 2010-03-03 20:23:30

About the memory barrier in OpenMP, maybe this will be helpful: http://msdn.microsoft.com/en-us/library/sz9sd6et(VS.80).aspx

Vlad 2010-03-03 21:09:32

Even though I am no Microsoft fan, I had this exact tab open for a long time yesterday, plus the original documentation at http://openmp.org/wp/openmp-specifications/ , and, as always, Wikipedia with these two interesting articles: http://en.wikipedia.org/wiki/Memory_barrier , http://en.wikipedia.org/wiki/Memory_ordering#Compiler_memory_barrier

phresnel 2010-03-04 06:26:33

Indeed, looking at the assembly, `#pragma omp flush` does the same as `__sync_synchronize();` as in that wiki article, and thus both also issue a compiler-level reordering barrier. I think this case is closed :)

phresnel 2010-03-04 09:21:11
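The comparison in the comment above can be sketched with two equivalent-looking publish helpers. This is only an illustration under assumptions: it presumes GCC or Clang, where `__sync_synchronize()` is available as a builtin full barrier, and the names `payload`, `ready`, `publish`, `publishOmp`, and `consumeAfterPublish` are hypothetical. Whether both forms compile to the same instructions depends on the compiler and target; the observation in the thread comes from inspecting the assembly.

```cpp
#include <cassert>

int payload = 0;  // data to hand over to other threads
int ready   = 0;  // flag announcing that payload is valid

// Variant 1: GCC/Clang builtin full memory barrier (assumed toolchain).
void publish(int value) {
    payload = value;
    __sync_synchronize();  // keeps the payload write ahead of the flag write
    ready = 1;
}

// Variant 2: OpenMP flush without a variable list,
// which flushes all thread-visible variables.
void publishOmp(int value) {
    payload = value;
    #pragma omp flush
    ready = 1;
}

// Reader side: expects a publish to have happened, then issues a barrier
// before reading the payload so the read cannot move ahead of the flag check.
int consumeAfterPublish() {
    assert(ready == 1 && "call publish() or publishOmp() first");
    __sync_synchronize();
    return payload;
}
```

The point of either variant is the ordering between the two stores, not the stores themselves; without OpenMP enabled, the pragma in the second variant is ignored and provides no ordering at all.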

@phresnel: I am not an MS fan either, but I happen to work in a Windows environment. And this link was among the first that Google suggested. :)

Vlad 2010-03-04 10:06:44

Triple checked locking?