ansaurus

Question

Answer 1

+10 A:

The problem could be in your test:

if (x < y)

The thread could evaluate x and not get around to evaluating y until much later.

Michael Burr 2010-10-25 06:03:19

I guess that is the problem, the question is why? Actually sequentially consistent atomic operations must prevent this, mustn't they?

confucius 2010-10-25 06:15:48

Thanks a lot! The problem is the unspecified order of expression evaluation. So simple :)

confucius 2010-10-25 06:58:46

@confucius: while your scenario might depend on the order in which the variables are read, the more general issue is that reading two different atomic instances isn't atomic as a whole.

Michael Burr 2010-10-25 16:35:31

@Michael Burr: Of course. I think the misleading line in this code was the (x < y) condition, which hides the implicit load operations. Writing (x.load() < y.load()) instead would make them easier to see.
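To illustrate the hidden loads, here is a minimal sketch. It assumes x and y are std::atomic<int> (the original code is not shown here), and the function names are hypothetical. Both functions perform two separate atomic loads; the second one just makes them visible.

```cpp
#include <atomic>

// Implicit form: each operand is converted through std::atomic<int>::operator int(),
// which performs a sequentially consistent load.
bool less_implicit(const std::atomic<int>& x, const std::atomic<int>& y) {
    return x < y;
}

// Explicit form: the same two loads, spelled out.
bool less_explicit(const std::atomic<int>& x, const std::atomic<int>& y) {
    return x.load() < y.load();
}
```

In both versions the order in which the two loads happen is left to the compiler, and the pair of loads is not one atomic operation.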

confucius 2010-10-26 05:39:24

Answer 2

+10 A:

There is a problem with the comparison:

x < y

The order of evaluation of subexpressions (in this case, of x and y) is unspecified, so y may be evaluated before x or x may be evaluated before y.

If x is read first, you have a problem:

x = 0; y = 0;
t2 reads x (value = 0);
t1 increments x; x = 1;
t1 increments y; y = 1;
t2 reads y (value = 1);
t2 compares x < y as 0 < 1; test succeeds!
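The interleaving above can be replayed deterministically on a single thread. This is only a sketch: it assumes x and y are std::atomic<int> and that t1 increments x before y, and the function name is made up for illustration.

```cpp
#include <atomic>

// Replays the trace step by step; returns what t2's comparison evaluates to.
bool replay_x_read_first() {
    std::atomic<int> x{0}, y{0};

    int xval = x.load();  // t2 reads x (value = 0)
    x.fetch_add(1);       // t1 increments x; x = 1
    y.fetch_add(1);       // t1 increments y; y = 1
    int yval = y.load();  // t2 reads y (value = 1)

    return xval < yval;   // 0 < 1
}
```

Every individual operation here is atomic, yet the comparison still succeeds because the two reads straddle both increments.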

If you explicitly ensure that y is read first, you can avoid the problem:

int yval = y;
int xval = x;
if (xval < yval) { /* ... */ }
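A fuller sketch of that fix, under assumptions: the original test is not shown here, so this assumes one writer thread that increments x and then y with the default sequentially consistent ordering, and one reader thread. The name count_violations and the done flag are hypothetical.

```cpp
#include <atomic>
#include <thread>

// Returns how many times the reader observed x < y.
int count_violations(int iterations) {
    std::atomic<int> x{0}, y{0};
    std::atomic<bool> done{false};
    int violations = 0;  // written only by the reader, read after join()

    std::thread writer([&] {
        for (int i = 0; i < iterations; ++i) {
            x.fetch_add(1);  // x is always incremented first
            y.fetch_add(1);
        }
        done.store(true);
    });

    std::thread reader([&] {
        while (!done.load()) {
            int yval = y.load();  // read y first...
            int xval = x.load();  // ...then x, which can only have grown since
            if (xval < yval) ++violations;
        }
    });

    writer.join();
    reader.join();
    return violations;
}
```

As long as the counters do not overflow, count_violations(100000) should return 0: any value read from y was preceded by the matching increment of x, and x is read afterwards.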

James McNellis 2010-10-25 06:20:25

Thanks! That's it. Sorry guys, I can't upvote: https is blocked in the office and I can't log in :(

confucius 2010-10-25 06:27:38

Answer 3

A:

First, I agree with "Michael Burr" and "James McNellis". Your test is not fair, and there's a legitimate possibility that it fails. However, even if you rewrite the test the way "James McNellis" suggests, it may still fail.

The first reason is that you don't use volatile semantics, so the compiler may apply optimizations to your code that would be fine in the single-threaded case.

But even with volatile your code is not guaranteed to work.

I think you don't fully understand the concept of memory reordering. Actually, memory read/write reordering can occur at two levels:

1. The compiler may reorder the read/write instructions it generates.
2. The CPU may execute memory read/write instructions out of program order.

Using volatile prevents (1). However, you've done nothing to prevent (2): memory access reordering by the hardware.

To prevent this you should put special memory fence instructions in the code (these are directed at the CPU, unlike volatile, which affects only the compiler).

On x86/x64 there are several memory fence instructions. Also, every instruction with lock semantics acts as a full memory fence by default.
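For illustration only, here is how explicit fences might look in portable C++. This sketch swaps in std::atomic_thread_fence from the standard library instead of raw x86 fence instructions, and it assumes a writer that increments x then y and a reader that loads y then x; the function name is hypothetical. All accesses are relaxed, so the ordering comes from the fences alone.

```cpp
#include <atomic>
#include <thread>

// Returns how many times the reader observed x < y.
int count_violations_with_fences(int iterations) {
    std::atomic<int> x{0}, y{0};
    std::atomic<bool> done{false};
    int violations = 0;  // written only by the reader, read after join()

    std::thread writer([&] {
        for (int i = 0; i < iterations; ++i) {
            x.fetch_add(1, std::memory_order_relaxed);
            // Keeps the increment of x ordered before the increment of y.
            std::atomic_thread_fence(std::memory_order_release);
            y.fetch_add(1, std::memory_order_relaxed);
        }
        done.store(true, std::memory_order_release);
    });

    std::thread reader([&] {
        while (!done.load(std::memory_order_acquire)) {
            int yval = y.load(std::memory_order_relaxed);
            // Keeps the load of y ordered before the load of x.
            std::atomic_thread_fence(std::memory_order_acquire);
            int xval = x.load(std::memory_order_relaxed);
            if (xval < yval) ++violations;
        }
    });

    writer.join();
    reader.join();
    return violations;
}
```

The release fence before each write to y pairs with the acquire fence after each read of y, so a reader that sees a given value of y also sees the matching increment of x.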

More information here:

http://en.wikipedia.org/wiki/Memory_barrier

valdo 2010-10-25 07:02:15

The C++0x atomics guarantee correct memory ordering.

James McNellis 2010-10-25 07:04:36

valdo - there is no need to use volatile here, because the memory barriers that C++0x atomic operations generate by default prevent both (1) and (2).

confucius 2010-10-25 07:13:52

Answer 4

+3 A:

Every now and then, x will wrap around to 0 just before y wraps around to zero. At this point y will legitimately be greater than x.
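A small single-threaded sketch of that moment, assuming unsigned counters so the wrap-around is well defined (the function name is made up for illustration):

```cpp
#include <limits>

// Both counters sit at the maximum; x is incremented first and wraps to 0
// while y still holds the maximum value.
bool x_less_than_y_after_wrap() {
    unsigned x = std::numeric_limits<unsigned>::max();
    unsigned y = std::numeric_limits<unsigned>::max();

    ++x;           // unsigned arithmetic wraps: x is now 0
    return x < y;  // 0 < max, so the check fires even with correct ordering
}
```

Between the two increments the comparison is true no matter how carefully the reads are ordered.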

Martin York 2010-10-25 08:16:51

Took a shot at editing it. It might as well be noted that signed overflow leads to undefined behavior, though unsigned overflow wraps as expected.

GMan 2010-10-25 08:22:28

Agreed. Overflow was not supposed to happen; it was only there for the test, but it could be a problem too. Thanks.

confucius 2010-10-25 09:10:37


Memory ordering issues
