I have written the followin atomic template with a view to mimicing the atomic operations which will be available in the upcoming c++0x standard.
However, I am not sure that the __sync_synchronize() call I have around the returning of the underlying value are necessary.
From my understanding, __sync_synchronize() is a full memory barrier and I'm not sure I need such a costly call when returning the object value.
I'm pretty sure it'll be needed around the setting of the value but I could also implement this with the assembly ..
__asm__ __volatile__ ( "rep;nop": : :"memory" );
Does anyone know wether I definitely need the synchronize() on return of the object.
M.
template < typename T >
struct atomic
{
private:
volatile T obj;
public:
atomic( const T & t ) :
obj( t )
{
}
inline operator T()
{
__sync_synchronize(); // Not sure this is overkill
return obj;
}
inline atomic< T > & operator=( T val )
{
__sync_synchronize(); // Not sure if this is overkill
obj = val;
return *this;
}
inline T operator++()
{
return __sync_add_and_fetch( &obj, (T)1 );
}
inline T operator++( int )
{
return __sync_fetch_and_add( &obj, (T)1 );
}
inline T operator+=( T val )
{
return __sync_add_and_fetch( &obj, val );
}
inline T operator--()
{
return __sync_sub_and_fetch( &obj, (T)1 );
}
inline T operator--( int )
{
return __sync_fetch_and_sub( &obj, (T)1 );
}
inline T operator-=( T )
{
return __sync_sub_and_fetch( &obj, val );
}
// Perform an atomic CAS operation
// returning the value before the operation
inline T exchange( T oldVal, T newVal )
{
return __sync_val_compare_and_swap( &obj, oldval, newval );
}
};
Update: I want to make sure that the operations are consistent in the face of read/write re-ordering due to compiler optimisations.