Suppose we have a map that is shared between multiple threads. It represents a node in some hierarchical structure (say, a directory in a file system) that is stored on disk. Constructing a Value is expensive both in time and in memory. This is the classic 'initialize on demand' problem, with a twist: we're initializing values in the map when there's a lookup request, but we don't want to lock the entire map while we do so, so that other threads can keep accessing already constructed values. In this application, lookups of existing Values will be much more common than lookups of non-existing ones.

Attempt 1: Grab write lock on map, check for presence of Key, return if present, otherwise, construct Value, put in map.

Evaluation: This prevents other threads from accessing the map while we're constructing the Value. Since reads are very common and very fast, this will manifest itself as an ugly jump in latency: not good.
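For concreteness, here is a minimal Java sketch of Attempt 1 (the class name, the loadFromDisk factory, and the choice of ReentrantReadWriteLock are illustrative assumptions, not part of the question):

  import java.util.*;
  import java.util.concurrent.locks.*;
  import java.util.function.Function;

  // Attempt 1: the write lock is held for the whole construction,
  // so every reader stalls while loadFromDisk() runs.
  class CacheAttempt1<K, V> {
      private final Map<K, V> map = new HashMap<>();
      private final ReadWriteLock lock = new ReentrantReadWriteLock();
      private final Function<K, V> loadFromDisk;   // hypothetical expensive factory

      CacheAttempt1(Function<K, V> loadFromDisk) { this.loadFromDisk = loadFromDisk; }

      V lookup(K key) {
          lock.writeLock().lock();
          try {
              V value = map.get(key);
              if (value == null) {
                  value = loadFromDisk.apply(key);   // expensive: blocks all readers
                  map.put(key, value);
              }
              return value;
          } finally {
              lock.writeLock().unlock();
          }
      }
  }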

Attempt 2: Grab read lock on map, check for presence of Key, return if present, otherwise release read lock, construct Value, grab write lock, check for presence, if not present, put in map, if present delete newly constructed Value.

Evaluation: Now we don't incur the jump in read latency, but we might end up needlessly constructing several in-memory representations of the same Value (when multiple threads look up the same yet-unconstructed Value simultaneously). The problem is, we don't want this: these Values are really expensive to create. Besides, they might start firing events or doing I/O, which means we now have to deal with a design that allows ephemeral yet heavyweight Value instances to exist: an additional headache better avoided.
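A sketch of Attempt 2 under the same assumptions as above; note the window between releasing the read lock and taking the write lock, during which several threads may each build the same Value:

  import java.util.*;
  import java.util.concurrent.locks.*;
  import java.util.function.Function;

  // Attempt 2: readers are never blocked by construction, but several threads
  // may construct the same Value; only one result wins, the rest are discarded.
  class CacheAttempt2<K, V> {
      private final Map<K, V> map = new HashMap<>();
      private final ReadWriteLock lock = new ReentrantReadWriteLock();
      private final Function<K, V> loadFromDisk;   // hypothetical expensive factory

      CacheAttempt2(Function<K, V> loadFromDisk) { this.loadFromDisk = loadFromDisk; }

      V lookup(K key) {
          lock.readLock().lock();
          try {
              V value = map.get(key);
              if (value != null) return value;
          } finally {
              lock.readLock().unlock();
          }

          V fresh = loadFromDisk.apply(key);          // may run in several threads at once

          lock.writeLock().lock();
          try {
              V existing = map.get(key);
              if (existing != null) return existing;  // lost the race: 'fresh' is wasted
              map.put(key, fresh);
              return fresh;
          } finally {
              lock.writeLock().unlock();
          }
      }
  }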

Attempt 3: Use two levels of locks. Grab read lock on map, look up Key, return if present; otherwise release read lock on map, grab write lock on map, and check for a placeholder. If one is present, release the map's write lock and wait on the placeholder's read lock; otherwise create a placeholder, grab its write lock, insert it into the map, release the map's write lock, construct the Value, replace the placeholder with the Value, and release the placeholder's write lock.

Evaluation: Inserting a placeholder into the map while constructing the Value guarantees that only one thread will attempt the construction, thus addressing the problems of Attempt 2. However, this design leaves open the question of what form the placeholder should take, and the answer is not trivial. First, if it contains a lock and threads wait on it, it becomes hard to delete (how do you know nobody is holding the placeholder's lock? You can grab it, sure, but then you are holding it, so, again, you cannot delete it). Second, there can be lookups of Keys for which no Value exists on disk. If we insert a placeholder for every lookup attempt (even for ones that will eventually fail), then who's going to clean these up? Finally, with two levels of locks the code becomes quite ugly: we first grab the read lock, check, then acquire the write lock, re-check, and we need to do this both at the map level and at the individual placeholder level (it's easy to create deadlocks, as I can attest).
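To make Attempt 3 concrete, here is a hedged Java sketch with the placeholder carrying its own lock as described. The Placeholder type and the map-of-Object trick are illustrative assumptions; error handling and placeholder cleanup are omitted, which is exactly the loose end discussed above:

  import java.util.*;
  import java.util.concurrent.locks.*;
  import java.util.function.Function;

  // Attempt 3: a placeholder is inserted under the map's write lock so only one
  // thread builds a given Value; waiters block on the placeholder's read lock.
  class CacheAttempt3<K, V> {
      private static final class Placeholder<V> {
          final ReadWriteLock lock = new ReentrantReadWriteLock();
          volatile V value;                             // set once construction finishes
      }

      private final Map<K, Object> map = new HashMap<>();   // holds V or Placeholder<V>
      private final ReadWriteLock mapLock = new ReentrantReadWriteLock();
      private final Function<K, V> loadFromDisk;             // hypothetical expensive factory

      CacheAttempt3(Function<K, V> loadFromDisk) { this.loadFromDisk = loadFromDisk; }

      @SuppressWarnings("unchecked")
      V lookup(K key) {
          mapLock.readLock().lock();
          try {
              Object entry = map.get(key);
              if (entry != null && !(entry instanceof Placeholder)) return (V) entry;
          } finally {
              mapLock.readLock().unlock();
          }

          Placeholder<V> placeholder;
          boolean builder = false;
          mapLock.writeLock().lock();
          try {
              Object entry = map.get(key);
              if (entry != null && !(entry instanceof Placeholder)) return (V) entry;
              if (entry == null) {
                  placeholder = new Placeholder<>();
                  placeholder.lock.writeLock().lock();    // held until the Value is ready
                  map.put(key, placeholder);
                  builder = true;
              } else {
                  placeholder = (Placeholder<V>) entry;
              }
          } finally {
              mapLock.writeLock().unlock();
          }

          if (builder) {
              try {
                  V value = loadFromDisk.apply(key);      // if this throws, the placeholder
                  placeholder.value = value;              // leaks: the cleanup problem above
                  mapLock.writeLock().lock();
                  try { map.put(key, value); } finally { mapLock.writeLock().unlock(); }
                  return value;
              } finally {
                  placeholder.lock.writeLock().unlock();
              }
          }
          placeholder.lock.readLock().lock();             // wait for the builder to finish
          try { return placeholder.value; } finally { placeholder.lock.readLock().unlock(); }
      }
  }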

This little baby keeps popping up and I can't seem to find an elegant solution. Attempt 3 is the closest I got to satisfying my performance conditions, but it's ugly and error prone once you get down to the dirty details. I would really appreciate any insights or suggestions.

+2  A: 

It appears the goal is to enable fast lookups of existing objects at the expense of slower creation of new objects. Here is a solution which will do this for you.

You need to maintain two maps in memory; both will ultimately have the same contents. You'll also need one mutex for writing and one for reading:

- curReadMap *
- curWriteMap *
- writeMutex
- readMutex

Those two pointers are important: we'll be swapping them. Now, to read a value you have (rough pseudo-code):

  lock( readMutex )
  value = checkValueIn( curReadMap, key )
  unlock( readMutex )
  if( value ) return value

If you don't find the value then you can enter the writing section

  lock( writeMutex )
  value = checkValueIn( curWriteMap, key ) //double check now
  if( value ) { unlock( writeMutex ); return value }

  value = createNewValue()
  putIn( curWriteMap, key, value )

  lock( readMutex )
  swap( curWriteMap, curReadMap )
  unlock( readMutex )

  putIn( curWriteMap, key, value ) //update old map now as well
  unlock( writeMutex )

In this scheme the typical reader bears only the cost of one mutex lock and a map lookup; they only ever pay the creation cost if the object is not found. Swapping the map pointers also keeps the hold time on readMutex very short.
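A runnable Java rendering of this scheme might look roughly as follows (the SwappingCache name and the loadFromDisk factory are placeholders, not part of the answer above):

  import java.util.*;
  import java.util.function.Function;

  // Double-map scheme: readers touch only readMutex; writers serialize on
  // writeMutex and take readMutex only briefly, to swap the two map references.
  class SwappingCache<K, V> {
      private final Object readMutex = new Object();
      private final Object writeMutex = new Object();
      private Map<K, V> curReadMap = new HashMap<>();
      private Map<K, V> curWriteMap = new HashMap<>();
      private final Function<K, V> loadFromDisk;   // hypothetical expensive factory

      SwappingCache(Function<K, V> loadFromDisk) { this.loadFromDisk = loadFromDisk; }

      V lookup(K key) {
          synchronized (readMutex) {
              V value = curReadMap.get(key);
              if (value != null) return value;
          }
          synchronized (writeMutex) {
              V value = curWriteMap.get(key);        // double check now
              if (value != null) return value;

              value = loadFromDisk.apply(key);       // expensive; readers are unaffected
              curWriteMap.put(key, value);

              synchronized (readMutex) {             // brief: just swap the two references
                  Map<K, V> tmp = curReadMap;
                  curReadMap = curWriteMap;
                  curWriteMap = tmp;
              }
              curWriteMap.put(key, value);           // update the old map now as well
              return value;
          }
      }
  }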

edA-qa mort-ora-y
I've considered something similar. Actually, you don't even need the second map. Just have a create_lock that all threads must grab when they try to create a new object. I found this unsatisfactory for two reasons: we cannot create objects in parallel (since they take a while to create, this is a bummer), and there are too many lock()/unlock() calls for my taste (especially since the order of lock acquisition is, well, interleaved).
Lajos Nagy
You are right, you could avoid the second map. This would put the insertion time inside the read lock. That may not be a problem (it depends on how big the map is). You shouldn't be afraid of interleaved locks: they are always acquired in the same order, so there is no fear of deadlock in that case. If you really need creation to happen in parallel, and you can't afford to let an object be created twice, then you'll have to move to a more complex system with more locks and/or condition variables.
edA-qa mort-ora-y
A: 

Your trouble with scheme #3 is how to keep track of the placeholder objects. If you're willing to give up a bit of parallelism, you can do it more easily: allocate a single lock per map that serializes value creation. Then it goes:

lookup(map, key):
  1. Grab read lock on map
  2. Look up key in map; if present, release read lock and return
  3. Release read lock on map
  4. Grab lock on map.create_serializer
  5. Grab read lock on map, look up key again, release read lock;
     if present, release create_serializer and return
  6. Create value for key
  7. Grab write lock on map
  8. Insert (key, value) into map
  9. Release write lock on map
 10. Release lock on map.create_serializer

The create_serializer makes sure that no more than one thread is creating objects to put into this map at any one time. Note that the read lock is released (step 3) before the serializer is taken (step 4); holding it across that acquisition could deadlock against a creator waiting for the write lock. Multiple threads looking up the same missing key at the same time will serialize at step 4: the first will go on to build the value, and the rest will find that value already built in step 5.

This will (unnecessarily) serialize different value creations for the same map, but otherwise satisfies your criteria.
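A possible Java rendering of this scheme, under the same hypothetical loadFromDisk factory as the sketches in the question:

  import java.util.*;
  import java.util.concurrent.locks.*;
  import java.util.function.Function;

  // create_serializer scheme: one extra lock serializes all value creation for
  // the map, so a missing key is only ever built once.
  class SerializedCreationCache<K, V> {
      private final Map<K, V> map = new HashMap<>();
      private final ReadWriteLock mapLock = new ReentrantReadWriteLock();
      private final ReentrantLock createSerializer = new ReentrantLock();
      private final Function<K, V> loadFromDisk;   // hypothetical expensive factory

      SerializedCreationCache(Function<K, V> loadFromDisk) { this.loadFromDisk = loadFromDisk; }

      V lookup(K key) {
          // Fast path: read lock only (steps 1-3).
          mapLock.readLock().lock();
          try {
              V value = map.get(key);
              if (value != null) return value;
          } finally {
              mapLock.readLock().unlock();
          }

          // Slow path: serialize creation, re-check, then build and insert (steps 4-10).
          createSerializer.lock();
          try {
              mapLock.readLock().lock();
              try {
                  V value = map.get(key);
                  if (value != null) return value;   // someone else built it first
              } finally {
                  mapLock.readLock().unlock();
              }

              V value = loadFromDisk.apply(key);     // only one creator at a time
              mapLock.writeLock().lock();
              try {
                  map.put(key, value);
              } finally {
                  mapLock.writeLock().unlock();
              }
              return value;
          } finally {
              createSerializer.unlock();
          }
      }
  }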

Keith Randall