Multithreading: classical Producer Consumer algorithm

views:

answers:

+1 Q:

Multithreading: classical Producer Consumer algorithm

Hi all,

Something I don't get about the classical algorithm for the Producer-Consumer problem (from Wikipedia:)

semaphore mutex = 1
semaphore fillCount = 0
semaphore emptyCount = BUFFER_SIZE

procedure producer() {
    while (true) {
        item = produceItem()
        down(emptyCount)
            down(mutex)
                putItemIntoBuffer(item)
            up(mutex)
        up(fillCount)
    }
    up(fillCount) //the consumer may not finish before the producer.
}

procedure consumer() {
    while (true) {
        down(fillCount)
            down(mutex)
                item = removeItemFromBuffer()
            up(mutex)
        up(emptyCount)
        consumeItem(item)
    }
}

I note that both producers and consumers lock 'mutex' prior to messing with the buffer, and unlock it thereafter. If that is the case, i.e. only a single thread is accessing the buffer at any given moment, I don't really see how the above algo differs from a simple solution that entails only putting a guarding mutex over the buffer:

semaphore mutex = 1

procedure producer() {
    while (true) {
        item = produceItem()
        flag = true
        while (flag) {
            down(mutex)
            if (bufferNotFull()) {
                putItemIntoBuffer(item)
                flag = false
            }
            up(mutex)
        }
    }
}


procedure consumer() {
    while (true) {
        flag = true
        while (flag) {
            down(mutex)
            if (bufferNotEmpty()) {
                item = removeItemFromBuffer()
                flag = false
            }
            up(mutex)
        }
        consumeItem(item)
    }
}

Only thing I can think of that necessitates using the 'fillCount' and 'emptyCount' semaphores is scheduling.

Maybe the first algo is for making sure that in a state where 5 consumers are waiting on an empty buffer (zero 'fillCount'), it is assured that when a new producer comes along, it will go past its "down(emptyCount)" statement quickly and get the 'mutex' quickly.

(whereas in the other solution the consumers will needlessly get the 'mutex' only to relinquish it until the new producer gets it and inserts an item).

Am I right? Am I missing something?

+2 A:

If there are no messages in the buffer, the consumer will down the mutex, check the buffer, find that it's empty, up the mutex, loop back around and immediately repeat the process. In simple terms, consumers and producers are stuck in busy loops that chew up 100% of a CPU core. This is not just a theoretical problem, either. You may well find that your computer's fan starts to spin every time you run your program.

Marcelo Cantos 2010-10-30 12:07:51

Thanks for your quick reply. So when using the correct algorithm I need to rely on the up(),down() operations not to hog CPU... I mean a premise for that algorithm is that these operations magically don't do that, isn't it?

bloodcell 2010-10-30 12:14:59

Yes, the algorithm depends on the fact that, if the semaphore is at zero, the `down()` operation will put the thread that called it to sleep until someone calls `up()` on the same semaphore.

Marcelo Cantos 2010-10-30 12:17:41

There is a starving issue if more than one consumer is involved. In the latter example consumer checks for an item in a loop. But by the time he get's to try again some other consumer may "steal" his item. And next time again, and again. That is not nice.

Dialecticus 2010-10-30 12:10:34

The item is removed from the buffer before the mutex is released, so if there's an item, the current consumer gets it. It doesn't matter which consumer gets it, in most cases, but that consumers never see an empty buffer with stuff in it or a not-empty buffer that has no items. The mutex is enough for that. The second program does do a lot of polling, though.

cHao 2010-10-30 12:15:38

+1 A:

There's more than just the locking of the buffer going on.

The fillCount semaphore in the first program is to pause the consumer(s) when there's nothing left to consume. Without it, you're constantly polling the buffer to see if there's anything to get, and that's quite wasteful.

Likewise, the emptyCount semaphore pauses the producer when the buffer is full.

cHao 2010-10-30 12:11:48

The hole concept of the producer/consumer pattern is that the shared resource (buffer) is only accessed if certain criteria are met. And to not utilize unnecessary CPU cycles in order to make sure they are met.

Consumer:

Wait if there is nothing to consume (=> empty buffer).

Producer:

Wait if there is not enough space in the buffer.

And for both its very important to note:

Wait != Spin wait

ntziolis 2010-10-30 12:46:07

ansaurus

tags:

views:

answers:

Multithreading: classical Producer Consumer algorithm

related questions