I have two simple microbenchmarks trying to measure thread- and process-switching overheads, but the process-switching overhead is turning out to be lower than that of thread-switching, which is unexpected. The setup: 1.8GHz Core 2 Duo, 2GB RAM, Linux 2.6.32-21-generic x86_64 (Ubuntu 10.04). I'm getting:

  • ~2.1-2.4us per process switch
  • ~4us per thread switch

I also tried running with numactl --physcpubind=0 and likwid-pin -c0, but this only seemed to slow the thread switches down to ~5us. Does anybody know what's wrong with the evaluation, or, if these results are right, why they are?
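(Incidentally, the pinning can also be done from inside the program. A minimal sketch, assuming glibc's non-portable pthread_setaffinity_np; this helper is my illustration and not part of the benchmarks below:)

#define _GNU_SOURCE
#include <pthread.h>
#include <sched.h>

// Pin the calling thread to a single core (glibc extension).
// Returns 0 on success, an errno value on failure.
static int pin_to_core (int core)
{
    cpu_set_t set;
    CPU_ZERO(&set);
    CPU_SET(core, &set);
    return pthread_setaffinity_np(pthread_self(), sizeof(set), &set);
}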

The code lives at the URLs below; revision r1667 is pasted here:

https://assorted.svn.sourceforge.net/svnroot/assorted/sandbox/trunk/src/c/process_switch_bench.c

// on zs, ~2.1-2.4us/switch

#include <stdlib.h>
#include <fcntl.h>
#include <stdint.h>
#include <stdio.h>
#include <semaphore.h>
#include <unistd.h>
#include <signal.h> // for kill()
#include <sys/wait.h>
#include <sys/types.h>
#include <sys/time.h>
#include <pthread.h>

uint32_t COUNTER;
pthread_mutex_t LOCK;
pthread_mutex_t START; // unused here; leftover from the thread benchmark
sem_t *s0, *s1, *s2;

// Ping-pong loop, run by one thread in the parent and (after a semaphore
// swap) by the child process: post to the peer, then wait for it to post back.
void * threads (
    void * unused
) {
    // Wait till we may fire away
    sem_wait(s2);

    for (;;) {
        pthread_mutex_lock(&LOCK);
        pthread_mutex_unlock(&LOCK);
        COUNTER++;
        sem_post(s0);
        sem_wait(s1);
    }
    return 0;
}

int64_t timeInMS ()
{
    struct timeval t;

    gettimeofday(&t, NULL);
    return (
        (int64_t)t.tv_sec * 1000 +
        (int64_t)t.tv_usec / 1000
    );
}

int main (
    int argc,
    char ** argv
) {
    int64_t start;
    pthread_t t1;

    pthread_mutex_init(&LOCK, NULL);

    COUNTER = 0;
    s0 = sem_open("/s0", O_CREAT, 0022, 0);
    if (s0 == SEM_FAILED) { perror("sem_open"); exit(1); }
    s1 = sem_open("/s1", O_CREAT, 0022, 0);
    if (s1 == SEM_FAILED) { perror("sem_open"); exit(1); }
    s2 = sem_open("/s2", O_CREAT, 0022, 0);
    if (s2 == SEM_FAILED) { perror("sem_open"); exit(1); }

    // Sanity check: nonzero values here mean stale semaphores from a
    // previous run (O_CREAT does not reset an existing semaphore).
    int x, y, z;
    sem_getvalue(s0, &x);
    sem_getvalue(s1, &y);
    sem_getvalue(s2, &z);
    printf("%d %d %d\n", x, y, z);

    pid_t pid = fork();
    if (pid) { // parent: one thread runs the loop, the main thread times it
      pthread_create(&t1, NULL, threads, NULL);
      pthread_detach(t1);
      // Get start time and fire away
      start = timeInMS();
      sem_post(s2);
      sem_post(s2);

      // Wait for about a second
      sleep(1);
      // Stop thread
      pthread_mutex_lock(&LOCK);

      // Find out how much time has really passed. sleep won't guarantee me that
      // I sleep exactly one second, I might sleep longer since even after being
      // woken up, it can take some time before I gain back CPU time. Further
      // some more time might have passed before I obtained the lock!
      int64_t time = timeInMS() - start;
      // Each counted iteration is one full round trip, i.e. two context
      // switches; scale by the elapsed milliseconds to get switches/second.
      COUNTER = (uint32_t)(((uint64_t)COUNTER * 2 * 1000) / time);
      printf("Number of process switches in about one second was %u\n", COUNTER);
      printf("roughly %f microseconds per switch\n", 1000000.0 / COUNTER);

      // clean up
      kill(pid, SIGKILL);
      wait(0);
      sem_close(s0);
      sem_close(s1);
      sem_close(s2);
      sem_unlink("/s0");
      sem_unlink("/s1");
      sem_unlink("/s2");
    } else {
      // Child: swap s0 and s1 so parent and child post/wait on opposite
      // semaphores, then enter the same ping-pong loop.
      sem_t *t = s0; s0 = s1; s1 = t;
      threads(0); // never returns
    }
    return 0;
}
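(Two side notes, mine rather than part of the original code: both programs should build with something like gcc -O2 process_switch_bench.c -o process_switch_bench -lpthread -lrt, where -lrt is needed for sem_open on older glibc. Also, gettimeofday reads the wall clock, which NTP can step mid-run; a monotonic clock is a safer base for interval timing. A drop-in sketch:)

#include <stdint.h>
#include <time.h>

// Monotonic milliseconds, immune to wall-clock adjustments.
// Sketch only; the benchmarks as posted use gettimeofday().
int64_t timeInMS (void)
{
    struct timespec t;

    clock_gettime(CLOCK_MONOTONIC, &t);
    return (
        (int64_t)t.tv_sec * 1000 +
        (int64_t)t.tv_nsec / 1000000
    );
}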

https://assorted.svn.sourceforge.net/svnroot/assorted/sandbox/trunk/src/c/thread_switch_bench.c

// From <http://stackoverflow.com/questions/304752/how-to-estimate-the-thread-context-switching-overhead>

// on zs, ~4-5us/switch; tried making COUNTER updated only by one thread, but no difference

#include <stdlib.h>
#include <stdint.h>
#include <stdio.h>
#include <pthread.h>
#include <unistd.h>
#include <sys/time.h>

uint32_t COUNTER;
pthread_mutex_t LOCK;
pthread_mutex_t START;
pthread_cond_t CONDITION;

void * threads (
    void * unused
) {
    // Wait till we may fire away
    pthread_mutex_lock(&START);
    pthread_mutex_unlock(&START);
    int first=1;

    pthread_mutex_lock(&LOCK);
    // If I'm not the first thread, the other thread is already waiting on
    // the condition, thus I have to wake it up first, otherwise we'll deadlock
    if (COUNTER > 0) {
        pthread_cond_signal(&CONDITION);
        first=0;
    }
    for (;;) {
        if (first) COUNTER++;
        pthread_cond_wait(&CONDITION, &LOCK);
        // Always wake up the other thread before processing. The other
        // thread will not be able to do anything as long as I don't go
        // back to sleep first.
        pthread_cond_signal(&CONDITION);
    }
    pthread_mutex_unlock(&LOCK);
    return 0;
}

int64_t timeInMS ()
{
    struct timeval t;

    gettimeofday(&t, NULL);
    return (
        (int64_t)t.tv_sec * 1000 +
        (int64_t)t.tv_usec / 1000
    );
}


int main (
    int argc,
    char ** argv
) {
    int64_t start;
    pthread_t t1;
    pthread_t t2;

    pthread_mutex_init(&LOCK, NULL);
    pthread_mutex_init(&START, NULL);   
    pthread_cond_init(&CONDITION, NULL);

    pthread_mutex_lock(&START);
    COUNTER = 0;
    pthread_create(&t1, NULL, threads, NULL);
    pthread_create(&t2, NULL, threads, NULL);
    pthread_detach(t1);
    pthread_detach(t2);
    // Get start time and fire away
    start = timeInMS();
    pthread_mutex_unlock(&START);
    // Wait for about a second
    sleep(1);
    // Stop both threads
    pthread_mutex_lock(&LOCK);
    // Find out how much time has really passed. sleep won't guarantee me that
    // I sleep exactly one second, I might sleep longer since even after being
    // woken up, it can take some time before I gain back CPU time. Further
    // some more time might have passed before I obtained the lock!
    int64_t time = timeInMS() - start;
    // Each counted iteration is one full round trip, i.e. two context
    // switches; scale by the elapsed milliseconds to get switches/second.
    COUNTER = (uint32_t)(((uint64_t)COUNTER * 2 * 1000) / time);
    printf("Number of thread switches in about one second was %u\n", COUNTER);
    printf("roughly %f microseconds per switch\n", 1000000.0 / COUNTER);
    return 0;
}
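(One caveat I should flag myself: the bare cond_wait/cond_signal ping-pong above relies on there being exactly two threads and no spurious wakeups. An explicit turn flag makes the hand-off robust; a sketch reusing the globals above, with threads2 and turn being my hypothetical names:)

// Token-passing variant: 'turn' (protected by LOCK) says whose move it is.
int turn = 0;

void * threads2 (
    void * arg
) {
    int me = (arg != 0);  // 0 or 1
    // Wait till we may fire away
    pthread_mutex_lock(&START);
    pthread_mutex_unlock(&START);

    pthread_mutex_lock(&LOCK);
    for (;;) {
        while (turn != me)
            pthread_cond_wait(&CONDITION, &LOCK);
        if (me == 0) COUNTER++;   // count each full round trip once
        turn = 1 - me;            // hand the token to the other thread
        pthread_cond_signal(&CONDITION);
    }
    pthread_mutex_unlock(&LOCK);
    return 0;
}

(The two threads would be created with pthread_create(&t1, NULL, threads2, (void *) 0) and pthread_create(&t2, NULL, threads2, (void *) 1).)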
A: 

Simple: pthread_mutex_lock() takes about 2us on your system, and your threaded version takes two locks each time through the loop, whereas the process version takes only one lock.

Andrew McGregor
Actually, it's the process-switching benchmark that touches more synchronization structures: per iteration it does a mutex lock/unlock plus a sem_post and a sem_wait, while the thread version only interacts with one lock/CV pair. (I've confusingly left the name unchanged -- the main loop lives in a threads() function in both programs.)
Yang
A: 

Historically, Unix (and Linux as its derivative) had a relatively cheap fork(), so process creation wasn't an issue and concurrent processing was (and mostly still is) done using multiple processes.

Later, other OSes appeared (I don't want to name names) on which process creation was very heavy, so the people working on them had to invent threads, which are very "light" processes, thereby introducing a whole new bunch of concurrency problems.

The UNIX/Linux world followed suit by introducing threads as well, although there wasn't really a need. However, support for threads in Linux is somewhat limited -- threads of one process must share one core -- so in many cases a multi-process setup on Linux is faster than a multi-threaded one, which is likely the reason for the result you've got.

qrdl
Thanks for the explanation, but I'm actually already aware of the history here, of the high-level overhead differences, and of the pains of shared-memory concurrent programming. My question still stands, though. Also, it is very much not true that threads of a process are pinned to the same core.
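(A quick way to check, assuming glibc's sched_getcpu(); on an otherwise idle two-core box like this one, the two busy threads virtually always report different cores:)

#define _GNU_SOURCE
#include <sched.h>
#include <stdio.h>
#include <pthread.h>

// Each thread spins for a while, then reports the core it last ran on.
static void * report (void * name)
{
    for (volatile long i = 0; i < 100000000L; i++) ;
    printf("%s ran on CPU %d\n", (const char *) name, sched_getcpu());
    return 0;
}

int main (void)
{
    pthread_t a, b;
    pthread_create(&a, NULL, report, "thread A");
    pthread_create(&b, NULL, report, "thread B");
    pthread_join(a, NULL);
    pthread_join(b, NULL);
    return 0;
}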
Yang
OK, I was incorrect about limited support for threads in CFS, but if I remember correctly, if you don't specify CPU affinity for a thread explicitly, it will use the same CPU/core. The idea was probably to minimise cache misses, but as a result threads of one process tend to end up bound to the same core.
qrdl