What are the "things to know" when diving into multi-threaded programming in C++

+37 Q:

I'm currently working on a wireless networking application in C++ and it's coming to a point where I'm going to want to multi-thread pieces of software under one process, rather than have them all in separate processes. Theoretically, I understand multi-threading, but I've yet to dive in practically.

What should every programmer know when writing multi-threaded code in C++?

Simple answer Locks!

WACM161 2010-01-22 15:05:27

Simple to the point of being no help.

Mike DeSimone 2010-01-22 15:19:52

+10 A:

You should read about locks, mutexes, semaphores and condition variables.

One word of advice, if your app has any form of UI make sure you always change it from the UI thread. Most UI toolkits/frameworks will crash (or behave unexpectedly) if you access them from a background thread. Usually they provide some form of dispatching method to execute some function in the UI thread.
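
To make the "dispatching method" idea concrete, here is a minimal sketch of what such a facility might look like. `UiDispatcher`, `post` and `run_pending` are hypothetical names, not the API of any real toolkit; a real framework supplies its own equivalent. It uses the C++11 standard library rather than a platform API.

```cpp
#include <functional>
#include <mutex>
#include <queue>
#include <thread>
#include <utility>

// Hypothetical stand-in for a toolkit's "run this on the UI thread" facility.
class UiDispatcher {
public:
    // Safe to call from any thread: only enqueues the task.
    void post(std::function<void()> task) {
        std::lock_guard<std::mutex> lock(mutex_);
        tasks_.push(std::move(task));
    }

    // Called only by the UI thread, e.g. once per event-loop iteration.
    // Returns the number of tasks it ran.
    int run_pending() {
        std::queue<std::function<void()>> batch;
        {
            std::lock_guard<std::mutex> lock(mutex_);
            std::swap(batch, tasks_);  // take the whole batch, then release the lock
        }
        int count = 0;
        while (!batch.empty()) {
            batch.front()();  // tasks run on the calling (UI) thread
            batch.pop();
            ++count;
        }
        return count;
    }

private:
    std::mutex mutex_;
    std::queue<std::function<void()>> tasks_;
};

// Demo: a worker computes a value and asks the "UI thread" (here, the caller)
// to store it in a pretend widget.
int demo_ui_dispatch() {
    UiDispatcher ui;
    int label_value = 0;  // pretend widget; only the caller's thread touches it
    std::thread worker([&ui, &label_value] {
        int result = 6 * 7;
        ui.post([&label_value, result] { label_value = result; });
    });
    worker.join();
    ui.run_pending();
    return label_value;
}
```

The worker never touches the widget itself; it only hands a closure to the thread that owns it.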

ruibm 2010-01-22 15:08:49

Could you possibly elaborate more on the UI aspect? The application does not currently have a GUI, but it's on my list of "to-do-after-everything-else-is-done" items.

Mark 2010-01-22 15:12:32

There's not much to it. Just that UI frameworks are generally single-threaded to the extent that only one thread is even allowed to interact with the UI. Accessing any part of the UI from another thread is an error.

jalf 2010-01-22 15:15:56

Most UIs are not thread safe. That is, you can't have a thread altering a control in the GUI and another thread doing something else, even with another window. So you have to specify one thread (usually the main thread) as the only one which can access the GUI, and the other threads have to go through the GUI thread to do anything with the GUI.

Mike DeSimone 2010-01-22 15:17:40

Other things you should read about are deadlocks, priority inheritance, and race conditions.
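
As an illustration of the deadlock part: the classic case is two threads taking the same two locks in opposite order. The sketch below assumes a C++11 compiler and uses `std::lock`, which acquires several mutexes with a deadlock-avoidance algorithm. `Account` and `transfer` are made-up names for the example.

```cpp
#include <mutex>
#include <thread>

// Hypothetical account type, used only for illustration.
struct Account {
    std::mutex m;
    long balance = 0;
};

// Takes both locks via std::lock, so the order in which callers pass the
// accounts cannot produce a lock-order deadlock.
void transfer(Account& from, Account& to, long amount) {
    if (&from == &to) return;  // locking the same std::mutex twice is undefined behaviour
    std::lock(from.m, to.m);
    std::lock_guard<std::mutex> lock_from(from.m, std::adopt_lock);
    std::lock_guard<std::mutex> lock_to(to.m, std::adopt_lock);
    from.balance -= amount;
    to.balance += amount;
}

// Two threads transfer in opposite directions. With naive nested locking
// (lock from.m, then to.m) this pattern can deadlock.
long run_opposing_transfers() {
    Account a, b;
    a.balance = 1000;
    b.balance = 1000;
    std::thread t1([&] { for (int i = 0; i < 10000; ++i) transfer(a, b, 1); });
    std::thread t2([&] { for (int i = 0; i < 10000; ++i) transfer(b, a, 1); });
    t1.join();
    t2.join();
    return a.balance + b.balance;
}
```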

Mike DeSimone 2010-01-22 15:19:14

I'm using an open source C++ GUI framework (http://rawmaterialsoftware.com/juce.php) that doesn't complain about calling UI methods from other threads: you just have to use some global lock on the UI message manager thread while doing so.

Gaspard Bucher 2010-01-22 20:39:00

+16 A:

I am no expert at all in this subject. Just some rules of thumb:

1) Design for simplicity. Bugs really are hard to find in concurrent code, even in the simplest examples.
2) C++ offers you a very elegant paradigm to manage resources (mutex, semaphore, ...): RAII. I observed that it is much easier to work with boost::thread than to work with POSIX threads.
3) Build your code as thread-safe. If you don't do so, your program could behave strangely.
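
A small sketch of point 2, written with the C++11 standard library types (`std::mutex`, `std::lock_guard`) that were modelled on the Boost ones; assume a C++11 compiler. The `Counter` class is invented for the example. The point is that the guard's destructor releases the mutex on every exit path, including exceptions.

```cpp
#include <mutex>
#include <stdexcept>
#include <thread>
#include <vector>

class Counter {
public:
    void add(int n) {
        std::lock_guard<std::mutex> lock(mutex_);  // mutex acquired here
        if (n < 0) throw std::invalid_argument("n must be non-negative");
        value_ += n;
    }  // mutex released here, on normal return or when the exception propagates

    int get() const {
        std::lock_guard<std::mutex> lock(mutex_);
        return value_;
    }

private:
    mutable std::mutex mutex_;
    int value_ = 0;
};

int demo_raii_counter() {
    Counter c;
    std::vector<std::thread> threads;
    for (int t = 0; t < 4; ++t)
        threads.emplace_back([&c] { for (int i = 0; i < 1000; ++i) c.add(1); });
    for (auto& th : threads) th.join();
    try {
        c.add(-1);  // throws while holding the lock
    } catch (const std::invalid_argument&) {
    }
    return c.get();  // would block forever if the lock had leaked
}
```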

AraK 2010-01-22 15:13:39

what advantages does boost::thread have over POSIX threads?

Mark 2010-01-22 15:16:18

Boost threads are a wrapper around POSIX threads on UNIX systems, and Win32 threads on Windows systems. Boost threads is a C++ library and easy to use in C++ applications, whereas POSIX Threads is a C library and requires a lot of work on your end to play nice with your objects.

Dr. Watson 2010-01-22 15:17:21

@Mark: the main advantages are portability, and a C++ interface - in particular, RAII classes to manage things like mutex locks.

Mike Seymour 2010-01-22 15:19:52

@Mark Writing exception-safe code with POSIX is not possible, unless you manage to write your own classes. This is just an example, adding to what has been said above.

AraK 2010-01-22 15:30:53

I think the question is "how do I design code to be thread safe?"

Max Lybbert 2010-01-22 19:23:04

@Max Thread safe code is the job of the compiler not the programmer. do you mean "correct synchronized access to shared data?"

AraK 2010-01-22 22:11:42

Another advantage to using boost::thread is that it's the basis for the threading code going into the new C++ standard... so code you write using boost::thread will behave similarly and have the same constructs as thread code in the new standard. I also agree with the advantages others have stated... I highly recommend boost::thread for multithreading.

Polaris878 2010-01-22 22:12:22

+20 A:

I would focus on designing the thing to be as partitioned as possible, so that you share the minimal amount of state across threads. If you make sure you don't have statics and other resources shared among threads (other than those that you would be sharing if you designed this with processes instead of threads), you will be fine.

Therefore, while yes, you have to have in mind concepts like locks, semaphores, etc, the best way to tackle this is to try to avoid them.
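
One way to picture this, as a sketch only: give each thread its own slice of the input and its own output slot, and combine the results after joining. No lock is needed because no two threads write to the same object. The function name `partitioned_sum` is made up, and C++11 `std::thread` is assumed.

```cpp
#include <algorithm>
#include <cstddef>
#include <numeric>
#include <thread>
#include <vector>

// Sums `data` using `num_threads` workers that share nothing mutable:
// each worker reads its own slice and writes only to partial[t].
long long partitioned_sum(const std::vector<int>& data, unsigned num_threads) {
    if (num_threads == 0) num_threads = 1;
    std::vector<long long> partial(num_threads, 0);
    std::vector<std::thread> workers;
    const std::size_t chunk = (data.size() + num_threads - 1) / num_threads;
    for (unsigned t = 0; t < num_threads; ++t) {
        const std::size_t begin = std::min(data.size(), t * chunk);
        const std::size_t end = std::min(data.size(), begin + chunk);
        workers.emplace_back([&data, &partial, t, begin, end] {
            long long local = 0;  // accumulate privately, publish once
            for (std::size_t i = begin; i < end; ++i) local += data[i];
            partial[t] = local;
        });
    }
    for (auto& w : workers) w.join();  // join makes the workers' writes visible here
    return std::accumulate(partial.begin(), partial.end(), 0LL);
}
```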

Ariel 2010-01-22 15:14:52

It is impossible to have static resources shared across threads? Or just not recommended? Is that not what the "volatile" property is for? (correct me if I'm wrong, which I very well might be.)

Mark 2010-01-22 15:29:46

It is possible, for example by defining a static variable in the global scope. You can do that, and sometimes you have to, but in those cases, you have to restrict access to one thread at a time, for example through a lock, semaphore, etc. That typically introduces performance issues (because threads have to wait to enter a critical section). So, if you design the thing properly, you should be able to minimize multi-threading issues.

Ariel 2010-01-22 15:45:22

`volatile` has nothing to do with multithreading. The keyword is meant for memory-mapped hardware - to ensure that when you write to a memory location mapped to some hardware device, the write is performed immediately, rather than being cached in a register initially. But that doesn't guarantee that the write is *atomic*, which would be a requirement for safely using it in a multithreaded context. In general, `volatile` is useless for threading. You should use memory barriers instead. Or ensure that your static resources are immutable.

jalf 2010-01-22 16:05:16

`volatile` is also sometimes used for memory-mapped input variables, to ensure that the compiler doesn't optimize it away. I've done that on embedded-systems.

Paul Nathan 2010-01-22 16:22:55

@jalf: `volatile` can be interpreted as meaning that the value might change externally, which could be very useful in a multithreaded context. Of course, it would be even more useful if it were atomic.

David Thornley 2010-01-22 16:26:54

`volatile` is sometimes acceptable when you want a quick-n-dirty int/bool check in a loop of a value that some other thread writes, and you don't care about timing or ordering. No need for mutex then.

Marcus Lindblom 2010-01-22 16:37:15

@David: No, it could be useful in a multithreaded context if it was atomic. Having one guarantee without the other is just useless. Without atomicity there's just no point. @Marcus: I can't imagine any case where ordering isn't relevant in a MT context. What if the bool is set before the data it's meant to guard, due to reordering?

jalf 2010-01-22 16:52:44

Ultimately, volatile is too aggressive about an invariant we don't really need (that of immediately reading/writing through to memory). We need that *sometimes*, but not necessarily on every read or write. So a mem barrier is a more efficient solution there. But we also need to protect against reordering, and volatile doesn't do that, memory barriers do. So end result: 2-0 to mem barriers. Volatile is just pointless, offering half of what you need, and nothing you don't also get with other primitives.
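
For readers on a C++11 or later compiler (an assumption; this thread predates it), the "flag set before the data it guards" problem is what `std::atomic` with release/acquire ordering addresses. A minimal sketch, with `Mailbox`, `publish` and `wait_and_read` as invented names:

```cpp
#include <atomic>
#include <thread>

struct Mailbox {
    int payload = 0;                 // ordinary, non-atomic data
    std::atomic<bool> ready{false};  // flag that publishes the payload
};

void publish(Mailbox& box, int value) {
    box.payload = value;
    // Release: the payload write above cannot be reordered after this store.
    box.ready.store(true, std::memory_order_release);
}

int wait_and_read(Mailbox& box) {
    // Acquire: once this load sees true, the payload write is visible too.
    while (!box.ready.load(std::memory_order_acquire)) {
        std::this_thread::yield();
    }
    return box.payload;
}

int demo_publish() {
    Mailbox box;
    int seen = 0;
    std::thread reader([&box, &seen] { seen = wait_and_read(box); });
    std::thread writer([&box] { publish(box, 42); });
    writer.join();
    reader.join();
    return seen;
}
```

A plain `volatile bool` in place of the atomic would give neither the atomicity nor the ordering guarantee that this relies on.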

jalf 2010-01-22 16:54:11

Given that the question came from a beginner in multi-threaded programming, I think volatile should be completely avoided. Even if it is appropriate for certain situations (I have my doubts), a beginner isn't going to be able to tell when those situations occur.

KeithB 2010-01-22 17:37:44

+1 A:

You should have an understanding of basic systems programming, in particular:

Synchronous vs Asynchronous I/O (blocking vs. non-blocking)
Synchronization mechanisms, such as lock and mutex constructs
Thread management on your target platform

Dr. Watson 2010-01-22 15:15:15

why does this guy get +1 and I get -4 when he is basically saying Locks in more words??

WACM161 2010-01-22 21:14:10

@WACM161: *because* he's saying "locks in more words". Because saying "locks" is not helpful, and conveys zero information to someone who's not already familiar with locks. This answer says that you should have an understanding of locks as well as the other listed threading primitives. Yours didn't even say what it is you're supposed to do about locks. From reading your post, it's not clear whether the OP is supposed to understand locks, use locks or just shout "LOCKS!!!" while coding.

jalf 2010-01-22 21:42:49

+7 A:

One thing I've found very useful is to make the application configurable with regard to the actual number of threads it uses for various tasks. For example, if you have multiple threads accessing a database, make the number of those threads be configurable via a command line parameter. This is extremely handy when debugging - you can exclude threading issues by setting the number to 1, or force them by setting it to a high number. It's also very handy when working out what the optimal number of threads is.
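
A sketch of how the parameter might be read. The function name and the upper bound of 1024 are arbitrary choices for the example, and `std::thread::hardware_concurrency()` (C++11) is used as the default when the argument is missing or invalid.

```cpp
#include <cstdlib>
#include <thread>

// Turns a command-line value such as "1" or "16" into a thread count.
// Falls back to the hardware thread count (or 1 if that is unknown)
// when the argument is absent, malformed, or out of range.
unsigned thread_count_from_arg(const char* arg) {
    unsigned fallback = std::thread::hardware_concurrency();
    if (fallback == 0) fallback = 1;  // the standard allows 0 for "unknown"
    if (arg == nullptr) return fallback;
    char* end = nullptr;
    const long n = std::strtol(arg, &end, 10);
    if (end == arg || *end != '\0' || n < 1 || n > 1024) return fallback;
    return static_cast<unsigned>(n);
}
```

In `main` this could be called as `thread_count_from_arg(argc > 1 ? argv[1] : nullptr)`; passing `1` then gives the single-threaded run that rules threading issues in or out.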

anon 2010-01-22 15:19:48

+6 A:

Make sure you test your code in a single-cpu system and a multi-cpu system.

Based on the comments:

Single socket, single core
Single socket, two cores
Single socket, more than two cores
Two sockets, single core each
Two sockets, combination of single, dual and multi core cpus
Multiple sockets, combination of single, dual and multi core cpus

The limiting factor here is going to be cost. Ideally, concentrate on the types of system your code is going to run on.

Skizz 2010-01-22 15:21:21

Ideally, different numbers of CPUs. Race conditions are hard to find, and running a variety of tasks on a variety of environments could help find them.

David Thornley 2010-01-22 16:09:52

Ideally, test on > 2 CPUs. For some reason, 2 is a 'stable' number in math and things start getting funky > 2.

Paul Nathan 2010-01-22 16:37:08

Also note that multi-CPU systems might reveal race conditions that'd never occur on a single CPU, multiple-core system. The added latency in communication between cores can throw things upside down.

jalf 2010-01-22 18:48:39

A colleague recently discovered a race condition in our code just by running some stuff, which we had believed to be completely stable and reliable on our old 8-core Core2 systems, on a new 8-core i7 box. The change in execution time exposed the race.

timday 2010-01-22 21:10:11

Excellent advice, test on both single processor and SMP, then various systems at that. Virtual machines can help a lot here.

Chris O 2010-01-22 22:36:03

+8 A:

Never assume that external APIs are threadsafe. If it is not explicitly stated in their docs, do not call them concurrently from multiple threads. Instead, limit your use of them to a single thread or use a mutex to prevent concurrent calls (this is rather similar to the aforementioned GUI libraries).
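
The mutex option can be sketched as a thin wrapper that every caller goes through. `legacy::next_id` below is a made-up stand-in for an external function with hidden static state; the C++11 standard library is assumed.

```cpp
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical third-party function that is NOT documented as thread-safe:
// it mutates hidden static state without any synchronization.
namespace legacy {
inline int next_id() {
    static int counter = 0;
    return ++counter;
}
}  // namespace legacy

// Every call into the library goes through this wrapper, so at most one
// thread is inside the library at any time.
int safe_next_id() {
    static std::mutex library_mutex;  // initialization is thread-safe since C++11
    std::lock_guard<std::mutex> lock(library_mutex);
    return legacy::next_id();
}

// Four threads request 1000 ids each through the wrapper; the next id
// handed out afterwards is returned.
int demo_wrapper() {
    std::vector<std::thread> threads;
    for (int t = 0; t < 4; ++t)
        threads.emplace_back([] { for (int i = 0; i < 1000; ++i) safe_next_id(); });
    for (auto& th : threads) th.join();
    return safe_next_id();
}
```

This only works if nobody bypasses the wrapper and calls the library directly.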

Next point is language-related. Remember, C++ has (currently) no well-defined approach to threading. The compiler/optimizer does not know if code might be called concurrently. The volatile keyword is useful to prevent certain optimizations (i.e. caching of memory fields in CPU registers) in multi-threaded contexts, but it is no synchronization mechanism.

I'd recommend boost for synchronization primitives. Don't mess with platform APIs. They make your code difficult to port because they have similar functionality on all major platforms, but slightly different detail behaviour. Boost solves these problems by exposing only common functionality to the user.

Furthermore, if there's even the smallest chance that a data structure could be written to by two threads at the same time, use a synchronization primitive to protect it. Even if you think it will only happen once in a million years.

Alexander Gessler 2010-01-22 15:50:56

+3 A:

In addition to the other things mentioned, you should learn about asynchronous message queues. They can elegantly solve the problems of data sharing and event handling. This approach works well when you have concurrent state machines that need to communicate with each other.
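
A home-brewed queue of this kind can be quite small. The following is a sketch using a mutex and a condition variable from the C++11 standard library, not a lock-free design; `MessageQueue` and the negative "stop" message are conventions invented for the example.

```cpp
#include <condition_variable>
#include <mutex>
#include <queue>
#include <thread>
#include <utility>

template <typename T>
class MessageQueue {
public:
    // Never blocks for long: enqueue and wake one waiting receiver.
    void send(T msg) {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            queue_.push(std::move(msg));
        }
        ready_.notify_one();
    }

    // Blocks until a message is available, then returns it.
    T receive() {
        std::unique_lock<std::mutex> lock(mutex_);
        ready_.wait(lock, [this] { return !queue_.empty(); });
        T msg = std::move(queue_.front());
        queue_.pop();
        return msg;
    }

private:
    std::mutex mutex_;
    std::condition_variable ready_;
    std::queue<T> queue_;
};

int demo_message_queue() {
    MessageQueue<int> inbox;
    int sum = 0;  // touched only by the consumer thread until it is joined
    std::thread consumer([&inbox, &sum] {
        for (;;) {
            const int msg = inbox.receive();
            if (msg < 0) break;  // negative value = hypothetical "stop" message
            sum += msg;
        }
    });
    for (int i = 1; i <= 100; ++i) inbox.send(i);
    inbox.send(-1);
    consumer.join();
    return sum;
}
```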

I'm not aware of any message passing frameworks tailored to work only at the thread level. I've only seen home-brewed solutions. Please comment if you know of any existing ones.

EDIT:

One could use the lock-free queues from Intel's TBB, either as-is, or as the basis for a more general message-passing queue.

Emile Cormier 2010-01-22 16:06:24

+8 A:

I am exactly in this situation: I wrote a library with a global lock (many threads, but only one running at a time in the library) and am refactoring it to support concurrency.

I have read books on the subject but what I learned stands in a few points:

1) think parallel: imagine a crowd passing through the code. What happens when a method is called while already in action?
2) think shared: imagine many people trying to read and alter shared resources at the same time.
3) design: avoid the problems that points 1 and 2 can raise.
4) never think you can ignore edge cases, they will bite you hard.

Since you cannot proof-test a concurrent design (because thread execution interleaving is not reproducible), you have to ensure that your design is robust by carefully analyzing the code paths and documenting how the code is supposed to be used.

Once you understand how and where you should bottleneck your code, you can read the documentation on the tools used for this job:

Mutex (exclusive access to a resource)
Scoped Locks (good pattern to lock/unlock a Mutex)
Semaphores (passing information between threads)
ReadWrite Mutex (many readers, exclusive access on write)
Signals (how to 'kill' a thread or send it an interrupt signal, how to catch these)
Parallel design patterns: boss/worker, producer/consumer, etc (see schmidt)
platform specific tools: openMP, C blocks, etc
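
To illustrate one item from the list, the ReadWrite mutex: a sketch using `std::shared_timed_mutex` and `std::shared_lock` from C++14 (C++17 adds the lighter `std::shared_mutex`). This assumes a C++14 compiler; `Settings` is an invented class.

```cpp
#include <atomic>
#include <map>
#include <mutex>
#include <shared_mutex>
#include <string>
#include <thread>
#include <vector>

class Settings {
public:
    void set(const std::string& key, int value) {
        std::unique_lock<std::shared_timed_mutex> lock(mutex_);  // exclusive: one writer
        values_[key] = value;
    }

    int get(const std::string& key, int fallback) const {
        std::shared_lock<std::shared_timed_mutex> lock(mutex_);  // shared: many readers
        const auto it = values_.find(key);
        return it == values_.end() ? fallback : it->second;
    }

private:
    mutable std::shared_timed_mutex mutex_;
    std::map<std::string, int> values_;
};

// Readers run while a writer updates the value; every read must observe
// either the old value (3) or the new one (5), never anything else.
int demo_settings() {
    Settings s;
    s.set("retries", 3);
    std::atomic<int> bad_reads(0);
    std::vector<std::thread> readers;
    for (int t = 0; t < 4; ++t)
        readers.emplace_back([&s, &bad_reads] {
            for (int i = 0; i < 1000; ++i) {
                const int v = s.get("retries", -1);
                if (v != 3 && v != 5) ++bad_reads;
            }
        });
    std::thread writer([&s] { s.set("retries", 5); });
    writer.join();
    for (auto& r : readers) r.join();
    return bad_reads.load() == 0 ? s.get("retries", -1) : -1;
}
```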

Good luck ! Concurrency is fun, just take your time...

Gaspard Bucher 2010-01-22 16:15:50

Concurrency fun!!?? Man, you are brave :-)

Ariel 2010-01-22 17:56:34

@Ariel: it's fun IF you accept to slow down, think and become creative. Like every difficult task, it's fun if you give yourself the time it needs to do things right.

Gaspard Bucher 2010-01-22 20:34:01

I agree. I find writing MT code much more interesting than single threaded.

anon 2010-01-22 21:07:09

Part of my graduate study area relates to parallelism.

I read this book and found it a good summary of approaches at the design level.

At the basic technical level, you have 2 basic options: threads or message passing. Threaded applications are the easiest to get off the ground, since pthreads, windows threads or boost threads are ready to go. However, threading brings with it the complexity of shared memory.

Message-passing usability seems mostly limited at this point to the MPI API. It sets up an environment where you can run jobs and partition your program between processors. It's more for supercomputer/cluster environments where there's no intrinsic shared memory. You can achieve similar results with sockets and so forth.

At another level, you can use language type pragmas: the popular one today is OpenMP. I've not used it, but it appears to build threads in via preprocessing or a link-time library.
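
For a flavour of what the pragma style looks like, here is a small sketch. OpenMP is enabled by a compiler switch (for example `-fopenmp` on GCC); without it the pragma is ignored and the loop simply runs serially, producing the same result. The function name is made up.

```cpp
// Sums the squares of 1..n. With OpenMP enabled, the loop iterations are
// divided among threads and the per-thread partial sums are combined by the
// reduction clause. Without OpenMP, this is an ordinary serial loop.
long long sum_of_squares(int n) {
    long long total = 0;
    #pragma omp parallel for reduction(+ : total)
    for (int i = 1; i <= n; ++i) {
        total += static_cast<long long>(i) * i;
    }
    return total;
}
```

The `reduction` clause is what keeps `total` from becoming a data race: each thread works on a private copy.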

The classic problem here is synchronization; all the problems in multiprogramming come from the non-deterministic nature of multiprograms, which cannot be avoided.

See the Lamport timing methods for a further discussion of synchronizations and timing.

Multithreading is not something that only Ph.D.s and gurus can do, but you will have to be pretty decent to do it without making insane bugs.

Paul Nathan 2010-01-22 16:32:08

+1 A:

Make sure to explicitly know what objects are shared and how they are shared.

As much as possible make your functions purely functional. That is they have inputs and outputs and no side effects. This makes it much simpler to reason about your code. With a simpler program it isn't such a big deal but as the complexity rises it will become essential. Side effects are what lead to thread-safety issues.
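
A sketch of why this helps: a function with no side effects can be handed to several threads with no locking at all. This uses `std::async` from C++11; `collatz_steps` is just an arbitrary pure function picked for the example.

```cpp
#include <future>
#include <vector>

// Pure: the result depends only on the argument and no shared state is
// read or written. Requires n >= 1.
int collatz_steps(long long n) {
    int steps = 0;
    while (n != 1) {
        n = (n % 2 == 0) ? n / 2 : 3 * n + 1;
        ++steps;
    }
    return steps;
}

// Runs the pure function on every input concurrently. No mutex is needed
// because the tasks have nothing in common except read-only inputs.
std::vector<int> parallel_steps(const std::vector<long long>& inputs) {
    std::vector<std::future<int>> futures;
    for (const long long n : inputs)
        futures.push_back(std::async(std::launch::async, collatz_steps, n));
    std::vector<int> results;
    for (auto& f : futures) results.push_back(f.get());  // get() waits for each task
    return results;
}
```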

Play devil's advocate with your code. Look at some code and think: how could I break this with some well-timed thread interleaving? At some point this case will happen.

First learn thread-safety. Once you get that nailed down then you move onto the hard part: Concurrent performance. This is where moving away from global locks is essential. Figuring out ways to minimize and remove locks while still maintaining the thread-safety is hard.

Matt Price 2010-01-22 16:57:09

+1 A:

Stay away from MFC and its multithreading + messaging library.
In fact if you see MFC and threads coming toward you - run for the hills (*)

(*) Unless of course if MFC is coming FROM the hills - in which case run AWAY from the hills.

Martin Beckett 2010-01-22 17:01:33

Before giving any advice on dos and don'ts of multi-threaded programming in C++, I would like to ask a question: is there any particular reason you want to write the application in C++?

There are other programming paradigms where you utilize the multi-cores without getting into multi-threaded programming. One such paradigm is functional programming. Write each piece of your code as functions without any side effects. Then it is easy to run it in multiple threads without worrying about synchronization.

I am using Erlang for my development. It has increased my productivity by at least 50%. The code may not run as fast as code written in C++, but I have noticed that for most back-end offline data processing, speed is not as important as distribution of work and utilizing the hardware as much as possible. Erlang provides a simple concurrency model where you can execute a single function in multiple threads without worrying about synchronization issues. Writing multi-threaded code is easy, but debugging it is time consuming. I have done multi-threaded programming in C++, but I am currently happy with the Erlang concurrency model. It is worth looking into.

Yogish Baliga 2010-01-22 17:46:16

This is a question answering site, but you're doing it wrong.

ergosys 2010-01-22 21:44:06

Sometimes it is better to ask questions before answering a question.

Yogish Baliga 2010-03-13 05:13:58

+3 A:

Since you are a beginner, start simple. First make it work correctly, then worry about optimizations. I've seen people try to optimize by increasing the concurrency of a particular section of code (often using dubious tricks), without ever looking to see if there was any contention in the first place.

Second, you want to be able to work at as high a level as you can. Don't work at the level of locks and mutexes if you can use an existing master-worker queue. Intel's TBB looks promising, being slightly higher level than pure threads.

Third, multi-threaded programming is hard. Reduce the areas of your code where you have to think about it as much as possible. If you can write a class such that objects of that class are only ever operated on in a single thread, and there is no static data, it greatly reduces the things that you have to worry about in the class.

KeithB 2010-01-22 17:47:50

I found viewing the introductory lectures on OS and systems programming here by John Kubiatowicz at Berkeley useful.

Anshul 2010-01-22 19:05:49

+3 A:

A few of the answers have touched on this, but I wanted to emphasize one point: If you can, make sure that as much of your data as possible is only accessible from one thread at a time. Message queues are a very useful construct to use for this.

I haven't had to write much heavily-threaded code in C++, but in general, the producer-consumer pattern can be very helpful in utilizing multiple threads efficiently, while avoiding the race conditions associated with concurrent access.
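
For reference, a producer-consumer setup with a bounded buffer might look roughly like this. It is a sketch on top of the C++11 standard library; `BoundedBuffer`, its capacity of 8 and the `close()` convention are choices made for the example. The bound gives back-pressure: a fast producer blocks instead of filling memory.

```cpp
#include <condition_variable>
#include <cstddef>
#include <deque>
#include <mutex>
#include <thread>

class BoundedBuffer {
public:
    explicit BoundedBuffer(std::size_t capacity)
        : capacity_(capacity == 0 ? 1 : capacity) {}

    // Blocks while the buffer is full.
    void push(int item) {
        std::unique_lock<std::mutex> lock(mutex_);
        not_full_.wait(lock, [this] { return items_.size() < capacity_; });
        items_.push_back(item);
        not_empty_.notify_one();
    }

    // Signals that no more items will be pushed.
    void close() {
        std::lock_guard<std::mutex> lock(mutex_);
        closed_ = true;
        not_empty_.notify_all();
    }

    // Blocks while the buffer is empty and still open.
    // Returns false once the buffer is closed and fully drained.
    bool pop(int& out) {
        std::unique_lock<std::mutex> lock(mutex_);
        not_empty_.wait(lock, [this] { return closed_ || !items_.empty(); });
        if (items_.empty()) return false;
        out = items_.front();
        items_.pop_front();
        not_full_.notify_one();
        return true;
    }

private:
    const std::size_t capacity_;
    std::mutex mutex_;
    std::condition_variable not_full_;
    std::condition_variable not_empty_;
    std::deque<int> items_;
    bool closed_ = false;
};

long demo_producer_consumer() {
    BoundedBuffer buffer(8);
    long total = 0;  // touched only by the consumer until it is joined
    std::thread consumer([&buffer, &total] {
        int item = 0;
        while (buffer.pop(item)) total += item;
    });
    std::thread producer([&buffer] {
        for (int i = 1; i <= 1000; ++i) buffer.push(i);
        buffer.close();
    });
    producer.join();
    consumer.join();
    return total;
}
```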

If you can use someone else's already-debugged code to handle thread interaction, you're in good shape. As a beginner, there is a temptation to do things in an ad-hoc fashion - to use a "volatile" variable to synchronize between two pieces of code, for example. Avoid that as much as possible. It's very difficult to write code that's bulletproof in the presence of contending threads, so find some code you can trust, and minimize your use of the low-level primitives as much as you can.

Mark Bessey 2010-01-22 20:08:53

+1 for producer-consumer. It combines data sharing and synchronization into one elegant solution. It works very well when the application follows the data-flow paradigm.

Emile Cormier 2010-01-26 18:27:05

+3 A:

My top tips for threading newbies:

If you possibly can, use a task-based parallelism library, Intel's TBB being the most obvious one. This insulates you from the grungy, tricky details and is more efficient than anything you'll cobble together yourself. The main downside is this model doesn't support all uses of multithreading; it's great for exploiting multicores for compute power, less good if you wanted threads for waiting on blocking I/O.
Know how to abort threads (or in the case of TBB, how to make tasks complete early when you decide you didn't want the results after all). Newbies seem to be drawn to thread kill functions like moths to a flame. Don't do it... Herb Sutter has a great short article on this.
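
The usual alternative to a kill function is cooperative cancellation: the thread checks a flag and returns on its own, so destructors and unlocks run normally. A minimal sketch with C++11 `std::atomic` and `std::thread`; the `Worker` class and its 1 ms work step are invented for the example.

```cpp
#include <atomic>
#include <chrono>
#include <thread>

class Worker {
public:
    Worker() : stop_(false), iterations_(0), thread_([this] { run(); }) {}
    ~Worker() { stop(); }

    // Asks the thread to finish and waits for it. Safe to call repeatedly.
    void stop() {
        stop_.store(true);
        if (thread_.joinable()) thread_.join();
    }

    long iterations() const { return iterations_.load(); }

private:
    void run() {
        while (!stop_.load()) {
            ++iterations_;  // stand-in for one unit of real work
            std::this_thread::sleep_for(std::chrono::milliseconds(1));
        }
        // The thread exits by returning, so cleanup code here still runs.
    }

    std::atomic<bool> stop_;
    std::atomic<long> iterations_;
    std::thread thread_;  // declared last so both flags exist before the thread starts
};
```

The cost is latency: the thread only notices the request between work steps, so long blocking calls inside the loop need their own timeout or wake-up mechanism.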

timday 2010-01-22 21:00:28

Make sure you know what volatile means and its uses (which may not be obvious at first).

Also, when designing multithreaded code, it helps to imagine that an infinite number of processors is executing every single line of code in your application at once (er, every single line of code that is possible according to the logic in your code). Also imagine that, for everything that isn't marked volatile, the compiler performs a special optimization so that only the thread that changed it can read or set its true value, and all the other threads get garbage.

Earlz 2010-01-22 22:03:55

Way to mislead the OP. `volatile` has *nothing* to do with multithreading. It is not intended for multithreading, and it has no properties relevant to multithreading.

jalf 2010-01-23 01:55:22

+1 A:

Keep things dead simple as much as possible. It's better to have a simpler design (maintenance, fewer bugs) than a more complex solution that might have slightly better CPU utilization.

Avoid sharing state between threads as much as possible, this reduces the number of places that must use synchronization.

Avoid false-sharing at all costs (google this term).
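
In short, false sharing is when threads write to different variables that happen to sit on the same cache line, so the cores keep invalidating each other's caches. One common remedy is to pad per-thread data out to a full line. The sketch below assumes a 64-byte cache line, which is typical on x86 but should be checked for your target; `PaddedCounter` is an invented name.

```cpp
#include <atomic>
#include <thread>

// Each counter is aligned (and therefore padded) to 64 bytes, so no two
// counters can share a cache line. 64 is an assumption about the hardware.
struct alignas(64) PaddedCounter {
    std::atomic<long> value{0};
};

// Four threads each increment their own counter `per_thread` times.
long count_in_parallel(long per_thread) {
    PaddedCounter counters[4];
    std::thread threads[4];
    for (int t = 0; t < 4; ++t) {
        threads[t] = std::thread([&counters, t, per_thread] {
            for (long i = 0; i < per_thread; ++i)
                counters[t].value.fetch_add(1, std::memory_order_relaxed);
        });
    }
    long total = 0;
    for (int t = 0; t < 4; ++t) {
        threads[t].join();
        total += counters[t].value.load();
    }
    return total;
}
```

Removing `alignas(64)` leaves the result unchanged but can make the loop noticeably slower on a multi-core machine, which is what makes false sharing easy to miss.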

Use a thread pool so you're not frequently creating/destroying threads (that's expensive and slow).
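
A bare-bones pool, as a sketch only: a fixed set of threads pulls `std::function` tasks from a shared queue. Production pools (TBB, for instance) add work stealing, futures and exception handling, none of which is shown here. `ThreadPool` and `submit` are names chosen for the example; C++11 is assumed.

```cpp
#include <atomic>
#include <condition_variable>
#include <functional>
#include <mutex>
#include <queue>
#include <thread>
#include <utility>
#include <vector>

class ThreadPool {
public:
    explicit ThreadPool(unsigned n) {
        if (n == 0) n = 1;
        for (unsigned i = 0; i < n; ++i)
            workers_.emplace_back([this] { work(); });
    }

    // Finishes all queued tasks, then joins the threads.
    ~ThreadPool() {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            stopping_ = true;
        }
        wake_.notify_all();
        for (auto& w : workers_) w.join();
    }

    void submit(std::function<void()> task) {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            tasks_.push(std::move(task));
        }
        wake_.notify_one();
    }

private:
    void work() {
        for (;;) {
            std::function<void()> task;
            {
                std::unique_lock<std::mutex> lock(mutex_);
                wake_.wait(lock, [this] { return stopping_ || !tasks_.empty(); });
                if (tasks_.empty()) return;  // only reachable once stopping_ is set
                task = std::move(tasks_.front());
                tasks_.pop();
            }
            task();  // run outside the lock so other workers can dequeue
        }
    }

    std::mutex mutex_;
    std::condition_variable wake_;
    std::queue<std::function<void()>> tasks_;
    bool stopping_ = false;
    std::vector<std::thread> workers_;
};

// Three threads are created once and reused for 100 small tasks.
int demo_thread_pool() {
    std::atomic<int> done(0);
    {
        ThreadPool pool(3);
        for (int i = 0; i < 100; ++i) pool.submit([&done] { ++done; });
    }  // destructor drains the queue and joins
    return done.load();
}
```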

Consider using OpenMP, Intel and Microsoft (possibly others) support this extension to C++.

If you are doing number crunching, consider using Intel IPP, which internally uses optimized SIMD functions (this isn't really multi-threading, but is parallelism of a related sort).

Have tons of fun.

Chris O 2010-01-22 22:34:30

+1. Just wanted to write about thread pools! Some years ago, I found that you can create only limited number of threads per process in win32 system. I mean not currently active threads, but threads with empty body. So for fault-tolerant pretending systems thread pools are even essential. Besides, thread pool is easier to debug…

Eugene 2010-01-23 06:38:41

+2 A:

The biggest "mindset" difference between single-threaded and multi-threaded programming in my opinion is in testing/verification. In single-threaded programming, people will often bash out some half-thought-out code, run it, and if it seems to work, they'll call it good, and often get away with it using it in a production environment.

In multithreaded programming, on the other hand, the program's behavior is non-deterministic, because the exact combination of timing of which threads are running for which periods of time (relative to each other) will be different every time the program runs. So just running a multithreaded program a few times (or even a few million times) and saying "it didn't crash for me, ship it!" is entirely inadequate.

Instead, when doing a multithreaded program, you always should be trying to prove (at least to your own satisfaction) that not only does the program work, but that there is no way it could possibly not work. This is much harder, because instead of verifying a single code-path, you are effectively trying to verify a near-infinite number of possible code-paths.

The only realistic way to do that without having your brain explode is to keep things as bone-headedly simple as you can possibly make them. If you can avoid using multithreading totally, do that. If you must do multithreading, share as little data between threads as possible, use proper multithreading primitives (e.g. mutexes, thread-safe message queues, wait conditions), and don't try to get away with half-measures (e.g. trying to synchronize access to a shared piece of data using only boolean flags will never work reliably, so don't try it).

What you want to avoid is the multithreading hell scenario: the multithreaded program that runs happily for weeks on end on your test machine, but crashes randomly, about once a year, at the customer's site. That kind of race-condition bug can be nearly impossible to reproduce, and the only way to avoid it is to design your code extremely carefully to guarantee it can't happen.

Threads are strong juju. Use them sparingly.

Jeremy Friesner 2010-01-23 05:29:06

I'm in the same boat as you, I am just starting multi threading for the first time as part of a project and I've been looking around the net for resources. I found this blog to be very informative. Part 1 is pthreads, but I linked starting on the boost section.

Flamewires 2010-05-15 21:50:45
