ansaurus

Question

seg faults due to multithreading (using boost libraries)

Answer 1

+2 A:

Segfaults can appear to arise from other, well-debugged libraries (or even from the standard library!) if you corrupt the heap or the free store, for example by double-freeing (or double-deleting) a pointer, accessing a pointer that was already freed (or deleted), freeing (or deleting) a pointer that was not allocated, using delete where you should have used delete[] or vice versa, etc.

This crash will often happen in a completely different place and at a totally different time from when and where the error occurred. If you have variables other than the matrices shared among multiple threads, and you have a race condition that, for example, causes you to double-delete the shared variable, this could corrupt the free store and later cause a crash inside of the boost matrix code.

You should run your code through a tool like valgrind to try to track down the heap/free store corruption.
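To make the double-delete race described above concrete, here is a minimal sketch. It assumes C++11 std::thread and std::mutex rather than the Boost equivalents, and SharedBuffer, release and release_from_threads are hypothetical names made up for illustration, not anything from the asker's code. The unsafe variant is shown only as a comment; the function that actually runs does the null check and the delete under a single lock, so only one thread can free the buffer.

```cpp
#include <cassert>
#include <functional>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical shared state, invented for this sketch.
struct SharedBuffer {
    std::mutex lock;       // guards data and delete_count
    int* data = nullptr;   // raw owning pointer, used only to show the hazard
    int delete_count = 0;  // how many times delete[] actually ran
};

// Unsafe pattern (shown as a comment, never executed here): two threads can
// both observe a non-null pointer and both call delete[], which corrupts
// the free store and may crash much later, somewhere unrelated.
//
//   if (buf.data) { delete[] buf.data; buf.data = nullptr; }

// Safe pattern: the check and the delete happen under one lock.
void release(SharedBuffer& buf) {
    std::lock_guard<std::mutex> guard(buf.lock);
    if (buf.data != nullptr) {
        delete[] buf.data;
        buf.data = nullptr;
        ++buf.delete_count;
    }
}

// Starts n threads that all try to release the same buffer and returns
// how many deletes were actually performed.
int release_from_threads(int n) {
    SharedBuffer buf;
    buf.data = new int[16]();
    std::vector<std::thread> workers;
    for (int i = 0; i < n; ++i) {
        workers.emplace_back(release, std::ref(buf));
    }
    for (auto& t : workers) {
        t.join();
    }
    assert(buf.data == nullptr);  // exactly one thread freed the buffer
    return buf.delete_count;
}
```

Calling release_from_threads(4) should perform a single delete no matter how the threads interleave. With the unguarded pattern from the comment, the same call could delete the buffer more than once, and that is the kind of error valgrind is good at pointing out.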

Tyler McHenry 2009-08-29 20:13:53

Thanks for the suggestions. We are using boost's shared_ptr (smart pointer) so I'm fairly confident that it isn't an issue of deleting memory incorrectly. (There are no delete statements in the threaded code.) We only have one object shared among threads and we only access it at the beginning and end of threads to copy data out and write data back in. We are using locks to prevent simultaneous access between threads. Is there something I'm not considering here?

scandido 2009-08-29 20:34:40

I've never used valgrind before. Can you suggest a good tutorial for getting started aside from the documentation on their site?

scandido 2009-08-29 20:37:24

valgrind is pretty simple to use; it's understanding the output that's sometimes a challenge. All you really need to do to get started is compile your program with debugging symbols and then run: valgrind --tool=memcheck ./yourapp. For each memory access error, it will report where the error occurred and what sort of error it was, and then give you one or two abbreviated stack traces that show where the incompatible operations occurred. Unfortunately, as far as I know, there is no better resource than their official docs. Take a look at the memcheck section, since that's what you need.
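If you want something small to practice on before pointing memcheck at the real application, a sketch like the following might help. sum_first_n is a hypothetical helper invented for this example. As written it is correct; the deliberate bugs are left as comments so you can swap one in, rebuild with debugging symbols (for example with g++ -g), and see what memcheck reports for it.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper: fills a heap array with 0..n-1 and returns the sum.
long sum_first_n(std::size_t n) {
    assert(n > 0);  // the sketch only handles non-empty arrays
    int* values = new int[n];
    for (std::size_t i = 0; i < n; ++i) {
        values[i] = static_cast<int>(i);
    }
    long total = 0;
    for (std::size_t i = 0; i < n; ++i) {
        total += values[i];
    }

    // Deliberate bugs to try one at a time under memcheck. Replace the
    // correct delete[] below with ONE of these lines:
    //   delete values;                     // wrong form for memory from new[]
    //   delete[] values; delete[] values;  // double delete
    delete[] values;  // correct: matches new[]
    return total;
}
```

Call sum_first_n from a small main of your own, build it, and run it as valgrind --tool=memcheck ./yourapp. The correct version should come back clean; each of the commented-out variants should produce a report with a stack trace pointing at the offending line.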

Tyler McHenry 2009-08-29 20:42:24

It seems you are correct. After removing all the bad stuff that valgrind brought to my attention, the seg fault seems to have disappeared. Thanks for your help!

scandido 2009-08-30 22:20:03
