ansaurus

Question

Wait for a detached thread to finish in C++

Answer 1

+2 A:

Yes, I believe that what you're describing is happening (race condition on deallocate). One quick way to fix this is to create a static instance of Wait, one that won't get destroyed. This will work as long as you don't need to have more than one waiter at the same time.

You will also permanently use that memory, it will not deallocate. But it doesn't look like that's too bad.

The main issue is that it's hard to coordinate lifetimes of your thread communication constructs between threads: you will always need at least one leftover communication construct to communicate when it is safe to destroy (at least in languages without garbage collection, like C++).

EDIT: See comments for some ideas about refcounting with a global mutex.

Adam Goode 2009-11-15 02:58:56

Unfortunately, this is in a massively multithreaded app, and we really want separate Wait objects for each - otherwise it slows us down too much.

Tim 2009-11-15 03:18:45

Also, if we use a static Wait, there is the problem of trying to coordinate which thread needs to resume.

Tim 2009-11-15 03:28:13

Ok, you can do this. You can add a refcount field to the Wait object, protected by a global mutex. Start the refcount at 2, and then have the callback and the waiter both decrement the refcount when done. If the global mutex becomes your bottleneck, there are other more complicated solutions.

Adam Goode 2009-11-15 03:29:25

Also, you don't need the global mutex if you can use atomic operations for the refcount.

Adam Goode 2009-11-15 03:35:15

Excellent! The refcounting is actually provided by CORBA, but that solved the problem! Thanks!

Tim 2009-11-15 04:16:25

Answer 2

A:

To the best of my knowledge there's no portable way to directly ask a thread if its done running (i.e. no pthread_ function). What you are doing is the right way to do it, at least as far as having a condition that you signal. If you are seeing crashes that you are sure are due to the Wait object is being deallocated when the thread that creates it quits (and not some other subtle locking issue -- all too common), the issue is that you need to make sure the Wait isn't being deallocated, by managing from a thread other than the one that does the notification. Put it in global memory or dynamically allocate it and share it with that thread. Most simply don't have the thread being waited on own the memory for the Wait, have the thread doing the waiting own it.

quark 2009-11-15 03:00:25

Answer 3

A:

Are you initializing and destroying the mutex and condition var properly?

Wait::Wait()
{
    pthread_mutex_init(&m_mutex, NULL);
    pthread_cond_init(&m_cond, NULL);
    m_done = false;
}

Wait::~Wait()
{
    assert(m_done);
    pthread_mutex_destroy(&m_mutex);
    pthread_cond_destroy(&m_cond);
}

Make sure that you aren't prematurely destroying the Wait object -- if it gets destroyed in one thread while the other thread still needs it, you'll get a race condition that will likely result in a segfault. I'd recommend making it a global static variable that gets constructed on program initialization (before main()) and gets destroyed on program exit.

Adam Rosenfield 2009-11-15 03:03:26

yes, the mutex and cond are initialized/destroyed properly. I'm actually using wrapper classes on those that have been well tested. And yes, I'm certain that Wait is being prematurely destroyed -- while one thread is still in Wait::callback.

Tim 2009-11-15 03:30:51

Answer 4

A:

If your assumption is correct then third party module appears to be buggy and you need to come up with some kind of hack to make your application work.

Static Wait is not feasible. How about Wait pool (it even may grow on demand)? Is you application using thread pool to run? Although there will still be a chance that same Wait will be reused while third party module is still using it. But you can minimize such chance by properly queing vacant Waits in your pool.

Disclaimer: I am in no way an expert in thread safety, so consider this post as a suggestion from a layman.

BostonLogan 2009-11-15 04:35:59

ansaurus

tags:

views:

answers:

Wait for a detached thread to finish in C++

related questions