ansaurus

Question

Answer 1

+7 A:

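waitpid() can be interrupted by a signal, so you could set a timer before calling it; when the timer signal is raised, waitpid() returns -1 with errno set to EINTR. Edit: It should be as simple as calling alarm(5) before calling waitpid(), provided a SIGALRM handler is installed without SA_RESTART (the default action for SIGALRM terminates the process).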

Steve Baker 2008-11-11 21:29:59

What determines which thread handles a signal? How will I be sure that this is the thread that handles it? Is it that alarm was called in some thread, so that thread handles the signal?

Greg Rogers 2008-11-11 21:39:16

The man page for signal seems to say that the result is unspecified, which means that it may not be handled by the right thread and lead to incorrect results.

Greg Rogers 2008-11-11 21:44:46

It is probably a good idea to have just one thread that receives signals, and to make sure all other threads mask the signal with sigprocmask or similar.

MarkR 2008-11-11 22:16:37

Note to anyone reading the above comment: use pthread_sigmask, not sigprocmask.

Greg Rogers 2008-11-11 22:41:39

Don't actually do this. You can lose children if waitpid() reaps the child but SIGALRM fires before the kernel returns. Many unixes have bugs here as well, and don't EINTR correctly even in the ideal case.

geocar 2008-12-12 02:52:24

Answer 2

+1 A:

I can use a signal handler for SIGCHLD, and in the signal handler do whatever I was going to do when a child exits, or send a message to a different thread to do some action. But using a signal handler obfuscates the flow of the code a little bit.

In order to avoid race conditions you should avoid doing anything more complex than changing a volatile flag in a signal handler.

I think the best option in your case is to send a signal to the parent. waitpid() will then return -1 with errno set to EINTR. At this point you check waitpid()'s return value and errno, notice you have been sent a signal, and take appropriate action.

Krunch 2008-11-11 21:35:32

Well, you can do the self-pipe trick, and have the waitpid thread block in select() on a pipe instead. Then, when SIGCHLD arrives, have the handler write a byte to the pipe, which wakes the thread up.

wnoise 2008-11-12 16:43:01

Answer 3

+2 A:

If you're going to use signals anyway (as per Steve's suggestion), you can just send the signal manually when you want to exit. This will cause waitpid() to fail with EINTR, and the thread can then exit. No need for a periodic alarm/restart.

Chris Dodd 2008-11-12 00:05:05

Answer 4

A:

Just off the top of my head...

[deleted]

Well, my 'off the top of my head' answer should have stayed where it was. Move along...

shank 2008-11-12 16:38:56

Answer 5

+4 A:

Don't mix alarm() with wait(). You can lose error information that way.

Use the self-pipe trick. This turns any signal into a select()able event:

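#include <errno.h>
#include <fcntl.h>
#include <signal.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

int selfpipe[2];

/* SIGCHLD handler: wake up select() by writing one byte to the pipe. */
void selfpipe_sigh(int n)
{
    int saved_errno = errno;    /* write() may clobber errno in the interrupted code */
    (void)n;
    (void)write(selfpipe[1], "", 1);
    errno = saved_errno;
}

void selfpipe_setup(void)
{
    static struct sigaction act;
    if (pipe(selfpipe) == -1) { abort(); }
    /* Both ends non-blocking: the handler must never block, nor the drain loop. */
    fcntl(selfpipe[0], F_SETFL, fcntl(selfpipe[0], F_GETFL) | O_NONBLOCK);
    fcntl(selfpipe[1], F_SETFL, fcntl(selfpipe[1], F_GETFL) | O_NONBLOCK);
    memset(&act, 0, sizeof(act));
    act.sa_handler = selfpipe_sigh;
    act.sa_flags = 0;
    sigaction(SIGCHLD, &act, NULL);
}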

Then, your waitpid-like function looks like this:

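#include <errno.h>
#include <sys/select.h>
#include <sys/wait.h>
#include <unistd.h>

int selfpipe_waitpid(void)
{
    static char dummy[4096];
    fd_set rfds;
    struct timeval tv;
    int died = 0, st, n;

    tv.tv_sec = 5;              /* timeout in seconds */
    tv.tv_usec = 0;
    FD_ZERO(&rfds);
    FD_SET(selfpipe[0], &rfds);
    n = select(selfpipe[0]+1, &rfds, NULL, NULL, &tv);
    /* A SIGCHLD that arrives during select() makes it fail with EINTR; reap then too. */
    if (n > 0 || (n == -1 && errno == EINTR)) {
        /* Drain the pipe; the read end is non-blocking, so this stops when it is empty. */
        while (read(selfpipe[0], dummy, sizeof(dummy)) > 0);
        /* Reap every exited child; waitpid() returns 0 when none is ready and -1 when none exist. */
        while (waitpid(-1, &st, WNOHANG) > 0) died++;
    }
    return died;
}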

You can see in selfpipe_waitpid() how you can control the timeout and even mix with other select()-based IO.

geocar 2008-11-14 13:08:56
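As a rough illustration of mixing the self-pipe with other select()-based I/O, the sketch below waits on two descriptors at once. The function wait_signal_or_io and the READY_* constants are hypothetical and not part of the answer; sigfd stands for the read end of the self-pipe, and iofd for any other readable descriptor such as a socket.

```c
#include <assert.h>
#include <errno.h>
#include <sys/select.h>
#include <sys/time.h>
#include <unistd.h>

#define READY_SIGNAL 1  /* the self-pipe read end is readable */
#define READY_IO     2  /* the other descriptor is readable */

/*
 * Wait up to `timeout_sec` seconds for either descriptor to become readable.
 * Returns a bitmask of READY_* values, 0 on timeout or interruption
 * (the caller should simply call again), or -1 on any other error.
 */
int wait_signal_or_io(int sigfd, int iofd, int timeout_sec)
{
    fd_set rfds;
    struct timeval tv;
    int maxfd = sigfd > iofd ? sigfd : iofd;
    int n, ready = 0;

    assert(sigfd >= 0 && iofd >= 0 && maxfd < FD_SETSIZE);
    tv.tv_sec = timeout_sec;
    tv.tv_usec = 0;
    FD_ZERO(&rfds);
    FD_SET(sigfd, &rfds);
    FD_SET(iofd, &rfds);

    n = select(maxfd + 1, &rfds, NULL, NULL, &tv);
    if (n == -1)
        return errno == EINTR ? 0 : -1;
    if (FD_ISSET(sigfd, &rfds))
        ready |= READY_SIGNAL;
    if (FD_ISSET(iofd, &rfds))
        ready |= READY_IO;
    return ready;
}
```

When READY_SIGNAL is set, the caller would drain the pipe and reap children with waitpid(-1, &st, WNOHANG); when READY_IO is set, it would service the other descriptor.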

Seems like an interesting concept. Question: why make the pipe non-blocking? And why do you need two loops after the select? Shouldn't there *always* be data when the select succeeds?

Evan Teran 2008-12-11 03:53:18

If two children die, you won't necessarily get two SIGCHLD notifications. You make the pipe non-blocking in case so many SIGCHLDs come in (roughly PIPE_BUF) that the pipe fills up; a blocking write() in the handler would then hang.

geocar 2008-12-12 02:46:58
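To see what a full pipe does to a non-blocking writer, here is a small sketch. The helpers set_nonblocking and fill_pipe are hypothetical names; the actual capacity of a pipe is system-dependent, so the sketch does not assume a particular number.

```c
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <unistd.h>

/* Set O_NONBLOCK on `fd`, keeping its other flags. Returns 0 or -1. */
int set_nonblocking(int fd)
{
    int flags = fcntl(fd, F_GETFL);

    if (flags == -1)
        return -1;
    return fcntl(fd, F_SETFL, flags | O_NONBLOCK);
}

/*
 * Write single bytes to the non-blocking descriptor `wfd` until the pipe
 * is full. Returns the number of bytes written, or -1 on an unexpected error.
 */
long fill_pipe(int wfd)
{
    long n = 0;

    assert(wfd >= 0);
    for (;;) {
        ssize_t w = write(wfd, "", 1);

        if (w == 1) {
            n++;
            continue;
        }
        if (w == -1 && errno == EINTR)
            continue;
        if (w == -1 && (errno == EAGAIN || errno == EWOULDBLOCK))
            return n;       /* pipe is full: the write fails instead of blocking */
        return -1;
    }
}
```

Once the pipe is full, further one-byte writes fail immediately. In a signal handler that is harmless: the bytes already in the pipe are enough to wake the reader.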

The loops also help to protect against too many SIGCHLDs. And while ideally there would always be data after select completes, the last read() in the draining loop would block on the empty pipe unless the read end is marked non-blocking.

geocar 2008-12-12 02:49:02


Waitpid equivalent with timeout?
