Here's my code:

#include <stdio.h>
#include <stdlib.h>
#include <string.h>     /* strsep(), strlen() */
#include <strings.h>    /* strcasecmp() */
#include <unistd.h>
#include <sys/wait.h>   /* <wait.h> is non-standard; waitpid() lives here */
#include <readline/readline.h>

#define NUMPIPES 2

int main(int argc, char *argv[]) {
    char *bBuffer, *sPtr, *aPtr = NULL, *pipeComms[NUMPIPES], *cmdArgs[10];
    int fdPipe[2], pCount, aCount, i, status, lPids[NUMPIPES];
    pid_t pid;

    pipe(fdPipe);

    while(1) {
     bBuffer = readline("Shell> ");

     if(!strcasecmp(bBuffer, "exit")) {
      return 0;
     }

     sPtr = bBuffer;
     pCount = -1;

     do {
      aPtr = strsep(&sPtr, "|");
      pipeComms[++pCount] = aPtr;
     } while(aPtr);

     for(i = 0; i < pCount; i++) {
      aCount = -1;

      do {
       aPtr = strsep(&pipeComms[i], " ");
       cmdArgs[++aCount] = aPtr;
      } while(aPtr);

      cmdArgs[aCount] = 0;

      if(strlen(cmdArgs[0]) > 0) {
       pid = fork();

       if(pid == 0) {
        if(i == 0) {
         close(fdPipe[0]);

         dup2(fdPipe[1], STDOUT_FILENO);

         close(fdPipe[1]);
        } else if(i == 1) {
         close(fdPipe[1]);

         dup2(fdPipe[0], STDIN_FILENO);

         close(fdPipe[0]);
        }

        execvp(cmdArgs[0], cmdArgs);
        exit(1);
       } else {
        lPids[i] = pid;

        /*waitpid(pid, &status, 0);

        if(WIFEXITED(status)) {
         printf("[%d] TERMINATED (Status: %d)\n",
          pid, WEXITSTATUS(status));
        }*/
       }
      }
     }

     for(i = 0; i < pCount; i++) {
      waitpid(lPids[i], &status, 0);

      if(WIFEXITED(status)) {
       printf("[%d] TERMINATED (Status: %d)\n",
        lPids[i], WEXITSTATUS(status));
      }
     }
    }

    return 0;
}

(The code was updated to reflect the changes proposed by two answers below; it still doesn't work as it should...)

Here's the test case where this fails:

nazgulled ~/Projects/SO/G08 $ ls -l
total 8
-rwxr-xr-x 1 nazgulled nazgulled  7181 2009-05-27 17:44 a.out
-rwxr-xr-x 1 nazgulled nazgulled   754 2009-05-27 01:42 data.h
-rwxr-xr-x 1 nazgulled nazgulled  1305 2009-05-27 17:50 main.c
-rwxr-xr-x 1 nazgulled nazgulled   320 2009-05-27 01:42 makefile
-rwxr-xr-x 1 nazgulled nazgulled 14408 2009-05-27 17:21 prog
-rwxr-xr-x 1 nazgulled nazgulled  9276 2009-05-27 17:21 prog.c
-rwxr-xr-x 1 nazgulled nazgulled 10496 2009-05-27 17:21 prog.o
-rwxr-xr-x 1 nazgulled nazgulled    16 2009-05-27 17:19 test
nazgulled ~/Projects/SO/G08 $ ./a.out 
Shell> ls -l|grep prog
[4804] TERMINATED (Status: 0)
-rwxr-xr-x 1 nazgulled nazgulled 14408 2009-05-27 17:21 prog
-rwxr-xr-x 1 nazgulled nazgulled  9276 2009-05-27 17:21 prog.c
-rwxr-xr-x 1 nazgulled nazgulled 10496 2009-05-27 17:21 prog.o

The problem is that I should return to my shell after that: I should see "Shell> " waiting for more input. You can also notice that there is no second message similar to "[4804] TERMINATED (Status: 0)" (with a different pid), which means the second process didn't terminate.

I think it has something to do with grep, because this works:

nazgulled ~/Projects/SO/G08 $ ./a.out 
Shell> echo q|sudo fdisk /dev/sda
[4838] TERMINATED (Status: 0)

The number of cylinders for this disk is set to 1305.
There is nothing wrong with that, but this is larger than 1024,
and could in certain setups cause problems with:
1) software that runs at boot time (e.g., old versions of LILO)
2) booting and partitioning software from other OSs
   (e.g., DOS FDISK, OS/2 FDISK)

Command (m for help): 
[4839] TERMINATED (Status: 0)

You can easily see two "terminate" messages...

So, what's wrong with my code?

A: 

I think your forked processes will continue executing.

Try either:

  • Changing it to 'return execvp'
  • Adding 'exit(1);' after execvp
Kieveli
Hmm maybe not. But try it anyway =) Maybe make it 'return execvp'
Kieveli
I don't think that will work. As far as I know, execvp() replaces the current process (the child process) with the command being executed, so if the execvp() call is successful, the child process will never reach any line of code below it.
Nazgulled
Just tried and as I suspected, doesn't work: 1) "The exec() family of functions replaces the current process image with a new process image." 2) "If any of the exec() functions returns, an error will have occurred." Quotes from man page.
Nazgulled
Yeah, that's what I was thinking... but I did a search for execvp examples, and they all seem to have an 'exit(1);' afterwards.
Kieveli
Maybe it pops it off when it's finished executing and resumes your previous process? It just executes it without spawning a new process, but resumes where it leaves off?
Kieveli
Then they don't know how exec() works lol... I also didn't and it was here on SO that someone pointed it out to me.
Nazgulled
`exec*()` never returns if it was successful, but will return if it failed. Thus an `exit(1)` following an `exec*()` is there to handle failure (for whatever reason... file not found, no permission, etc.)
ephemient
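ephemient's point can be seen in a minimal sketch (my own illustration, not code from the question; the helper name `run_command` is made up):

```c
#include <stdio.h>
#include <stdlib.h>
#include <sys/wait.h>
#include <unistd.h>

/* Fork, exec a command, and return its exit status.
 * The exit(127) after execvp() only runs when exec fails
 * (command not found, no permission, ...): on success the
 * child's image is replaced and that line is never reached. */
int run_command(char *const argv[]) {
    pid_t pid = fork();
    if (pid < 0)
        return -1;                /* fork failed */
    if (pid == 0) {
        execvp(argv[0], argv);
        perror("execvp");         /* reached only on failure */
        exit(127);                /* conventional "exec failed" status */
    }
    int status;
    waitpid(pid, &status, 0);
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

Without the `exit()`, a failed exec would fall through and the child would keep running the parent's loop, which is exactly the "forked processes will continue executing" symptom described above.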
A: 

One potential problem is that cmdArgs may have garbage at the end of it. You're supposed to terminate that array with a null pointer before passing it to execvp().

It looks like grep is accepting STDIN, though, so that might not be causing any problems (yet).

UncleO
I believe strsep() deals with that, let's say the command is "cmd arg1 arg2", cmdArgs will be: cmdArgs[0] = "cmd"; cmdArgs[1] = "arg1"; cmdArgs[2] = "arg2"; cmdArgs[3] = NULL;
Nazgulled
+3  A: 

You should have an error exit after execvp() - it will fail sometime.

exit(EXIT_FAILURE);

As @uncleo points out, the argument list must have a null pointer to indicate the end:

cmdArgs[aCount] = 0;

It is not clear to me that you let both programs run free - it appears that you require the first program in the pipeline to finish before starting the second, which is not a recipe for success if the first program blocks because the pipe is full.

Jonathan Leffler
I didn't add any type of error handling on purpose, no need to make the code even bigger for testing... Like I said above, doesn't strsep() handle that? The last argument will always be NULL because strsep() does that itself.
Nazgulled
He's right about the running free. You fork off a process and wait for it to finish before forking off the next one. Perhaps grep is stalling with a filled pipe.
UncleO
None of those suggestions solved the problem... I just tested everything and the same thing happens.
Nazgulled
@Nazgulled: I understand no error handling - but I'd still put an exit() after execvp(). Maybe you're safe enough - that makes sure. Also, your 'ls' listing is short enough that it shouldn't fill any pipe buffer - fortunately, you're testing in a small environment. I see that strsep() will give you a null - you're using do { } while ().
Jonathan Leffler
Be aware that strsep() is not available everywhere - specifically, not on Solaris 10.
Jonathan Leffler
I just updated the code in the question to reflect the changes suggested. I understand what you're saying, Jonathan; the full code for my shell handles all errors, including the ones from execvp(). This is just a test example so I can understand pipes and make them work, so I can then adapt the code to my shell. Also, strsep() is available for me, which is enough. This is just a university exercise, not a real, live program.
Nazgulled
@Nazgulled: Don't write "cheap code" just because it's an "exercise". You would much rather build a habit of writing good code than try to break a habit of writing bad code.
ephemient
+1  A: 

Jonathan has the right idea. You rely on the first process to fork all the others. Each one has to run to completion before the next one is forked.

Instead, fork the processes in a loop like you are doing, but wait for them outside the inner loop, (at the bottom of the big loop for the shell prompt).

loop //for prompt
    next prompt
    loop //to fork tasks, store the pids
        if pid == 0 run command
        else store the pid
    end loop
    loop // on pids
        wait
    end loop
end loop
UncleO
Didn't you just say that "I" was right in the comment above? It didn't feel like you were saying Jonathan was right, but rather the other way around. Maybe I misinterpreted you...
Nazgulled
Just tested it and it didn't work... The same thing happens.
Nazgulled
I just updated the code in the question to reflect the changes suggested.
Nazgulled
+2  A: 
ephemient
I see what you mean, but I'm finding it rather difficult to understand that pseudocode and everything it does. That old and new fds pipe thing is confusing me. Also, I'm not sure what you mean by the last paragraph.
Nazgulled
Ignoring the old_fds/new_fds thing, if I change your code to run pipe() within the readline loop and close(fdPipe[*]) before waiting, it works in the immediate sense that the failing pipeline in the question no longer fails. The rest is cautionary, as *other* things that might be wrong as you go further.
ephemient
You probably didn't notice it in the code, but I was only considering the case with 2 commands, where one pipe suffices. Still, I didn't know I needed more than one pipe for more than 2 commands, so that will come in handy. But for now, I'll concentrate on just two, try to understand that, and move on to the next step. I still don't quite get everything you said beyond that, and that I need to close the descriptors in the parent. I think it's better if I fix this for two commands and then post back my code; if I'm doing something wrong or missing something, hopefully you'll point it out.
Nazgulled
One last thing: I'm a little ahead of what we are supposed to do. This exercise isn't due until next week's Friday, and because I have other classes to study for, I might not post here for a few days. If you don't mind keeping an eye on this (maybe through RSS), it will be much appreciated. (P.S.: I'll probably mark this answer as the accepted one; I just want to make sure I get it and can fix my code before I do.)
Nazgulled
This is the best answer to accept - I agree. And the generalization to multi-stage pipelines will be important eventually. I believe that some shells - probably bash amongst them - fork off a single process to run the entire pipeline, and that first child eventually execs the last process in the pipeline, after fixing up the plumbing between the various other stages. The child also becomes a process group leader and other things so that job control works sanely.
Jonathan Leffler
I think I'll be asking the teacher about that, if I should spawn a new process for each command in the pipeline or just one...
Nazgulled
By the way, how does bash fork only a single process to run the entire pipeline? I can't get my head around how that is possible...
Nazgulled
So the suggestion is to close the ends of the pipe in the first process after the loop that does the forking, and before the loop that waits. These open ends are preventing the ctrl-D (EOF) that grep needs to finish. The danger ephemient mentions would occur if you were to have more than two commands, so more than one pipe. If you closed one pipe before creating the next one, the same fds would be re-used, causing problems.
UncleO
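uncleo's suggestion (fork both children, then close both pipe ends in the parent before waiting) can be sketched for the two-command case like this (an illustrative sketch of that fix, not anyone's original code; the helper name `run_pipeline` is made up):

```c
#include <sys/wait.h>
#include <unistd.h>

/* Two-stage pipeline: fork both children, then close BOTH pipe
 * ends in the parent BEFORE waiting.  While the parent keeps the
 * write end open, the second command (e.g. grep) never sees EOF
 * on stdin and never terminates. */
int run_pipeline(char *const left[], char *const right[]) {
    int fd[2];
    pid_t pids[2];
    int status = -1;

    if (pipe(fd) < 0)
        return -1;

    if ((pids[0] = fork()) == 0) {        /* left | ... */
        dup2(fd[1], STDOUT_FILENO);
        close(fd[0]);
        close(fd[1]);
        execvp(left[0], left);
        _exit(127);                       /* exec failed */
    }
    if ((pids[1] = fork()) == 0) {        /* ... | right */
        dup2(fd[0], STDIN_FILENO);
        close(fd[0]);
        close(fd[1]);
        execvp(right[0], right);
        _exit(127);                       /* exec failed */
    }

    close(fd[0]);                         /* the crucial step: the   */
    close(fd[1]);                         /* parent drops its copies */

    waitpid(pids[0], &status, 0);
    waitpid(pids[1], &status, 0);         /* status of the last stage */
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

Without the two `close()` calls in the parent, `run_pipeline` would hang at the second `waitpid()` in exactly the way the question describes.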
Jonathan thinks that Bash perhaps first forks, then from within that child sets up all the pipes and forks/execs all the commands in the pipeline, exec'ing the last command and setting it as the group leader. While this seems reasonable, I can confirm that Bash 3.2 and 4.0 do not do this -- they pipe and fork from the parent, and use the first command of the pipeline as the group leader. Incidentally, the simple xsh I left in a comment to the question does this as well.
ephemient
@uncleo, it's okay for the pipe FDs to become re-used after you've closed them, as they'll refer to different pipes. The warning is to check before mangling the pipe FDs in case they overlap with the "standard range" of FDs.
ephemient
@ephemient: you looked at the code - I bow to your superior knowledge. I'll have to see how the children are associated with the group leader, sometime, when I have time. My main points - that your answer was more on target than mine (even though I made one valid point), and that job control requires care - remain valid.
Jonathan Leffler
After a while, I'm still finding the algorithm rather confusing to apply to my code... I honestly look at the algorithm and can't possibly understand how it is going to work the way I want it to.
Nazgulled
A: 

The file descriptors from the pipe are reference counted, and incremented with each fork. For every fork, you have to issue a close on both descriptors in order to reduce the reference count to zero and allow the pipe to close. I'm guessing.

And you also have to close them in the parent process to reduce the reference count to zero, sorry.
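The reference-counting behaviour described here can be demonstrated with a small sketch: a reader on a pipe only sees EOF once every copy of the write end, in every process, has been closed (my own illustration; the helper name `pipe_roundtrip` is made up):

```c
#include <sys/wait.h>
#include <unistd.h>

/* Write through a pipe from a child; the parent must close its
 * own copy of the write end too, otherwise the read() loop below
 * would block forever instead of returning 0 (EOF) after the
 * data has been consumed. */
ssize_t pipe_roundtrip(char *buf, size_t cap) {
    int fd[2];
    if (pipe(fd) < 0)
        return -1;

    if (fork() == 0) {
        close(fd[0]);             /* child doesn't read */
        write(fd[1], "done", 4);
        close(fd[1]);             /* child's copy of the write end closed */
        _exit(0);
    }
    close(fd[1]);                 /* parent's copy closed as well */

    ssize_t total = 0, n;
    while ((n = read(fd[0], buf + total, cap - total)) > 0)
        total += n;               /* n == 0 means EOF: all writers gone */
    close(fd[0]);
    wait(NULL);                   /* reap the child */
    return total;
}
```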
You do realize this question is more than a year old, right?
Nazgulled