views: 51
answers: 5

I am new to Linux and was recently introduced to the "&". I have to run several traceroutes and store them in a single file, and I am curious whether I can kick off these traceroutes in parallel.

I tried the following, but the results in the generated file are not kept apart. At least, that is how it seems to me.

traceroute  -n -z 100 www.yahoo.com >> theLog.log &
traceroute  -n -z 100 www.abc.com >> theLog.log &

Is what I am asking even possible? If so, what commands should I be using?

Thanks for any direction given.

A: 

With the following command they are not really run in parallel, but you can continue using your terminal, and the results are kept separate:

{ traceroute -n -z 100 www.yahoo.com; traceroute -n -z 100 www.abc.com; } >> theLog.log &
enzotib
Thanks a lot. On my system, I had to use "()" instead of "{}".
PED
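For reference, a minimal sketch of the subshell form PED mentions; the parentheses run the grouped commands in a subshell, which for this purpose behaves like the braces (and does not need the trailing semicolon):

( traceroute -n -z 100 www.yahoo.com; traceroute -n -z 100 www.abc.com ) >> theLog.log &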
+1  A: 

Perhaps you could investigate parallel (and tell us about your experience)? If you are on Ubuntu, you can do sudo apt-get install moreutils to obtain parallel.

rmk
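A sketch of what that might look like with the hosts from the question, assuming the moreutils parallel (its man page uses the form parallel command -- arguments; note this is a different tool from GNU parallel, so the exact options differ):

parallel -j 2 traceroute -n -z 100 -- www.yahoo.com www.abc.com >> theLog.log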
It is funny that this didn't come up in my search. Yes, I am using Ubuntu, and I had only come across "xargs". After looking at that command, it seemed like a dead end.
PED
@PED: `parallel` promises to do exactly what you've asked. Example from README: `parallel traceroute ::: foss.org.my gnu.org freenetproject.org`. See `--group` option (enabled by default) http://savannah.gnu.org/projects/parallel
J.F. Sebastian
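Adapting that README example to the hosts and options from the question would look something like this (a sketch, assuming GNU parallel; its --group behavior, on by default, keeps each traceroute's output together rather than interleaved):

parallel traceroute -n -z 100 ::: www.yahoo.com www.abc.com >> theLog.log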
`xargs` is used to construct argument lists and feed them to commands.
rmk
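A sketch illustrating rmk's point with the hosts from the question: this feeds one hostname at a time to traceroute, still one after another (for what it's worth, GNU xargs also has a -P option to run several invocations at once):

echo www.yahoo.com www.abc.com | xargs -n 1 traceroute -n -z 100 >> theLog.log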
@J.F. Sebastian - thanks for the detailed information.
PED
A: 

As you have it written, the behavior is undefined. You might try what enzotib posted, or have each traceroute write to its own file and cat them together at the end.

Tristan
+1  A: 

If you want them to run in parallel, it is better to keep the intermediate results in separate files and then join them at the end. The steps are: start each trace writing to its own log file and store its pid, wait for them all to finish, then join the results. Something like the following:

traceroute -n -z 100 www.yahoo.com > theLog.1.log & PID1=$!
traceroute -n -z 100 www.abc.com > theLog.2.log & PID2=$!
wait $PID1 $PID2
cat theLog.1.log theLog.2.log > theLog.log
rm theLog.1.log theLog.2.log
diogok
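A slightly shorter variant of the same idea, as a sketch: in bash, wait with no arguments waits for all background jobs of the current shell, so storing the pids is not strictly necessary here.

traceroute -n -z 100 www.yahoo.com > theLog.1.log &
traceroute -n -z 100 www.abc.com > theLog.2.log &
wait
cat theLog.1.log theLog.2.log > theLog.log
rm theLog.1.log theLog.2.log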
A: 

The traceroutes in @enzotib's answer are executed one at a time, in sequence.

You can execute the traceroutes in parallel using the parallel utility suggested by @rmk.

$ /usr/bin/time parallel traceroute -n -z 100 <hosts.txt >> parallel.log
24.78user 0.63system 1:24.04elapsed 30%CPU (0avgtext+0avgdata 37456maxresident)k
72inputs+72outputs (2major+28776minor)pagefaults 0swaps

The sequential analog is about 5 times slower:

$ /usr/bin/time ./sequential.sh 
24.63user 0.51system 7:19.09elapsed 5%CPU (0avgtext+0avgdata 5296maxresident)k
112inputs+568outputs (1major+8759minor)pagefaults 0swaps

Where sequential.sh is:

#!/bin/bash
( while read host; do traceroute -n -z 100 $host; done; ) <hosts.txt >>sequential.log

And hosts.txt is:

www.yahoo.com
www.abc.com
www.google.com
stackoverflow.com
facebook.com
youtube.com
live.com
baidu.com
wikipedia.org
blogspot.com
qq.com
twitter.com
msn.com
yahoo.co.jp
taobao.com
google.co.in
sina.com.cn
amazon.com
google.de
google.com.hk
J.F. Sebastian