Ok, I was running POV-Ray on all the demos, but POV's still single-threaded and wouldn't utilize more than one core. So, I started thinking about a solution in BASH.

I wrote a general function that takes a list of commands and runs them in the designated number of sub-shells. This actually works but I don't like the way it handles accessing the next command in a thread-safe multi-process way:

  • It takes, as an argument, a file with commands (one per line),
  • To get the "next" command, each process ("thread") will:
    • wait until it can create a lock file, with: ln $CMDFILE $LOCKFILE
    • read the command from the file,
    • modify $CMDFILE by removing the first line,
    • remove the $LOCKFILE.

Is there a cleaner way to do this? I couldn't get the sub-shells to read a single line from a FIFO correctly.


Incidentally, the point of this is to enhance what I can do on a BASH command line, and not to find non-bash solutions. I tend to perform a lot of complicated tasks from the command line and want another tool in the toolbox.

Meanwhile, here's the function that handles getting the next line from the file. As you can see, it modifies an on-disk file each time it reads/removes a line. That's what seems hackish, but I'm not coming up with anything better, since FIFOs didn't work w/o setvbuf() in bash.

#
# Get/remove the first line from FILE, using LOCK as a semaphore (with
# short sleep for collisions).  Returns the text on standard output,
# returns zero on success, non-zero when file is empty.
#
parallel__nextLine() 
{
  local line rest file=$1 lock=$2

  # Wait for lock...
  until ln "${file}" "${lock}" 2>/dev/null
  do sleep 1
     [ -s "${file}" ] || return $?
  done

  # Open, read one "line", save the "rest" back to the file:
  exec 3<"$file"
  read line <&3 ; rest=$(cat <&3)
  exec 3<&-

  # After last line, make sure file is empty:
  ( [ -z "$rest" ] || echo "$rest" ) > "${file}"

  # Remove lock and 'return' the line read:
  rm -f "${lock}"
  [ -n "$line" ] && echo "$line"
}
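
The driver that hands the command file to the sub-shells isn't shown here; a minimal sketch of how worker sub-shells might consume parallel__nextLine could look like the following (parallel__run, cmdfile and nworkers are illustrative names, not part of the actual script):

# Sketch only: start N worker sub-shells, each pulling commands from the
# shared file until it is exhausted, then wait for all of them.
parallel__run()
{
  local cmdfile=$1 nworkers=$2 lock="${cmdfile}.lock" i line

  for ((i = 0; i < nworkers; i++)); do
    (
      # each worker loops until parallel__nextLine reports an empty file
      while line=$(parallel__nextLine "$cmdfile" "$lock"); do
        eval "$line"
      done
    ) &
  done
  wait   # block until every worker sub-shell has finished
}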
+1  A: 

If you need real thread safety, I would recommend migrating to a better scripting system.

With Python, for example, you can create real threads with safe synchronization using semaphores and queues.

rossoft
A: 

I believe you're actually forking processes here, and not threading. I would recommend looking for threading support in a different scripting language like Perl, Python, or Ruby.

dlamblin
Indeed, it does spawn sub-shells. I thought it was clear in the text that I meant the term "thread" conceptually; I did some edits to make that clearer.
NVRAM
Process forking is fairly cheap in Unix-based systems, and separate processes are often referred to as "threads", inaccurately. In any case, separate processes are often better than threads, since they're more self-contained.
David Thornley
I agree that forked processes may actually be the better approach than threads.
dlamblin
But having separate process memory precludes sharing a variable as could be done w/threads. Hence, the use of files...
NVRAM
+3  A: 
#adjust these as required
args_per_proc=1 #1 is fine for long running tasks
procs_in_parallel=4

xargs -n$args_per_proc -P$procs_in_parallel povray < list

Note that the nproc command, coming soon to coreutils, will auto-determine the number of available processing units, which can then be passed to -P.
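
For example, once nproc is available (or substituting a processor count obtained some other way), the two can be combined roughly like this:

# sketch: let nproc choose the -P value automatically
procs_in_parallel=$(nproc)
xargs -n"$args_per_proc" -P"$procs_in_parallel" povray < list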

pixelbeat
My question really is about programming in bash, not just getting the jobs to run in parallel. But this is good info - thanks!
NVRAM
BTW, **grep -c ^processor /proc/cpuinfo** seems to work fine on Linux if **/proc** is mounted, although I don't have a lot of machines on which to test it.
NVRAM
Yes, that will list the number of online CPUs in the system. The number available to a process may be smaller, though, due to a previous taskset, for example.
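
For example, once nproc is available, the difference is easy to see on a multi-core Linux box with taskset installed:

taskset -c 0 nproc                              # honours the affinity mask: prints 1
taskset -c 0 grep -c ^processor /proc/cpuinfo   # still prints the full online count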
pixelbeat
I'd misread your "coming soon" as if it applied to **xargs**. I'll accept this even if it isn't quite what I expected, and doesn't allow shell built-ins or script functions.
NVRAM
A: 

Sorry to bump this after so long, but I pieced together a fairly good solution for this, IMO. It doesn't work perfectly, but it will limit the script to a certain number of child tasks running, and then wait for all the rest at the end.

#!/bin/bash

pids=()

# Wait for the oldest PIDs until no more than 6 jobs remain,
# then keep the survivors in the global pids array.
thread() {
  local this
  while [ ${#} -gt 6 ]; do
    this=${1}
    wait "$this"
    shift
  done
  pids=($1 $2 $3 $4 $5 $6)
}

for i in 1 2 3 4 5 6 7 8 9 10
do
  sleep 5 &                     # stand-in for the real task
  pids=( ${pids[@]-} $! )       # record the new child's PID
  thread ${pids[@]}             # throttle if too many are running
done

# Wait for whatever is still running before exiting.
for pid in ${pids[@]}
do
  wait "$pid"
done

It seems to work great for what I'm doing (handling parallel uploading of a bunch of files at once) and keeps it from breaking my server, while still making sure all the files get uploaded before the script finishes.
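
Adapted to that upload case, the middle loop might look roughly like this ($remote and the file glob are placeholders, not part of the script above), followed by the same final wait loop:

# sketch: same throttling pattern, with the sleep replaced by an upload
for f in *.tar.gz
do
  scp "$f" "$remote:incoming/" &
  pids=( ${pids[@]-} $! )
  thread ${pids[@]}
done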

KageUrufu