ansaurus

Question

How can I handle scheduling threads with dependencies in Perl?

Answer 1

A:

Fork and create 2 processes:

In process 1:

sub_1; sub_3

In process 2:

sub_2; wait for sub_1 end; sub_4

Igor Oks 2010-08-10 11:10:52

what is this `wait for sub_1 end` syntax?is it ok to fork from an object's function?is there a more general solution for more complex dependencies and a larger number of subs to run concurrently?

David B 2010-08-10 11:42:44

It's just a pseudo-code for the general solution. You just have to find the appropriate syntax in Perl

Igor Oks 2010-08-10 11:54:11

please see edited post.

David B 2010-08-10 12:31:38

Answer 2

+1 A:

Here's a possible solution using threads and thread sharing. Most of the code is just mocking up the test and emulating threads that have to do "work" before they finish. In the example the main thread spawns seven threads that each have a random amount of time that they have to do "work". The threads cannot begin working until the other threads they are dependent on (set in the dependencies array) have finished. You can change the thread dependencies and run the example a few times to illustrate that it works correctly.

Additionally you can have each thread terminate after it runs and have the main thread terminate after all of the subthreads have finished by checking the status hash.

use strict;
use warnings;
use threads;
use threads::shared;


my %status : shared;

my $dependencies = [
                    {3 => 1},   #three can only run after one has finished...
                    {4 => 1},   #four can only run after one has finished...
                    {5 => 2},   #five can only run after two has finished...
                    {6 => 1},   #etc...
                    {6 => 2},
                    {7 => 1},
                    {7 => 2}
                   ];

main();

sub main{
    foreach my $thread_number (1..7){
        spawn_thread($thread_number);
    }

    while(1){
        print "I am the main thread\n";
        sleep(1);
    }
}

sub spawn_thread{
    my $thread_number = shift;
    $status{$thread_number} = 'wait';
    my $thr = threads->new(\&thread_routine, $thread_number); 

}

sub thread_routine{
    my $thread_number = shift;

    my $working_time_left =  int(rand(5)) + 1;    #make a random time that this thread needs to "work"

    while(1){
        print "I am thread number $thread_number with status $status{$thread_number}\n";
        {
            lock(%status);

                    #see if this thread is active;  if so, see if it finished running running
            if ($status{$thread_number} eq 'active'){
                if ($working_time_left <= 0){
                    $status{$thread_number} = 'ran';
                }
            }
            else{       
                            #see if we can activate                 
                if ($status{$thread_number} eq 'wait'){
                    my $can_activate = 1;
                    foreach my $index (0..$#$dependencies){
                        if (exists $dependencies->[$index]->{$thread_number}){
                            if ($status{$dependencies->[$index]->{$thread_number}} ne 'ran'){
                                $can_activate = 0;
                                last;
                            }
                        }
                    }
                    if ($can_activate){                 
                        $status{$thread_number} = "active";
                    }
                }

            }
        }

        sleep(1);

        if ($status{$thread_number} eq 'active'){   #do "work"
            $working_time_left--;
        }
    }
}

Narthring 2010-08-10 14:44:09

You are showing a multithreading newbie how to access global state without lock()s, and how to synchronize^Wcoordinate thread activity with polling loops? Let's let David B. first learn the rules, then master the rules, then break the rules.

pilcrow 2010-08-10 20:18:16

Good point about the locking the variable, since I had overlooked it. I have added that functionality to the example.

Narthring 2010-08-10 21:12:52

@Narthring, you might eliminate the sleep-looping by `cond_wait()` on a `%status` change to be `cond_broadcast()` .

pilcrow 2010-08-11 18:20:55

Answer 3

+3 A:

I'd suggest a "Boss/Worker" model, wherein one thread manages the subroutines to be executed in worker threads, who in turn report their status back to the boss upon completion.

In this model the boss is the only thread that needs to know how tasks are to be ordered. It might look something like this:

use threads;
use Thread::Queue;
use Thread::Pool;

our $done_queue = Thread::Queue->new;
our $work_pool  = Thread::Pool->new;

sub sub_1 {
  ... do the work ...
  $done_queue->enqueue('sub_1'); # tell the boss we're all done
}

sub sub_2 {
  ... do the work ...
  $done_queue->enqueue('sub_2'); # tell boss we're done
}

...

# Main loop (boss thread)

$work_pool->enqueue(\&sub_1);
$work_pool->enqueue(\&sub_2);

while (my $sub_name = $done_queue->dequeue) {
  # You, the boss thread, keep track of state and
  # transitions however you like.  You know what's
  # just finished and what's finished in the past
  ...
}

Of course, abstraction can make that neater -- you could hide the Pool and the Queue behind a single object, one which didn't require sub_1() to know about the status queue at all:

$boss->enqueue( 'sub_1' => \&sub_1 ); # Will return 'sub_1' via await_completed()
$boss->enqueue( 'sub_2' => \&sub_2 ); # Will return 'sub_1'

while (my $sub_name = $boss->await_completed) {
  ...
}

pilcrow 2010-08-12 04:28:04

ansaurus

tags:

views:

answers:

How can I handle scheduling threads with dependencies in Perl?

related questions