ansaurus

Question

Answer 1

+2 A:

I posted a description of topological sorting recently in a question about make -j. Serendipity! From the Wikipedia article:

The canonical application of topological sorting (topological order) is in scheduling a sequence of jobs or tasks; topological sorting algorithms were first studied in the early 1960s in the context of the PERT technique for scheduling in project management (Jarnagin 1960). The jobs are represented by vertices, and there is an edge from x to y if job x must be completed before job y can be started (for example, when washing clothes, the washing machine must finish before we put the clothes to dry). Then, a topological sort gives an order in which to perform the jobs.

Rough outline:

Build a dependency graph.
Find n modules that have no dependencies. These can be executed now in parallel.
Remove those modules from the graph.
Repeat step 2 until done.

Read those links for a more detailed description.

John Kugelman 2010-03-07 07:55:59

Answer 2

+1 A:

From your setting description you can also do it directly.

It looks like every modules known it's dependencies. Then adding a predicate function in every module stating if it can run is simple enough. A module can be run if and only if all of it's prerequisites dependencies are satisfied.

Top level modules have no dependencies so they can run from the start.

Basically that's a trivial implementation of a partial topological sorting (you don't have to explore all the dependency graph, just stay at top level).

Two pitfalls to be aware of:

If your dependencies contains cycles (A depends on B depending on C depending on A) it may loop forever (it means the problem has no solution). You should detect this case and report and error.

The modules you can run may be less than the number of thread. That should not be an error. Then you have found a solution either when you got n available modules to run or when you asked every modules if they can be run.

kriss 2010-03-07 08:23:23

You're right! I can just implement a can_run() method and loop through all remaining modules to find at most "n" modules and then start worker threads to run them. Thanks for the suggestion,

kumar 2010-03-07 11:46:28

ansaurus

tags:

views:

answers:

algorithm to find independent sets

related questions