ansaurus

Question

Graph algorithm - Looking to improve scalability

Answer 1

A:

Take a look at the DOT source code from Graphviz, it might give you some ideas.

Robert S. Barnes 2009-09-07 10:14:20

Answer 2

+5 A:

In order to answer this question, it is necessary to understand why do you need the list of paths. The list of paths does not give you any additional information over what you have in the DAG representation.

If you want to calculate things for each path separately, or calculate something like sum/min/max over all paths, it too could be done using the DAG itself.

If you do insist on saving separate paths, one option would be to convert your DAG into a variant of a Trie. Another option could be to use some variant of the Lempel-Ziv representation. It depends on your DAG types, and what you expect to do with the paths information.

Anna 2009-09-07 10:27:49

I specifically need it in the form of a multiset of paths because I am using it in that form for another algorithm which determines the entropic complexity.

Robert 2009-09-07 11:49:59

In this case, storing the paths in a different data structure won't help you, as you need a full length string representation.

Anna 2009-09-07 11:57:46

Edit: if you can change the parameters of the second algorithm, Lempel-Ziv (dictionary) style representation might save some space, and work faster.

Anna 2009-09-07 12:00:04

Answer 3

A:

Please allow me to put two (hopefully helpful) comments first:

I had some difficulties understanding your code, because some of the method names misled me. From looking at the names, I expected something else. May I suggest some refactoring:

makePathList -> createPathList  // you actually create a List here
append -> createPathList // yes, same name as above because it creates the same
                         // type of list, just with different parameters

(removed part of the answer that became obsolete after Robert's edit)

As Margus said, replacing the String concatenation with a StringBuilder append chain doesn't increase your performance. Compilers may optimize String concatenations to StringBuilder appends anyway (I've seen such byte code).

You could try to convert the DAG into a tree structure. Introduce an invisible root with all nodes as direct children. Now for each node add it's successors (child and/or sibling). The number of leaves now should be equal to the number of path and every graph from the root to any leaf is one path in the DAG.

Edit

A small improvement - it's micro-optimization but at least it will leave less garbage:

private List<String> append(String node, List<String> allPathsStartingAfterNode) {
    List<String> allPathsStartingAtNode = new ArrayList<String>();
    String nodeWithSeparator = node + "/";

    for (String aPathStartingAfterNode : allPathsStartingAfterNode) {
        allPathsStartingAtNode.add(nodeWithSeparator + aPathStartingAfterNode);
    }

    return allPathsStartingAtNode;
}

Andreas_D 2009-09-07 11:45:57

sorry, some superfluous code left in from when I was using a tree as input

Robert 2009-09-07 11:57:34

Answer 4

A:

A simple modification (depending on how you use the data) might be to load the paths lazily, that way if you tend to only use a few paths you'll never even generate some paths.

Of course, this depends entirely on expected use

Martin 2009-09-08 22:32:59

ansaurus

tags:

views:

answers:

Graph algorithm - Looking to improve scalability

related questions