ansaurus

Question

What's the simplest and most efficient data structure for building acyclic dependencies?

Answer 1

+7 A:

Since dependencies are always initialised before the objects that depend on them, and remain available until after such objects are destroyed, it should always be safe to destroy objects in strictly reverse order of initialisation. So all you need is a linked list to which you prepend objects as they are initialised and walk on destruction, and for each object to request initialisation of all its dependencies that have not yet been initialised before it initialises itself.

So for initialisation of each object:

initialise self, initialising uninitialised dependencies as we go
add self to front of destruction list (or push self onto stack if you're using a stack)

and for destruction, just walk the linked list from the front forwards (or pop items off stack until empty), destroying as you go. The example in your first paragraph initialised in order B, A, C would thus be destroyed in order C, A, B - which is safe; the example in your edit would be initialised in order B, A, E (not A, B, E since A depends on B), and thus destroyed in order E, A, B, which is also safe.

moonshadow 2009-08-11 16:16:04

They are not. His second paragraph details a scenario where this is not possible.

Hooked 2009-08-11 16:18:17

It says no such thing. If C and F are not initialised during initialisation of X, startup cannot proceed since X depends on them. Certainly he can leave off destroying X as late as he suggests, but he can also simply prepend it to the current safe order to obtain a new safe order.

moonshadow 2009-08-11 16:28:18

+1 A stack would be the data structure to implement this.

TheFogger 2009-08-11 16:46:43

"Since dependencies are always initialised before the objects that depend on them, and remain available until after such objects are destroyed," <-- I'm trying to *make this true*. It's not true already. See my edit.

Joseph Garvin 2009-08-11 16:57:56

In the example in your edit, B must be intialised before A, since A uses B. Therefore the stack will be BAE, not ABE or AEB; this is correct, since we want to destroy B last.

moonshadow 2009-08-11 17:10:51

Think of it this way: the graph you want to build, you must eventually walk. The stack is simply the representation of that walk, and in this case you can get that *without building the graph*.

moonshadow 2009-08-11 17:12:51

"In the example in your edit, B must be intialised before A, since A uses B" <-- No that's the issue ;) B is initialized *during the initialization of A, when A uses B*. B gets initialized "on demand".

Joseph Garvin 2009-08-11 17:48:35

OK, so if A doesn't initialise all its dependencies upfront, have A push itself on the stack at the end of its initialisation code. The effect is the same.

moonshadow 2009-08-11 18:04:24

Upon reconsideration I think you're right.

Joseph Garvin 2009-08-11 18:29:18

@moon: Could you edit your answer given our discussion? Then I'll mark as accepted.

Joseph Garvin 2009-08-11 18:45:25

I've actually been editing it as we've gone along ;) What else needs to be said that it currently doesn't?

moonshadow 2009-08-11 19:32:19

I didn't realize you already had the adding self to list as going after the initializing dependencies step ;)

Joseph Garvin 2009-08-12 15:36:47

Answer 2

+1 A:

It sounds like you should try to build a directed, acyclic graph with the pattern just as you described. An adjacency list representation (vector of linked lists, probably, seeing as you're getting new nodes on the fly) should do it.

One thing I'm not clear on: Do you need the computation at random times, or after you've gotten all of the information? I'm assuming the latter, that you can wait until you're graph is complete. If that's the case, your question is exactly a topological sort, for which there is time-linear-in-edges-and-vertices implementation. It is a relatively simple algorithm. I'm a bit turned around by your description (eating lunch makes me slow and sleepy, sorry), but you may in fact need a "reverse" topological sort, but the principles are identical. I won't try to explain how the algorithm works exactly (see: slow and sleepy), but I think the application should be clear. Unless if I'm entirely wrong, in which case, nevermind?

To summarize: In a sense, you're building up the data structure, a graph, in about as efficient time as you can hope for (it depends on how you're inserting). The graph reflects which objects need to wait on which other objects. Then when you're done building it you run the topological sort, and that reflects their dependencies.

Edit: It has been a while since I've mixed up "your" and "you're". :(

Agor 2009-08-11 16:18:26

"Do you need the computation at random times, or after you've gotten all of the information? I'm assuming the latter, that you can wait until you're graph is complete." <-- I'm trying to keep the algorithm in place, so it needs to be built 'as I go.' Otherwise I'd do exactly as you describe.

Joseph Garvin 2009-08-11 17:00:54

Answer 3

A:

Represent it like that: a graph with an edge from A to B if A's destructor must be run after B's. Inserting X now means adding two edges, and that's O(n log n)) if you keep a sorted index of nodes. To read the destruction order: pick any node, the follow the edges until you cannot anymore. That node's destructor can be safely called. Then pick one of the remaining nodes (e.g. the previous node you traversed) and try again.

From what you say, insertions happen often but the sequence is only iterated once for destruction: this datastructure should then be suitable since it has fast insertions, at the cost of slower lookups. Maybe someone else can suggest a faster way to do lookups in this datastructure.

redtuna 2009-08-11 16:19:22

Answer 4

A:

This sounds like you're building a tree from the leaves up.

Paul Nathan 2009-08-11 16:24:06

Sort of, except that I don't actually know which items are leaves or what order they'll be instantiated before hand.

Joseph Garvin 2009-08-11 17:05:49

What I'm seeing going on is that you have a Random Bag of elements in some kind of primordial preexistence. Then, as you know more, you can start setting up the data dependences into a tree. Yes? No?

Paul Nathan 2009-08-11 17:31:58

Answer 5

A:

Store it as a tree

have a node for each resource
have each resource keep a linked list of pointers to the resources that depend on that resource
have each resource keep a count of the number of resources that it depends on
keep a toplevel linked list of the resources that have no dependencies

To generate the order, go through your toplevel linked list

for each resourced processed, add it to the order
then decrement the counts of each resource that depends on it by one
if any count reaches zero, push that resource onto the toplevel list.

When the toplevel list is empty, then you've created a full order.

typedef struct _dependent Dependent;
typedef struct _resource_info ResourceInfo;

struct _dependent 
{
  Dependent * next;
  ResourceInfo * rinfo;
}
struct _resource_info
{
  Resource * resource; // whatever user-defined type you're using
  size_t num_dependencies;
  Dependent * dependents;
}

//...
Resource ** generateOrdering( size_t const numResources, Dependent * freeableResources )
{
  Resource ** const ordering = malloc(numResources * sizeof(Resource *));
  Resource ** nextInOrder = ordering;

  if (ordering == NULL) return NULL;
  while (freeableResources != NULL)
  {
    Dependent * const current = freeableResources;
    Dependent * dependents = current->rinfo->dependents;

    // pop from the top of the list
    freeableResources = freeableResources->next;

    // record this as next in order
    *nextInOrder = current->rinfo->resource;
    nextInOrder++;
    free(current->rinfo);
    free(current);

    while (dependents != NULL)
    {
       Dependent * const later = dependents;

       // pop this from the list
       dependents = later->next;

       later->rinfo->num_dependencies--;
       if (later->rinfo->num_dependencies == 0)
       {
          // make eligible for freeing
          later->next = freeableResources;
          freeableResources = later;
       }
       else
       {
           free(later);
       }
    }
  }
  return ordering;
}

To help create the tree, you might also want to have a quick lookup table to map Resources to ResourceInfos.

rampion 2009-08-11 17:04:03

Answer 6

+1 A:

It sounds to me that you have a directed acyclic graph and a topological sort will give you the order of object destruction. You will probably also need to special handle the case where the graph has cycles (circular dependencies).

Eugen Constantin Dinca 2009-08-11 18:23:48

Answer 7

A:

Are you more interested in destroying first-class C++ objects in the right order to avoid dependencies, or in modeling some external, real-world behavior where you're more interested in the algorithm and repeatability?

In the first case, you can use smart, reference-counting pointers (look for shared_ptr, available in Boost and the forthcoming C++ standard) to keep track of your objects, possibly with a factory function. When object A initializes and wants to use object B, it calls B's factory function and gets a smart pointer to B, increasing B's reference count. If C also references B, B's reference count increments again. A and C can be freed in any order, and B must be freed last. If you store shared_ptrs to all of your objects in an unordered data structure, when you're done running you'll free the list of all objects, and shared_ptr will take care of the rest, in the right order. (In this example, A and C are referenced only by the list of all objects, so their reference counts are both 1, and B is referenced by each A and C and the list of all objects, so its reference count is 3. When the list of all objects releases its reference to the objects, A and C's reference counts go to 0, so they can be freed in any order. B's reference count doesn't go to 0 until A and C are each freed, so it will continue to live until all references to it are freed.)

If you're more interested in the algorithm, you can model the reference counting in your own data structures, which may end up looking something like a directed acyclic graph when you're done.

Commodore Jaeger 2009-08-11 19:10:47

ansaurus

tags:

views:

answers:

What's the simplest and most efficient data structure for building acyclic dependencies?

related questions