ansaurus

Question

O(1) Make, Find, Union in Disjoint Sets Data Structure

Answer 1

+1 A:

Don't you also have to update the locator for the other elements in the same set as v? Where you are doing that?

Based on your comments, even if you are using pointer for locators, i believe you intended v.locator.representation = u.locator.representation

Consider the following case.

First you do A = B union C.

Now you do E = D Union A, will it work? What happens to the representation value of elements which were originally in C now?

Moron 2010-08-14 22:07:56

All elements in a set point to a locator, and that locator has a pointer to the real representation (for example the first element). That's why we only need to update the locator's pointer.

monn 2010-08-15 11:09:34

@monn:You are doing v.locator = u.locator. Don't you also have to do that for other elements in the same set as v?

Moron 2010-08-15 14:15:00

@monn: Even if locator was a pointer and you intended `v.locator.representation = u.locator.representation` it is wrong after you a a bunch of unions.

Moron 2010-08-15 14:27:26

@monn: Are you convinced now? Check out the edit to my answer about multiple unions...

Moron 2010-08-16 17:02:19

Answer 2

+2 A:

My intuition agrees with your colleague. You say:

u.set.list.append(v.set.list); // hypothetical method, append a list in O(1)

It looks like your intent is that the union is done via the append. But, to implement Union, you would have to remove duplicates for the result to be a set. So I can see an O(1) algorithm for a fixed set size, for example...

Int32 set1;
Int32 set2;

Int32 unionSets1And2 = set1 | set2;

But that strikes me as cheating. If you're doing this for general cases of N, I don't see how you avoid some form of iterating (or hash lookup). And that would make it O(n) (or at best O(log n)).

FYI: I had a hard time following your code. In makeSet, you construct a local Locator that never escapes the function. It doesn't look like it does anything. And it's not clear what your intent is in the append. Might want to edit and elaborate on your approach.

Rob 2010-08-14 22:18:36

The context for this disjoint set data structure is Kruskal's MST (as you can see on the slide). We grow forests initially by making each vertex as a forest. Then, there is no duplicate.

monn 2010-08-15 10:57:10

Ah, I forgot to set it. I've edited the code.I do the append only for the sake of printing it later when it's needed. For the Kruskal's MST, I think we don't need to have the list.

monn 2010-08-15 11:05:33

Answer 3

+2 A:

Using Tarjan's version of the Union-Find structure (with path compression and rank-weighed union), a sequence of m Finds and (n-1) intermixed Unions would be in O(m.α(m,n)), where α(m,n) is the inverse of Ackermann function which for all practical values of m and n has value 4. So this basically means that Union-Find has worst case amortized constant operations, but not quite.

To my knowledge, it is impossible to obtain a better theoretical complexity, though improvements have led to better practical efficiency.

For special cases of disjoint-sets such as those used in language theory, it has been shown that linear (i.e., everything in O(1)) adaptations are possible---essentially by grouping nodes together---but these improvements cannot be translated to the general problem. On the other hand of the spectrum, a somewhat similar core idea has been used with great success and ingenuity to make an O(n) algorithm for minimum spanning tree (Chazelle's algorithm).

So your code cannot be correct. The error is what Moron pointed out: when you make the union of two sets, you only update the "representation" of the lead of each list, but not of all other elements---while simultaneously assuming in the find function that every element directly knows its representation.

Jérémie 2010-08-14 22:40:07

I only need to update it because all elements refer to it. And it is a pointer to the representation.

monn 2010-08-15 11:00:10

ansaurus

tags:

views:

answers:

O(1) Make, Find, Union in Disjoint Sets Data Structure

related questions