ansaurus

Question

How do I manipulate a tree of immutable objects?

Answer 1

+2 A:

If your tree is immutable, then if you want to change it in anyway you have to produce a new tree.

This sounds bad, but its not if all your nodes are also immutable! Since you don't need to make copies of immutable objects, your new tree will mostly refer to the old tree except for the changes you made.

You'll have to design your tree in such a way that each immutable tree refers to other immutable trees. This way you won't need to reproduce the entire immutable tree either.

But if you go the immutable tree route, then you can't have back links. Otherwise you can't reuse sub trees.

Pyrolistical 2010-04-06 17:51:18

I figured out that the model is very much like the Git version control system, where changing a file causes the file, and thus the tree and all the above trees, to change.For back links, would there not be an "alias" approach or "path specifier" that can be resolved for a certain version of a tree?

Frederik 2010-04-06 18:06:09

What do you mean by back links? Because if a node links to the parent and the parent changes you'll have to regenerate all child and grandchild nodes, etc. That's a lot of work for one change.

Pyrolistical 2010-04-06 18:10:08

Well, a getParent() method would be a backlink to the Node's parent. If a Node would have a parent attribute, I would be unable to reuse the original Node. I was wondering if there was a smarter way to do this, equivalent to Unix's "symbolic links" for example.

Frederik 2010-04-06 18:37:45

I am not sure if you can do that without higher level structures over references or building your own pseudo reference model which might not perform well

Pyrolistical 2010-04-06 18:55:15

Answer 2

+3 A:

There are two concepts of interest here. First, persistent data structures. If all elements of the tree are immutable, then one can derive a new tree from the original tree by replacing some parts, but referring to the older parts, thus saving time and memory.

For example, if you were to add a third Port to the Node that has two ports already, you'd have to create a new Scene, a new Scene's Node's descendant, and the Node that you are changing. The other Node and all of the Ports do not need to be created anew -- you just refer to them in the new Scene/Nodes.

The other concept is that of a Zipper. A zipper is a way to "navigate" through a persistent data structure to optimize local changes. For instance, if you added four new Ports instead of just one, but you added each Port one at a time, you'd have to create four new Scenes, and eight new Nodes. With a zipper, you defer such creations until you are done, saving up on those intermediary objects.

The best explanation I ever read about zipper is here.

Now, use of a zipper to navigate a data structure remove the need to have back-links. You can have back-links in an immutable structure, by clever use of recursive constructors. However, such a data structure would not be persistent. Non-persistent immutable data structures have lousy modification performance, because you need to copy the whole data each time.

As for academic literature, I recommend Purely Function Data Structures, by Okasaki (dissertation PDF, fully fledged book).

Daniel 2010-04-06 18:09:35

+1 for both mentioning Zippers and Okasaki who, quite literally, wrote the book on this subject. Another interesting concept is Clojure 1.1's *transient* data structure. (Basically, a temporarily non-persistent datastructure.) In fact, Clojure in general is interesting: if Okasaki wrote the book on functional datastructures, Rich Hickey wrote the library. And, BTW: the Clojure datastructures are *specifically* written in such a way that they *can* be used as a Java library. They are completely independent from the Clojure language and the Clojure standard library.

Jörg W Mittag 2010-04-06 19:37:03

ansaurus

tags:

views:

answers:

How do I manipulate a tree of immutable objects?

related questions