ansaurus

Question

Answer 1

+6 A:

If you click on the method name of the RubyDoc you linked to, you will see the source (in C) of the Array#== method:

{
    // [...]
    if (RARRAY(ary1)->len != RARRAY(ary2)->len) return Qfalse;
    if (rb_inspecting_p(ary1)) return Qfalse;
    return rb_protect_inspect(recursive_equal, ary1, ary2);
}

This implementation (specifically the "recursive_equal") suggests that Array#== already implements the infinite recursion protection you're after.

Gareth 2010-09-12 20:43:46

Ah, so it does. Technically the protection we're after is to ultimately prevent it from even being allowed on our data type, but that's something for the add_child method. I guess it never occurred to us that it would just silently return and not throw some kind of error. It's all part of getting used to the language, I guess. Out of curiosity, do you know the reason for the silent return? (Admittedly, I'm not well versed in C, so I'm only vaguely following this implementation.)

David 2010-09-12 20:53:25

My C isn't great either, but it seems that the previous line `if (rb_inspecting_p(ary1)) return Qfalse;` is what actually triggers the "silent" false return. Basically it seems to return false if we encounter `ary1` at any point *inside* `ary1`.

Gareth 2010-09-12 21:22:18

@Gareth: Makes sense. I didn't even notice those implementations were linked on that site. That alone is going to help us in our efforts (as well as brush us up a little bit on reading C). Knowing that check is there implies to me that it's easy to check for, which will come in handy in the add_child method.

David 2010-09-12 21:33:31

Well, be aware that not everything in the C API is available in the Ruby API - the Array object is implemented in C rather than as a pure Ruby object. The Ruby calls are all you have to go on unless you're writing your own code as a C extension.

Gareth 2010-09-12 21:43:10

@Gareth: Thanks for the info. We wrote that test wanting it to explode with stack overflow, but we were only surprised to find that it did not.

Sean Copenhaver 2010-09-13 00:56:10

Woah, just found out that <Enter> submits the comment. Anyway, we have been finding that Ruby has some very powerful functions that has removed a lot of work we were expecting to have to do during our excises.

Sean Copenhaver 2010-09-13 00:57:36

The recursive_equal comment is correct but immaterial to the question. Why is this a surprise? I would like to know what language has a built in equality method that would do what the OP was expecting? Certainly not a well designed one.

2010-09-13 07:31:32

@beavis: As Gareth pointed out, Ruby does seem to be designed to avoid this infinite recursion. But I would hardly expect all "well-designed" languages to have that check built-in. We can certainly assume that C doesn't, since it had to be explicitly expressed in the C implementation here.

David 2010-09-13 09:46:05

I'm new to Ruby but it is my understanding that the original reference implementation was written in C but did not (and still does not?) follow a specification. Thus, is it possible that running this code on another implementation of Ruby (JRuby, for instance) could yield different results (specifically, infinite recursion)?Or am I misunderstanding the whole "different implementations of Ruby" situation?

Matt 2010-09-21 22:04:59

Answer 2

A:

Where is the recursive call?

Putting aside the question, "how is this a tree?" I guess I am confused how you think tree1==tree2 should not terminate. After you instantiate 2 "trees" and make the two add_child calls you have two object with the value of 1 and an array of size 1. The nested arrays inside that 1 element are not iterated over by ==

2010-09-12 20:59:06

Of course the arrays are iterated over. `Array#==` iterates over its contents. The only reason that this does not cause infinite recursion is that, as Gareth pointed out, the implementation of `==` has checks for that. Without that this would definitely cause infinite recursion.

sepp2k 2010-09-12 21:04:08

The recursive call is because comparing the two arrays `@nodes == node.nodes` should cause `Node#==` to be called on the members of the two @nodes arrays

Gareth 2010-09-12 21:04:22

Also: how is this not a tree?

sepp2k 2010-09-12 21:05:28

@nodes.size==1 and node.nodes.size==1. End of mystery.

2010-09-12 21:05:30

it is a node with a bunch of nodes in it. A node is a single place in the data structure. A node doesn't know what data structure it is living in and therefore can not arrange itself in a data structure.

2010-09-12 21:07:16

At best it is a really weird degenerate tree. AKA a really weird linked list.

2010-09-12 21:13:48

@beavis: A linked list and a tree are different only in that a linked list's node can refer to only one "child" (the next node in the linked list) whereas a tree's node can refer to any number of "children" (2 for a binary tree, or just an unbounded amount for a regular tree). And I'm not sure what you're getting at with your "@nodes.size==1 and node.nodes.size==1" statement. The comparison in this code isn't checking array length. The C implementation from Gareth does, but only for falsehood which doesn't apply here.

David 2010-09-12 21:18:31

My favourite part was where he suggested 2 arrays were equal as long as they were the same size

Gareth 2010-09-12 21:53:04

I never said that they were equal because they are the same size. What is in that single element? An array? No, it is a Node, and the same node. That is why they are equal. The fact that the == method protects against infinite recursion is completely irrelevant to the question. The mystery is why anyone would think that a comparison between two arrays, but with a single object in it would recurse infinitely.

2010-09-13 06:55:00

Yeah, a node in a tree can have 2+ references to other nodes. What it can't do is adjust the tree, therefore this is not a tree. A tree has nodes, but a node does not have a tree.

2010-09-13 07:00:25

"No, it is a Node, and the same node". It's not the same node. `tree1`'s `@nodes` contains `tree2`, and `tree2`'s `@nodes` contains `tree1`.

Gareth 2010-09-13 07:14:49

Yar, I just realized that D'oh! Still the point is that the question was "why is this recursion not infinite"? The answer is there is no recursion because the two arrays are size 1. Essentially, the question could have been rewritten as: why is [1]==[4] not recursive? Pretty senseless no? The fact that [Node] has an attribute nodes which has 1 Node which has ... is completely irrelevant.

2010-09-13 07:26:52

@beavis: I think you're confusing reference and value. tree1's child isn't a copy of tree2 with no further depth, it's a pointer to tree2, whose child is a pointer to tree1, etc. It's a circular reference, so it has infinite levels of children.

David 2010-09-13 09:35:14

@beavis: As for a tree and a node, you continue to not understand what those are. A node is, in fact, a tree. A single node is the smallest instance of a tree. The two are the same. Any node with zero or more child nodes is a tree of nodes.

David 2010-09-13 09:42:56

@beavis - "...is completely irrelevant" - No, it's not irrelevant. Comparing `[Node] == [Node]` checks 2 things in Ruby. First it checks that the array lengths are the same (as you mentioned). Then it checks that each Node in the first array is equal (==) to the Node in the same position in the second array. So, it calls `Node#==`

Gareth 2010-09-13 10:10:46

Basically, calling `Node#==` checks (among other things) that the two child `nodes` arrays are equal. To do that, you have to compare individual nodes from the child arrays, and to do *that* you have to call `Node#==`. So, to compare two (non-empty) nodes, you have to compare more nodes. This is pretty much the definition of recursive

Gareth 2010-09-13 10:14:00

@beavis: I'm morbidly curious... How would you organize a tree where any given node doesn't have references to its child nodes?

David 2010-09-13 11:54:30

ansaurus

tags:

views:

answers:

Why is this recursion NOT infinite?

related questions