ansaurus

Question

Find the character index of a node within its parent node with Hpricot

Answer 1

+1 A:

I don't think Hpricot works like that. Here is what I get doing a "node.inspect" based on your example

node.inspect
"{elem <b> \"years\" </b>}"

So, the position in the overall text that you are asking for just isn't there.

However, there are limited number of things you'd probably like to use the index for and you may be able to do these through the standard Hpricot methods

Mike Buckbee 2009-09-06 19:13:59

Also, see this Ruby-Forum topic:http://www.ruby-forum.com/topic/167535 where this same question is asked by someone wanting to check links. Relevant points: **1)** why do this when "Character position is meaningless in an XML and HTML DOM. Whitespace can change character positions without affecting the DOM at all" and **2)** Using libxml as an alternative since "libxml stores the line number of every element. So you can extract all links, check them, and print out element.line_num for each one that fails the check"

i5m 2010-01-18 14:15:51

ansaurus

tags:

views:

answers:

Find the character index of a node within its parent node with Hpricot

related questions