ansaurus

Question

Answer 1

+1 A:

HTML is not XML. The two are not interchangeable. Unless the "HTML" is actually XHTML, you will not be able to use XPATH to process it.

John Saunders 2009-12-15 21:28:43

I understand that - but Safari should be (and is, into the doc object) processing the "ugly" HTML into a nice, tidy, XHTML-compliant DOM, which should be able to be used with XPath, right?

Mike 2009-12-15 21:31:52

I was unaware of this magic cleanup feature of Safari.

John Saunders 2009-12-15 22:05:30

Answer 2

+1 A:

If you are using either:

a JS library or
you have a modern browser with the querySelectorAll method available (Safari is one)

You can try to use CSS selectors to parse the DOM instead of XPATH.

Mic 2009-12-15 21:50:21

ansaurus

tags:

views:

answers:

Parsing HTML with XPath/XMLHttpRequest

related questions