ansaurus

Question

How, using regex, can I capture the outer HTML element, when the same element type is nested within it?

Answer 1

+3 A:

Obviously, the "right" answer is to use a DOM parser instead of regex, but you say your markup is too broken for a parser.

Before resorting to a regex, though, check whether simpleHTMLDOM can make sense of it. It is a bit more lenient towards broken markup than the PHP DOM-based parsers.
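
To illustrate why a parser copes with nesting where a simple pattern does not, here is a minimal sketch. It is written in Python with the standard library's `html.parser`, not in PHP and not with simpleHTMLDOM, so treat it as an illustration of the idea only. The names `OuterElementFinder` and `outer_elements` are hypothetical and were made up for this example. The approach is to keep a depth counter for the tag of interest and to emit an element only when the counter returns to zero.

```python
from html.parser import HTMLParser


class OuterElementFinder(HTMLParser):
    """Collect outermost elements of one tag name (hypothetical helper class)."""

    def __init__(self, tag):
        super().__init__()
        self.tag = tag.lower()   # the parser reports tag names in lower case
        self.depth = 0           # how many target tags are currently open
        self.buf = []            # pieces of the element being rebuilt
        self.results = []        # finished outermost elements

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1
        if self.depth > 0:
            # get_starttag_text() returns the start tag as written in the source
            self.buf.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.depth == 0:
            return               # stray end tag outside any target element
        self.buf.append(f"</{tag}>")
        if tag == self.tag:
            self.depth -= 1
            if self.depth == 0:  # the outermost element just closed
                self.results.append("".join(self.buf))
                self.buf = []

    def handle_data(self, data):
        if self.depth > 0:
            self.buf.append(data)


def outer_elements(html, tag):
    """Return the outermost <tag> elements found in html, nested ones included."""
    finder = OuterElementFinder(tag)
    finder.feed(html)
    finder.close()
    return finder.results


sample = '<p>intro</p><div id="a">one<div>two</div>three</div><div>four</div>'
print(outer_elements(sample, "div"))
```

A non-greedy pattern such as `<div.*?</div>` would stop at the first closing tag and cut the outer element short, which is the problem the question describes. This sketch also has limits: an outer element that is never closed is not reported, and character references inside the element are decoded rather than preserved.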

Pekka 2010-08-11 09:40:35

Thanks for the library. I'm looking forward to trying it out!

pferdefleisch 2010-08-11 09:46:46
