ansaurus

Question

Answer 1

+6 A:

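This problem cannot be solved with a regular expression match. Seriously. I'm not just repeating the "don't parse HTML with regex" dogma; regular expressions are logically incapable of handling arbitrarily nested tags, because pairing each opening tag with its own closing tag means keeping track of nesting depth, and a regular expression has no way to count (which is why everyone says "don't parse HTML with regex").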

The best idea I can give you is to use an XML parser. If you insist on solving this problem using regular expressions, you will wind up writing your own recursive-descent parser anyway, so you might as well take advantage of the work others have done on that problem already.
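To make the point concrete, here is a minimal sketch in Python (the question's own markup and language are not shown here, so the input string and variable names below are made up for illustration). It compares a non-greedy regex with the standard library's `xml.etree.ElementTree` parser on a tag nested inside a tag of the same name.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical input: a <div> nested inside another <div>.
doc = "<root><div>outer <div>inner</div> tail</div></root>"

# The non-greedy regex stops at the FIRST closing tag it sees,
# so the outer <div> is cut short and its real end is lost.
regex_match = re.search(r"<div>(.*?)</div>", doc).group(0)

# The parser tracks nesting, so the outer element keeps its child intact.
root = ET.fromstring(doc)
outer = root.find("div")    # first <div> directly under <root>
inner = outer.find("div")   # the <div> nested inside it
outer_xml = ET.tostring(outer, encoding="unicode")

print(regex_match)  # <div>outer <div>inner</div>
print(outer_xml)    # <div>outer <div>inner</div> tail</div>
```

Making the pattern greedy does not help in general: with two sibling `<div>` elements it would swallow everything from the first opening tag to the last closing tag.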

David Zaslavsky 2010-08-17 04:08:03

Thanks for taking the time to explain instead of just saying "you can't".

Freddy 2010-08-17 04:14:54

You're welcome... you didn't seem to be convinced by the comments, so I figured a bit of explanation might help.

David Zaslavsky 2010-08-17 04:44:56


What is the problem with this regex?
