ansaurus

Question

Regex Lookaheads

Answer 1

+2 A:

Regexp is not a good tool to deal with programming language that are parsed with context-free grammars. Try to use XML DOM to do the job.

SHiNKiROU 2010-04-30 05:24:33

any hint or example on how to do with XML DOM? I don't want to be bound to Microsoft.XMLDOM indeed.

Michael 2010-04-30 05:26:14

Answer 2

+1 A:

I don't know JavaScript, so I can't help you with the DOM. I agree 100% that it's a bad idea to try and parse XML with regex. There might be a quick, very dirty, and very brittle workaround, though:

If indentation is consistent throughout the file, and <channel> elements are always at the same level of indentation, you could use that fact as a guide for the regex. In your example /^ {2}<pubDate>([^<]*)<\/pubdate>/m (= two spaces after start-of-line) might just work.

Use this at your own risk. Here be dragons etc.

Tim Pietzcker 2010-04-30 13:34:26

Answer 3

+1 A:

Check out jQuery and see if this helps reading/parsing the XML: http://think2loud.com/reading-xml-with-jquery/

KM

KM 2010-04-30 16:08:11

ansaurus

tags:

views:

answers:

Regex Lookaheads

related questions