ansaurus

Question

Answer 1

A:

What about regular expressions?

khmarbaise 2010-05-05 14:10:14

Answer 2

A:

As khmarbaise said, first check whether regular expressions can do the job. But there are cases in which they can't [*], and then I think ANTLR might really be a legitimate choice.

[*] For the mathematical background on this, see http://en.wikipedia.org/wiki/Formal_grammar#The_Chomsky_hierarchy

Update

Now that you have updated your question, I see what you really want to do. To modify a complete HTML file, I'd use a parser like NekoHTML, or something similar: http://www.benmccann.com/dev-blog/java-html-parsing-library-comparison/

You can use the parser to extract the URL. Then:

- parse only the URL itself, e.g. with regexes, Java's URL class (or sometimes better: URI), or maybe ANTLR
- modify the parsed URL
- write out the HTML again, using NekoHTML or whichever parser you chose
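The "parse only the URL, then modify it" step could look roughly like the sketch below, which uses java.net.URI from the standard library. The class name UrlRewriteSketch, the helper prefixFileName and the prefix "thumb_" are made up for illustration; extracting the URL from the HTML and writing it back is left to the HTML parser and is not shown.

```java
import java.net.URI;
import java.net.URISyntaxException;

public class UrlRewriteSketch {

    // Hypothetical helper: prepends `prefix` to the last path segment of `url`.
    // Assumes `url` is a syntactically valid URI; URI.create throws
    // IllegalArgumentException otherwise.
    static String prefixFileName(String url, String prefix) {
        URI uri = URI.create(url);
        String path = uri.getPath();

        // Opaque URIs (e.g. mailto:) have no path; paths ending in '/' have
        // no file name. Both are returned unchanged.
        if (path == null || path.isEmpty() || path.endsWith("/")) {
            return url;
        }

        // Split at the last '/', so only the file name is touched.
        int slash = path.lastIndexOf('/');
        String newPath = path.substring(0, slash + 1) + prefix + path.substring(slash + 1);

        try {
            // Rebuild the URI from its components, keeping query and fragment.
            return new URI(uri.getScheme(), uri.getAuthority(), newPath,
                           uri.getQuery(), uri.getFragment()).toString();
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(prefixFileName("http://example.com/images/foo.jpg?size=2", "thumb_"));
    }
}
```

Working on the parsed components rather than on the raw string means the query string and fragment survive untouched, which is the main reason to prefer URI over a hand-written regex here.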

Do not use regular expressions to parse the entire HTML file! You could use ANTLR for that in theory, but it would be very hard to make that work reliably.

Chris Lercher 2010-05-05 14:24:53

What does ANTLR have to do with regular expressions?

Bart Kiers 2010-05-07 20:01:12

@Bart: Regexes can parse Chomsky type 3 (regular) grammars. ANTLR can additionally parse Chomsky type 2 (context-free) grammars, so it can kick in where regexes aren't powerful enough anymore. So if you need to do something very complex to the URL, and that's how I had (mis-?)understood the original version of the question, it could be necessary. Also, even if you use ANTLR just to parse regular languages, it can be a lot cleaner than regexes, because the notation is BNF-like. Using ANTLR involves much more overhead of course, but as a replacement for very complex regexes, it's absolutely worth considering!

Chris Lercher 2010-05-07 22:43:31

@Bart: Of course, after the update of the question (seeing that the author just wants to prepend something to foo.jpg), ANTLR probably won't be necessary... :-)

Chris Lercher 2010-05-07 22:49:11


url rewriting with antlr
