ansaurus

Question

How do I exclude characters / symbols using ANTLR grammar?

Answer 1

A:

you can define a literal, that matches all the characters, that you do not want. If this literal is not contained in any of your rules, antlr will throw a NonViableException.

For unicode this could look like this:

 UTF8 :  ('\u0000'..'\u002A'     // ! to * 
     | '\u002E'..'\u002F'           // . / 
     | '\u003B'..'\u00FF'           // ; < = > ? @ as well as letters brackets and stuff
     ) 
     ;

nebenmir 2010-07-09 10:07:52

Answer 2

A:

...
So basically I'm looking for some kind of syntax that says: match everything BUT "0..9,:-"

The following rule matches any single character except a digit, ,, : and -:

Foo
  :  ~('0'..'9' | ',' | ':' | '-')
  ;

(the ~ negates single characters inside lexer-rules)

But you might want to post your entire grammar: I get the impression there are some other things you're not doing as they should have been done. Your call.

Bart Kiers 2010-07-12 18:27:39

ansaurus

tags:

views:

answers:

How do I exclude characters / symbols using ANTLR grammar?

related questions