ansaurus

Question

What is wrong with this grammar? (ANTLRWorks 1.4)

Answer 1

+2 A:

You've put 'drop' and 'put' in two different lexer-rules:

PUT_SYN  : 'put' | 'place' | 'drop';          // drop & put
PUT2_SYN : 'put' | 'douse';                   //        put
...
DROP_SYN : 'drop' | 'throw' | 'relinquish';   // drop

When put is encountered by the lexer, PUT_SYN will always be the rule that matches it, so 'put' could (or should) be removed from the PUT2_SYN rule.

So, your problem with parsing the string drop object: the parser will try to match drop_a : (DROP_SYN)(ID); but the "drop" will be matched in the lexer rule PUT_SYN.

EDIT

Those synonym-lists can be better made into parser rules (instead of lexer-rules). Here's a small demo:

grammar TextAdventure;

parse
  :  command (EndCommand command)* EOF
  ;

command
  :  put_syn_1 OtherWord in_syn OtherWord
  |  put_syn_2 out_syn_1 OtherWord
  |  out_syn_2 OtherWord
  |  Drop Kick OtherWord
  |  drop_syn OtherWord
  ;

drop_syn
  :  Drop
  |  Throw 
  |  Relinquish
  ;

in_syn
  :  In
  |  Into
  |  Inside
  |  Within
  ; 

put_syn_1
  :  Put
  |  Place
  |  Drop
  ;

put_syn_2
  :  Put
  |  Douse
  ;

out_syn_1
  :  Out
  ;

out_syn_2
  :  Extinguish
  |  Douse
  ;

Space      : (' ' | '\t' | '\r' | '\n'){$channel=HIDDEN;};
EndCommand : ';';
Put        : 'put';
Place      : 'place';
Drop       : 'drop';
Douse      : 'douse';
In         : 'in';
Into       : 'into';
Inside     : 'inside';
Within     : 'within';    
Out        : 'out';
Extinguish : 'extinguish';
Throw      : 'throw';
Relinquish : 'relinquish';
Kick       : 'kick';
OtherWord  : ('a'..'z' | 'A'..'Z')+;

When interpreting the following source:

drop object ; put yourself in myshoes ; place it in avase

you'll see ANTLRWorks generate the following parse-tree:

alt text

Bart Kiers 2010-09-26 19:05:32

Sounds plausible...what is the workaround - to get the various alternatives? Use a non-terminal for 'drop' (and another for 'put') and then build the alternatives using that non-terminal?

Jonathan Leffler 2010-09-26 19:12:34

Thanks for the explanation, Bart. I too am wondering about a workaround.

Rao 2010-09-26 19:20:41

The solution is to factor out that commonality and put the keyword `put` into its own rule. Something like `PUT_SYN: 'put' (PUT_CMD); PUT_CMD: (ID) ...|(OUT_SYN) ...;` That is just an example of what I mean by 'factoring' out.

linuxuser27 2010-09-26 19:34:50

@Rao, @Jonathan, the fix/workaround is what @linuxuser27 mentioned.

Bart Kiers 2010-09-26 20:40:36

@Rao (or @rikki), perhaps you'd like to explain what kind of language you're trying to parse because I see quite a few odd things in your grammar that might need fixing.

Bart Kiers 2010-09-26 20:43:00

I'm penning down ideas for creating a text-adventure game creator. The user will be made to input the various possible commands that he wishes to allow in the game. These commands may have aliases. My question reflects the subtleties faced when different commands may have the same "words" or "tokens". I was planning to use a lot of C# Dictionary<T>'s, but decided to play around with ANTLR both for prototyping and _perhaps_ as a library for implementing the project.

Rao 2010-09-27 09:16:45

@Rao, see my edit.

Bart Kiers 2010-09-27 13:11:02

Wow, thanks! Works perfectly! I'll spend some time dwelling over it now. At present, I think this approach is good for my project. (I'm only sorry I can't 'accept' your answer, differing accounts and all.)

Rao 2010-09-27 13:36:21

@Rao, you're welcome, and no worries about not being able to accept my answer.

Bart Kiers 2010-09-27 13:38:50

ansaurus

tags:

views:

answers:

What is wrong with this grammar? (ANTLRWorks 1.4)

related questions