ansaurus

Question

AST with fixed nodes instead of error nodes in ANTLR

Answer 1

A:

I haven't used ANTLR much, but the usual way to handle this kind of error is to add rules that match the wrong syntax, have them produce error nodes, and then recover after each error so that parsing can continue. The recovery is the hard part: you don't want one error to trigger a new error for every following token until the end of the input.
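To illustrate the recovery step, here is a minimal hand-written sketch in Java rather than ANTLR. It assumes a toy grammar in which every statement is `import NAME ;` and treats `;` as the point to resynchronize at. The class name `ResyncSketch`, the pre-split token list, and the string form of the nodes are all made up for the example.

```java
import java.util.ArrayList;
import java.util.List;

class ResyncSketch {
    // Toy assumption: a name is an identifier-like token, dots allowed.
    static boolean isName(String token) {
        return token.matches("[A-Za-z_][\\w.]*");
    }

    // Parses a flat token list into one node per statement.
    static List<String> parse(List<String> tokens) {
        List<String> nodes = new ArrayList<>();
        int i = 0;
        while (i < tokens.size()) {
            int start = i;
            boolean wellFormed = i + 2 < tokens.size()
                    && tokens.get(i).equals("import")
                    && isName(tokens.get(i + 1))
                    && tokens.get(i + 2).equals(";");
            if (wellFormed) {
                nodes.add("IMPORT(" + tokens.get(i + 1) + ")");
                i += 3;
            } else {
                // Recovery: skip ahead to just past the next ';' so that this
                // one error yields exactly one error node, not one per token.
                while (i < tokens.size() && !tokens.get(i).equals(";")) {
                    i++;
                }
                if (i < tokens.size()) {
                    i++; // consume the ';' itself
                }
                nodes.add("ERROR(" + String.join(" ", tokens.subList(start, i)) + ")");
            }
        }
        return nodes;
    }
}
```

With the tokens `import java.util.List ; import ; import java.io.File ;` the broken middle statement becomes a single error node and the statement after it still parses normally.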

nategoose 2010-05-14 17:39:46

Answer 2

A:

I solved the problem by adding new alternative rules to the grammar for all possible erroneous statements.

For example, each Java import statement is translated to an AST subtree with the artificial symbol IMPORT as its root. To tell ASTs from correct code apart from ASTs from erroneous code, the rules for the erroneous statements rewrite them to an AST whose root symbol carries the prefix ERR_. For the import statement, the artificial root symbol would be ERR_IMPORT.

Additional root symbols could be used to encode more detailed information about the parse error.
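The idea can be sketched in plain Java without ANTLR. This is only an illustration of the IMPORT versus ERR_IMPORT distinction, not my actual grammar: the class `ErrRootSketch`, the `Node` type, and the string-based matching are invented for the example, and the grammar alternatives in the comment are an approximation written in ANTLR 3 rewrite style.

```java
import java.util.ArrayList;
import java.util.List;

class ErrRootSketch {
    // Hypothetical AST node: an artificial root symbol plus child texts.
    static final class Node {
        final String root;
        final List<String> children;

        Node(String root, List<String> children) {
            this.root = root;
            this.children = children;
        }

        // Erroneous statements are recognizable by the ERR_ prefix alone.
        boolean isError() {
            return root.startsWith("ERR_");
        }

        @Override
        public String toString() {
            return "(" + root + (children.isEmpty() ? "" : " " + String.join(" ", children)) + ")";
        }
    }

    // Roughly mirrors grammar alternatives such as (assumed, ANTLR 3 style):
    //   importStmt : 'import' name ';'  -> ^(IMPORT name)
    //              | 'import' name      -> ^(ERR_IMPORT name)   // missing ';'
    //              | 'import' ';'       -> ^(ERR_IMPORT)        // missing name
    static Node parseImport(String line) {
        String s = line.trim();
        if (!s.matches("import\\b.*")) {
            throw new IllegalArgumentException("not an import statement: " + line);
        }
        String rest = s.substring("import".length()).trim();
        boolean terminated = rest.endsWith(";");
        if (terminated) {
            rest = rest.substring(0, rest.length() - 1).trim();
        }
        boolean validName = rest.matches("[A-Za-z_]\\w*(\\.[A-Za-z_]\\w*)*(\\.\\*)?");

        List<String> children = new ArrayList<>();
        if (!rest.isEmpty()) {
            children.add(rest); // keep whatever was written, even if malformed
        }
        String root = (terminated && validName) ? "IMPORT" : "ERR_IMPORT";
        return new Node(root, children);
    }
}
```

Code that walks the tree afterwards only needs to check `isError()` on a root, or compare the root symbol, to decide whether a subtree came from a correct or an erroneous statement.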

My parser is now as error tolerant as I need it to be, and it's very easy to add rules for new kinds of erroneous input whenever I need to. You do have to take care not to introduce any ambiguities into your grammar, though.

ahe 2010-06-14 07:50:41
