ansaurus

Question

Does this program grammar only recognize variables with the name 'ID'?

Answer 1

+1 A:

ID is just the terminal type returned by the lexer. The idea is that, in the case of variable names (and numbers), other returned information will specify the name (or number). In C-like psuedo-code, the lexer is doing something like:

char *tok = tokenise();
if (!strcmp(tok, "int"))
{
    return INT;
}
else if (is_name(tok))
{
    strcpy(parser.name, tok);
    return ID;
}
else if (is_number(tok))
{
    parser.number = atoi(tok);
    return NUM;
}
...

The parser receives the terminal type (INT, ID, NUM, etc.) and this is sufficient information for it to apply the grammar rules. The actions in the rules can then include the extra information (parser.name, parser.number, etc.) either directly or when constructing the AST.

Edmund 2009-11-17 07:02:54

How can you tell that INT is taken literally, but not ID?

Phenom 2009-11-17 07:20:52

That is an extremely good question. There is no way to tell from the grammar. I was guessing based on the extreme similarity of your grammar to several others I have seen with their accompanying lexers. Though since your question seems to be "how would this work in the real world (with a realistic lexer)?" I think my assumption is reasonable.

Edmund 2009-11-17 07:52:18

ansaurus

tags:

views:

answers:

Does this program grammar only recognize variables with the name 'ID'?

related questions