ansaurus

Question

Answer 1

A:

It sounds to me like using a parsing tool is overkill, particularly if you aren't already familiar with a tool that could do the job.

Artelius 2009-12-18 00:37:45

Answer 2

A:

Is it possible to change the rules format to use an already-existing syntax that has a readily-available parser?

Ross 2009-12-18 00:51:05

Answer 3

+2 A:

While your language is simple, using ANTLR has a lot of advantages.

Speed. The generated code is VERY fast.
Simplicity. Since you're working in a higher-level language, small grammar changes are less costly and complex.
Extensibility. Since you're working in a higher-level language, adding features is a lower-cost activity.

Yes, you need to learn ANTLR. And if your grammar has ambiguities, you'll need to learn about shift-reduce and reduce-reduce conflicts. This can be time well spent.

Many problems are lexical scanning or parsing problems. Knowing how to create a lexical scanner and parser is a helpful skill.

S.Lott 2009-12-18 00:51:25

I understand this. But the format is not made by me. I just want to get experience on doing things like this.

Javier Badia 2009-12-18 01:26:01

To the edit: Yeah, I kinda suck at explaining. I was wondering whether I should use one of those tools like ANTLR or that's overkill and I should do the parsing myself.

Javier Badia 2009-12-18 15:20:15

@Javier Badia: "I kinda suck at explaining" False. You just focus on the fact you want but do not have. Write it as your first sentence. "I want to know [X]" In your case, it appears to be "I want to know which parser to use." The rest of the question is background to help us provide the fact you want. It helps to state the fact you want. Forget about "explaining".

S.Lott 2009-12-18 15:46:20

@@Javier Badia: Update the question with what you REALLY want to know. Write your question clearly and simply. Something focused on the fact you want but do not have. Something like "I want to know if I should use ANTLR or write my own parser?" I think that's the fact you're looking for.

S.Lott 2009-12-18 15:51:43

Answer 4

+1 A:

If your problem is simply that of parsing the rules, you might not need a parser generator. As you said, all the rules are in the form X/Y/Z and splitting them would be very easy in any language.

If, as I suspect, you are creating a tool that would read the rules and apply them to a file, the problem is considerably more complex.

To use a parser generator, assuming you have a fixed set of rules, you have to translate them in a set of grammar productions in the format required by your parser generator and feed them to it. Compiling the parser generator output, you will get a program that is able to translate a file according those rules. Given that your rules seems context sensitive (c/g/V_V) I suggest to look for parser generators that offer GLR (Tomita parsers) or PEG (Parsing Epression Grammars).

If your set of rules is not fixed and your program has to read them together with the file to transform, what you really need is a text transformation engine. In this case you will translate you X/Y/Z rules into the proper format and feed it to the engine together with the source file.

Assuming you don't want to write your own engine, you can can look at generalized macro processors (M4, Gema, ...) or directly to interpreted languages (perl, Lua, ...) to help you.

For example in Gema, you could translate c/g/V_V into:

<vowel>c<vowel>=$1g$2
vowel:a=a;e=e;i=i;o=o;u=u;=@terminate

and in Lua into:

function rule1(s)
  return (string.gsub(s,"([aeiou])c([aeiou])","%1g%2"))
end

In the end, it really depends if you need to create something for a given set of rules or if you need to be able to read and interpret any set of rules.

Of course, in any case you have to parse your rules to be able to tranform them in the proper format but, as I said at the beginning, the syntax looks very straightforward to me and wouldn't justify the use of a parser generator.

Remo.D 2009-12-19 10:29:55

Answer 5

A:

First answer to yourself to the question: "Does this language have any nested/recursive patterns?"

If yes, you need a parser of context-free grammar at least. Build by yourself by hand, or generated by some parser generator.

If no, regular expressions are enough.

SasQ 2010-09-01 04:18:48

ansaurus

tags:

views:

answers:

Should I use a parser/lexer for this?

Sound change format

Variables

Rule order

Optional elements in the environment

related questions