ansaurus

Question

Design strategy for a simple code parser

Answer 1

+1 A:

You should be able to put something together in a few hours, using regular expressions in combination with some code that uses the results.

Something like this should work: - Initialize the process by loading the file into a string.

Pull each top-level block from the string, using regex tags to separately identify the block keyword and contents.
If a block is found,
- Make a decision based on the keyword
- Pass the content to this process recursively.

Following this, you would process HEADING, then the first SUBHEADING, then the second SUBHEADING, then each sub-block. For the sub-block containing the block comment, you would presumably know based on the block's lack of a keyword that any sub-block is a comment, so there is no need to process the sub-blocks.

John Fisher 2009-05-27 19:20:05

Thanks for the advice. As a result I've taken the initiative to learn more about regular expressions.

polara 2009-05-29 14:33:16

Answer 2

+3 A:

I'd suggest writing a tokenizer and parser; this will give you more flexibility. The tokenizer basically does a simple text-wise breakdown of the sourcecode and puts it into more usable data structure; the parser figures out what to do with it, often leveraging recursion.

Terms to google: tokenizer, parser, compiler design, grammars

Math expression evaluator: http://www.codeproject.com/KB/vb/math_expression_evaluator.aspx (you might be able to take an example like this and hack it apart into what you want)

More info about parsing: http://www.codeproject.com/KB/recipes/TinyPG.aspx

You won't have to go nearly as far as those articles go, but, you're going to want to study a bit on this one first.

FastAl 2009-05-27 19:41:53

Answer 3

A:

No matter which solution you will choose, I'm pretty sure the best way is to have 2 parsers/tokenizers. One for the main file structure with {} as grouping characters, and one for the code blocks.

devio 2009-05-27 19:54:27

ansaurus

tags:

views:

answers:

Design strategy for a simple code parser

related questions