ansaurus

Question

PCRE: Find matching brace for code block

Answer 1

+4 A:

This is exactly what regular expressions are not good for. It's the classic example.

You should just iterate over the string character by character, and keep a count of the nesting level.
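A minimal sketch of that counter approach, in case it helps. The function name `find_matching_brace` and its parameters are made up for illustration, and it assumes braces inside string literals or comments do not need special handling:

```php
<?php
/**
 * Hypothetical helper: returns the offset of the brace that closes the
 * opening brace at offset $open, or -1 if no matching brace exists.
 */
function find_matching_brace(string $code, int $open = 0): int
{
    // The starting offset must point at an opening brace.
    if ($open < 0 || !isset($code[$open]) || $code[$open] !== '{') {
        return -1;
    }

    $depth = 0; // current nesting level
    $length = strlen($code);

    for ($i = $open; $i < $length; $i++) {
        if ($code[$i] === '{') {
            $depth++;
        } elseif ($code[$i] === '}') {
            $depth--;
            if ($depth === 0) {
                return $i; // back at the starting level: this is the match
            }
        }
    }

    return -1; // ran out of input: braces are unbalanced
}
```

For example, `find_matching_brace('{ a { b } c } d')` scans from offset 0 and stops at the outer closing brace, while passing `4` as the second argument starts at the inner block instead. It is a single pass over the string with no backtracking.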

Mark Byers 2010-02-27 19:08:31

Answer 2

A:

$regex='%^(\\d|\\:|\\{|\\}|,){0,25}$%';
preg_match($regex,$target,$matches);

Here, the 25 on the first line is the maximum number of occurrences. Then check:

$n=count($matches);

stillstanding 2010-02-27 19:08:43

Answer 3

A:

It is impossible since the language you are describing is not a regular language.

Use a parser instead.

Otto Allmendinger 2010-02-27 19:08:47

Answer 4

+4 A:

PCRE supports recursive patterns, so you can do something like this:

$code_is_valid = preg_match('~^({ ( (?>[^{}]+) | (?1) )* })$~x', '{' . $code .'}');

That said, I don't think this will be faster or use less memory than a simple counter, especially on large strings.

And this is how to find all (valid) code blocks in a string:

preg_match_all('~ { ( (?>[^{}]+) | (?R) )* } ~x', $input, $blocks);
print_r($blocks);

stereofrog 2010-02-27 19:30:05

Thanks for that. I'm going to implement both solutions, then benchmark and profile them to see which is more appropriate. By the looks of things, I was wrong and was probably better off iterating over each character!

mynameiszanders 2010-02-28 18:57:31

Yes, this definitely needs profiling. Don't forget to share your benchmarks!

stereofrog 2010-02-28 19:14:05

Answer 5

A:

I created a solution, and have posted it as an answer on my previous question.

Thanks for all your help, mniz.

mynameiszanders 2010-03-01 03:38:32
