ansaurus

Question

Take multiples matches with regex separated by defined marks

Answer 1

A:

I don't think you'll be able to achieve this with a single expression. Likely you'll need to break it down into an initial expression and then a loop to perform a 2nd expression match against each iteration of the first match.

Isaac Dealey 2009-01-20 21:18:28

Answer 2

A:

Am I missing something or is this what you are looking for?

/(_MARK1_ (.*?) _MARK2 (.*?))*/

I made some arbitrary assumptions about how you want to handle spaces, which I realize were probably only consistent to make your example case more readable.

Sparr 2009-01-20 21:21:26

Answer 3

+1 A:

That would be:

/(_MARK1_(.*?)_MARK2_((?:(?!_MARK1_).)*))/g

At least, it works on RegEx Coach on your test case.
Of course, you need to iterate on each match.
Note it might not work on all flavors of regex: JavaScript, for example, has no lookahead assertions.

PhiLho 2009-01-20 21:36:29

perfect. Thats it

Davi Kenji 2009-01-20 21:43:35

good catch, excluding _MARK2__MARK1_, I didn't cover that case in my solution

Sparr 2009-01-20 22:48:38

Answer 4

A:

I'm not sure whether you actually need the separating marks in your array. That part seems superfluous unless you have a specific spec for it. This solution assumes you don't really need that. Since you didn't specify a language, how about Perl?

use Data::Dumper;
my $text = 'textA textB _MARK1_ textC _MARK2_ textD _MARK1_ textE textF _MARK2_ textG textH textI';
my @results = $text =~ m/(?<=_MARK1_|_MARK2_)(.*?)(?=_MARK1_|_MARK2_|$)/g;
print Data::Dumper::Dumper @results;

However, there's no reason to try the general case with regular expressions. Use a parser instead.

2009-01-20 21:44:17

ansaurus

tags:

views:

answers:

Take multiples matches with regex separated by defined marks

related questions