ansaurus

Question

Answer 1

+7 A:

/XX(.*?)(?=XX)/

Sjoerd 2010-06-16 14:06:05

Beautiful, thanks!

itzy 2010-06-16 14:23:47

Answer 2

+3 A:

you can use split

@stuff_between_xx = split /XX/, $x1;

number of matches:

$stuff_between_xx = split /XX/, $x1;

knittl 2010-06-16 14:08:11

Thanks, that'll work. It's funny how you get stuck thinking in one way, and don't see obvious solutions. But I am curious if anyone has another solution that would work just with regex -- mostly so I can learn.

itzy 2010-06-16 14:13:58

This assigns to `$stuff_between_xx` the **number** of parts found

kemp 2010-06-16 14:14:52

@kemp: whops, corrected

knittl 2010-06-16 14:27:02

Answer 3

A:

my $x2 = 'XX a b XX c d XX e f XX';

my @parts = grep { $_ ne '' } split /\s*XX\s*/, $x2;

kemp 2010-06-16 14:12:52

Answer 4

+3 A:

I would suggest split as well as knittl. But you might want to remove the whitespace as well:

my @stuff = split /\s*XX\s*/, $line;

Also you could use lookaheads, but you really don't need them, because you can use reasonably complex alternations as well:

Non-ws version would just be:

my @stuff = $line =~ m/XX((?:[^X]|X[^X])*)/g;

The alternation says that you'll take anything if it's not an 'X'--but you will take an 'X' if it's not followed by another 'X'. There will be one character of lookahead, but it can consume characters aggressively, without backtracking.

The trimming version will have to backtrack to get rid of space characters, so the expression is uglier.

my @stuff = $line =~ m/XX\s*((?:[^X]|X[^X])*?(?:[^X\s]|X[^X]))/g;

Axeman 2010-06-16 14:22:13

ansaurus

tags:

views:

answers:

Perl regular expression question

related questions