ansaurus

Question

I'm new to Perl and have a few regex questions

Answer 1

+7 A:

how does the following line work...

([^"]*) # Anchor name

zero or more things which aren't ", captured as $1, $2, or whatever, depending on the number of brackets ( in we are.

how does the following line work...

([^>]*) # Post title

zero or more things which aren't >, captured as $1, $2, or whatever.

what does the "six" mean in the following line...

<\/dd>/six)

s = match as single line (this just means that "." matches everything, including \n, which it would not do otherwise)
i = match case insensitive
x = ignore whitespace in regex.

x also makes it possible to put comments into the regex itself, so the things like # Post title there are just comments.

See perldoc perlre for more / better information. The link is for Perl 5.10. If you don't have Perl 5.10 you should look at the perlre document for your version of Perl instead.

Kinopiko 2009-10-22 06:19:33

thank you very much!

BeachRunnerJoe 2009-10-22 06:22:04

"match as single line" is not very informative. /s means . matches any character, including newline, instead of the default any character except newline.

ysth 2009-10-22 06:24:08

@ysth: I've altered it as you suggest.

Kinopiko 2009-10-22 06:31:30

Answer 2

+1 A:

The code is an extended regex. It allows you to put whitespace and comments in your regexes. See perldoc perlre and perlretut. Otherwise like normal.
Same.
The characters are regex modifiers.

daotoad 2009-10-22 06:22:27

Answer 3

+2 A:

[^"]* means "any string of zero or more characters that doesn't contain a quotation mark". This is surrounded by quotes making forming a quoted string, the kind that follows <a name=
[^>]* is similar to the above, it means any string that doesn't contain >. Note here that you probably mean [^<], to match until the opening < for the next tag, not including the actual opening.
that's a collection of php specific regexp flags. I know i means case insensitive, not sure about the rest.

Blindy 2009-10-22 06:22:55

re: #3 - No, it's not PHP-specific. They're flags used by PCRE regex libraries, which are available for many different languages. See the documentation at http://www.pcre.org/pcre.txt for full details of PCRE. (PCRE = Perl-Compatible Regular Expressions)

Dave Sherohman 2009-10-22 09:37:42

regardless, I still don't know what they all mean :)

Blindy 2009-10-22 14:54:48

ansaurus

tags:

views:

answers:

I'm new to Perl and have a few regex questions

related questions