ansaurus

Question

Answer 1

+7 A:

The second regexp {:*} matches the empty string because the empty string is 0 occurrences of :. If you use the -indices option for regexp, you'll see that it matches at position 0.

 % regexp -indices :* "DHCP:Enabled" indices
 1
 % puts $indices
 0 -1

In other words, the regexp matches at the first character and returns.

Trey Jackson 2010-07-28 16:48:15

Answer 2

A:

It matches the empty string so that it can match that empty string at the start of “DHCP:Enabled”. The regular expression engine like to match things up as soon as possible. To show, here's an interactive session:

% regexp -inline {:*} "DHCP:Enabled"
{}
% regexp -inline -all {:*} "DHCP:Enabled"
{} {} {} {} : {} {} {} {} {} {} {}
% regexp -inline -indices -all {:*} "DHCP:Enabled"
{0 -1} {1 0} {2 1} {3 2} {4 4} {5 4} {6 5} {7 6} {8 7} {9 8} {10 9} {11 10}

The -inline option is useful for simple testing, the -all matches in every matchable location instead of just the first, and the -indices returns locations rather than the string.

Note that only once (4 4) is the end at least at the same index as the start; in all other cases, an empty string matches (and it's legal; you said that matching nothing was OK).

In general, it's a really good idea to make sure that your overall RE cannot match the empty string or you'll be surprised by the results.

Donal Fellows 2010-07-29 12:27:05

ansaurus

tags:

views:

answers:

TCL Regular Expression Doubt

related questions