Hi, I am using VBscript in QTP and I am a bit confused:

Browser("name:=.*") //works

Why does Browser("name:=*") not work? Why is the . character needed?

Thank you!

+6  A: 

While normal wildcards (such as those used in shells for specifying many files at once, e.g. *.txt) use only the asterisk (*) as a sign for zero or more arbitrary characters, in regular expressions it is a quantifier. It tells the regex engine something about the preceding token. A dot (.) matches a single arbitrary character; a dot followed by an asterisk therefore matches zero or more arbitrary characters.

However, a = followed by a * will match zero or more equals signs (=), since the asterisk always works on the preceding token, which is just the equals sign here.
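A minimal sketch of the difference, written with Python's `re` module rather than QTP's VBScript engine (an assumption made for illustration; the basic behavior of `.` and `*` shown here is common to both). The variable names are made up for the example.

```python
import re

# ".*": the dot is the token, and the asterisk repeats it zero or more times.
dot_star = re.compile(r".*")
# "=*": the equals sign is the token, so only runs of "=" can match.
equals_star = re.compile(r"=*")

# fullmatch() requires the whole string to match the pattern.
dot_matches_text = dot_star.fullmatch("Google") is not None
dot_matches_empty = dot_star.fullmatch("") is not None
equals_matches_run = equals_star.fullmatch("===") is not None
equals_matches_text = equals_star.fullmatch("Google") is not None

print(dot_matches_text, dot_matches_empty)      # True True
print(equals_matches_run, equals_matches_text)  # True False
```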

Note: A token can be many things: a single character like the =, a character class such as ., \w or [a-z], or a group such as (abc), so that (abc)* matches any string like abcabcabc, and so on. This allows for much richer expressions than the plain old wildcards.
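To illustrate the kinds of tokens a quantifier can apply to, here is a small sketch, again in Python's `re` module as a stand-in (an assumption; QTP itself uses the VBScript engine). The sample strings are invented for the example.

```python
import re

# A group as the token: (abc)* repeats the whole sequence "abc".
group_repeat = re.fullmatch(r"(abc)*", "abcabcabc") is not None

# A character class as the token: [a-z]* accepts only lowercase letters.
lower_only = re.fullmatch(r"[a-z]*", "hello") is not None
lower_rejects_upper = re.fullmatch(r"[a-z]*", "Hello") is not None

# A shorthand class as the token: \w* accepts letters, digits and underscore.
word_chars = re.fullmatch(r"\w*", "qtp_10") is not None

print(group_repeat, lower_only, lower_rejects_upper, word_chars)
# True True False True
```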

Generally, the following equivalences between wildcards and regular expressions hold, at least approximately; there are some details which may not be immediately obvious:

Wildcard        Regex
--------        -----
*               .*
?               .
[a-z]           [a-z]
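The table can be turned into a rough translation routine. The sketch below is in Python and only covers the `*` and `?` rows; bracket sets are not handled, and every other character is escaped so that it matches literally. The helper name `wildcard_to_regex` is hypothetical, not something provided by QTP or by Python.

```python
import re

def wildcard_to_regex(pattern):
    """Translate * and ? as in the table; escape every other character."""
    parts = []
    for ch in pattern:
        if ch == "*":
            parts.append(".*")   # zero or more arbitrary characters
        elif ch == "?":
            parts.append(".")    # exactly one arbitrary character
        else:
            parts.append(re.escape(ch))  # literal, e.g. "." becomes "\."
    return "".join(parts)

txt_regex = wildcard_to_regex("*.txt")
log_regex = wildcard_to_regex("file?.log")

print(txt_regex)                                         # .*\.txt
print(re.fullmatch(txt_regex, "notes.txt") is not None)  # True
print(re.fullmatch(log_regex, "file10.log") is not None) # False
```

Note that the literal dot in `*.txt` has to be escaped in the regex, which is one of the details that is easy to miss.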
Joey
Thanks. How can I tell the engine that it should accept *blah*, meaning "blah" anywhere within the text?
Tomas
Just "blah". Normal letters and numbers are not metacharacters and match exactly what they are.
Joey
Actually, I think that QTP implicitly anchors the expression, so you will need `.*blah.*` (you should check and see).
Motti
Some other things to note: + is like * but matches one or more times, and a ? after a quantifier makes it non-greedy.
Carter Cole
@Carter: I opted to leave those things out of that answer, since they might just be pretty confusing if you don't even know about quantifiers yet ;-)
Joey
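The points raised in these comments can be sketched as follows, using Python's `re` module as a stand-in (an assumption; whether QTP really anchors its patterns is not verified here). `re.search` plays the role of an unanchored engine and `re.fullmatch` the role of an implicitly anchored one. The sample strings are invented.

```python
import re

text = "some blah here"

# Unanchored search: "blah" is found anywhere in the text.
found_anywhere = re.search(r"blah", text) is not None

# Whole-string matching, as an implicitly anchored engine would behave.
whole_plain = re.fullmatch(r"blah", text) is not None
whole_padded = re.fullmatch(r".*blah.*", text) is not None

# + needs at least one repetition, while * also accepts none.
plus_empty = re.fullmatch(r"a+", "") is not None
star_empty = re.fullmatch(r"a*", "") is not None

# A ? after a quantifier makes it lazy: it matches as little as possible.
greedy = re.search(r"<.*>", "<b>bold</b>").group(0)
lazy = re.search(r"<.*?>", "<b>bold</b>").group(0)

print(found_anywhere, whole_plain, whole_padded)  # True False True
print(plus_empty, star_empty)                     # False True
print(greedy, lazy)                               # <b>bold</b> <b>
```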
+2  A: 

The * means: match an expression where the character to the left of the * appears 0 or more times. The . means 'match any character'. So .* means: match any character 0 or more times. In your second expression, there is an equals sign before the *, so it means: match 0 or more equals signs.

Doc Brown
A: 

Nothing more than a copy of QTP's help page.

Special characters and sequences are used in writing patterns for regular expressions. The following table describes and gives an example of the characters and sequences that can be used.

Character Description

\ Marks the next character as either a special character or a literal. For example, "n" matches the character "n". "\n" matches a newline character. The sequence "\\" matches "\" and "\(" matches "(".

^ Matches the beginning of input.

$ Matches the end of input.

* Matches the preceding character zero or more times. For example, "zo*" matches either "z" or "zoo".

+ Matches the preceding character one or more times. For example, "zo+" matches "zoo" but not "z".

? Matches the preceding character zero or one time. For example, "a?ve?" matches the "ve" in "never".

. Matches any single character except a newline character.

(pattern) Matches pattern and remembers the match. The matched substring can be retrieved from the resulting Matches collection, using Item [0]...[n]. To match parentheses characters ( ), use "\(" or "\)".

x|y Matches either x or y. For example, "z|wood" matches "z" or "wood". "(z|w)oo" matches "zoo" or the "woo" in "wood".

{n} n is a nonnegative integer. Matches exactly n times. For example, "o{2}" does not match the "o" in "Bob", but matches the first two o's in "foooood".

{n,} n is a nonnegative integer. Matches at least n times. For example, "o{2,}" does not match the "o" in "Bob" and matches all the o's in "foooood". "o{1,}" is equivalent to "o+". "o{0,}" is equivalent to "o*".

{n,m} m and n are nonnegative integers. Matches at least n and at most m times. For example, "o{1,3}" matches the first three o's in "fooooood". "o{0,1}" is equivalent to "o?".

[xyz] A character set. Matches any one of the enclosed characters. For example, "[abc]" matches the "a" in "plain".

[^xyz] A negative character set. Matches any character not enclosed. For example, "[^abc]" matches the "p" in "plain".

[a-z] A range of characters. Matches any character in the specified range. For example, "[a-z]" matches any lowercase alphabetic character in the range "a" through "z".

[^m-z] A negative range of characters. Matches any character not in the specified range. For example, "[^m-z]" matches any character not in the range "m" through "z".

\b Matches a word boundary, that is, the position between a word and a space. For example, "er\b" matches the "er" in "never" but not the "er" in "verb".

\B Matches a non-word boundary. "ea*r\B" matches the "ear" in "never early".

\d Matches a digit character. Equivalent to [0-9].

\D Matches a non-digit character. Equivalent to [^0-9].

\f Matches a form-feed character.

\n Matches a newline character.

\r Matches a carriage return character.

\s Matches any white space including space, tab, form-feed, etc. Equivalent to "[ \f\n\r\t\v]".

\S Matches any nonwhite space character. Equivalent to "[^ \f\n\r\t\v]".

\t Matches a tab character.

\v Matches a vertical tab character.

\w Matches any word character including underscore. Equivalent to "[A-Za-z0-9_]".

\W Matches any non-word character. Equivalent to "[^A-Za-z0-9_]".

\num Matches num, where num is a positive integer. A reference back to remembered matches. For example, "(.)\1" matches two consecutive identical characters.

\n Matches n, where n is an octal escape value. Octal escape values must be 1, 2, or 3 digits long. For example, "\11" and "\011" both match a tab character. "\0011" is the equivalent of "\001" & "1". Octal escape values must not exceed 256. If they do, only the first two digits comprise the expression. Allows ASCII codes to be used in regular expressions.

\xn Matches n, where n is a hexadecimal escape value. Hexadecimal escape values must be exactly two digits long. For example, "\x41" matches "A". "\x041" is equivalent to "\x04" & "1". Allows ASCII codes to be used in regular expressions.
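Several of the examples from the table above can be checked with a short script. This is a sketch in Python's `re` module, not in QTP; the two engines agree on the constructs used here as far as I know, but they differ in places (octal escapes, for instance), so only a subset is shown. The helper `first_match` is a made-up name for this example.

```python
import re

def first_match(pattern, text):
    """Return the first substring matched by pattern, or None if no match."""
    m = re.search(pattern, text)
    return m.group(0) if m else None

results = {
    "zo* on z":       first_match(r"zo*", "z"),        # "z"
    "zo+ on z":       first_match(r"zo+", "z"),        # None
    "a?ve? on never": first_match(r"a?ve?", "never"),  # "ve"
    "o{2} on Bob":    first_match(r"o{2}", "Bob"),     # None
    "o{2,} on food":  first_match(r"o{2,}", "foooood"),  # "ooooo"
    "[abc] on plain": first_match(r"[abc]", "plain"),  # "a"
    "[^abc] on plain": first_match(r"[^abc]", "plain"),  # "p"
    "er\\b on never": first_match(r"er\b", "never"),   # "er"
    "er\\b on verb":  first_match(r"er\b", "verb"),    # None
    "(.)\\1 on hello": first_match(r"(.)\1", "hello"), # "ll"
    "\\x41 on ABC":   first_match(r"\x41", "ABC"),     # "A"
    "\\( on f(x)":    first_match(r"\(", "f(x)"),      # "("
}

for label, value in results.items():
    print(label, "->", value)
```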

Albert Gareev