ansaurus

Question

.NET regular expression - ? isn't working (like I think it should)

Answer 1

+1 A:

There's a free tool called Regex Coach you can use to easily debug your regular expressions.

Adrian Grigore 2009-09-03 15:19:23

Answer 2

+2 A:

I'd consider preprocessing the HTML and removing content that is known to trip up the regex implementation.
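A minimal sketch of that kind of preprocessing, written in Python for brevity (the same patterns should work with .NET's Regex.Replace and RegexOptions.Singleline). Which content actually trips up the regex is not stated here, so treating comments and script/style blocks as the culprits is purely an assumption, and `strip_noise` is a hypothetical helper name:

```python
import re

def strip_noise(html):
    """Remove blocks that may contain stray tag-like text before matching."""
    # Assumption: HTML comments are one source of false matches.
    html = re.sub(r'<!--.*?-->', '', html, flags=re.DOTALL)
    # Assumption: <script> and <style> bodies are another; \1 matches the same tag name.
    html = re.sub(r'<(script|style)\b[^>]*>.*?</\1\s*>', '', html,
                  flags=re.DOTALL | re.IGNORECASE)
    return html

# Hypothetical input: a comment and a script both contain anchor-like text.
sample = '<p>a</p><!-- <a href="x">no</a> --><script>var s = "</a>";</script><p>b</p>'
cleaned = strip_noise(sample)
print(cleaned)
```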

As far as testers go, you can also use Regex Hero, since Silverlight's Regex implementation is compatible with .NET's.

Richard Szalay 2009-09-03 15:29:09

+1 - I've been looking for a decent online regex tester, thanks!

John Rasch 2009-09-03 16:49:15

Answer 3

A:

Try:

href[^<>]+>(.*?)<\/a[^<>]*>(.*?)<\/span

From what I can tell, "/a.*>" is being too greedy. I always try to be as specific as possible when writing regexes, which is why I used "[^<>]+".
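To illustrate the difference, here is a small sketch in Python (the quantifier behaviour shown is the same in .NET). The original HTML is not shown in this thread, so the sample string is made up, and the "greedy" pattern is only an assumed reconstruction of the problem based on the "/a.*>" fragment:

```python
import re

# Hypothetical sample; the question's real HTML is not shown in this thread.
html = ('<span><a href="/u/1">Alice</a> admin</span>'
        '<span><a href="/u/2">Bob</a> guest</span>')

# Assumed shape of the problem pattern: a greedy ".*" right after "</a".
greedy = re.compile(r'href[^<>]+>(.*?)</a.*>(.*?)</span')
# The more specific pattern: "[^<>]*" cannot run past the end of the tag.
specific = re.compile(r'href[^<>]+>(.*?)</a[^<>]*>(.*?)</span')

greedy_groups = greedy.search(html).groups()
specific_pairs = specific.findall(html)

print(greedy_groups)
print(specific_pairs)
```

With this sample, the greedy version swallows everything up to the last possible ">" and pairs "Alice" with " guest", while the specific version yields one pair per link.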

David Rogers 2009-09-03 15:31:08

Answer 4

+1 A:

Avoid the "." character. It usually gives you nothing but trouble... because it is unspecific.

Try something like this:

href=[^>]*>([^<]*)</a\s*>((?:(?!</span\s*>).)*)

Note: since your sample doesn't return a name-value pair, but rather just a name (assuming the first capture group is the name), I don't know what you'd expect it to match. Maybe post a more complete sample and specify exactly what parts you'd like to have captured.
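As a rough check of how that pattern behaves, here is a sketch in Python (the negated classes and the negative lookahead work the same way in .NET). The sample HTML is invented for illustration, including the extra whitespace in the closing tags:

```python
import re

# Hypothetical sample; note the whitespace in "</a >" and "</span >".
html = ('<span><a href="/u/1">Alice</a > <b>admin</b></span >'
        '<span><a href="/u/2">Bob</a> guest</span>')

pattern = re.compile(
    r'href=[^>]*>'               # rest of the opening <a ...> tag
    r'([^<]*)'                   # group 1: link text, stops at the next "<"
    r'</a\s*>'                   # closing </a>, tolerating whitespace
    r'((?:(?!</span\s*>).)*)'    # group 2: any char not starting a closing </span>
)

pairs = pattern.findall(html)
print(pairs)
```

Group 2 consumes one character at a time and stops as soon as a closing span tag begins, so other tags such as `<b>` are allowed inside it. As in .NET without RegexOptions.Singleline, "." does not match a newline here, so group 2 will not cross line breaks.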

Lucero 2009-09-03 15:51:24
