I'm parsing some html using regex and I want to match lines which start with a word without any html tags while also removing the white space. Using c# regex my first pattern was:
pattern = @"^\s*([^<])";
which attempts to grab all the white space and then capture any non '<' characters. Unfortunately if the line is all white space before the first '<' this returns the last white space character before the '<'. I would like this to fail the match.
Any ideas?