ansaurus

Question

Answer 1

A:

Since you said all instance of C,B,D, I would think you'd want to use a grouping for that [CBD]* Also, if you're just looking for something to be after the letter A but before F, then you should be able to use those literals along with some exclusions.

Here's a pattern I came up with. Group $4 should contain the letter DBC

([^A]*)(A)([^CBDF]*)([CBD]*)([^F]*)(F)(.*)

Here's an example of this pattern in action.

The question is, what do you want if the original string is CBDAEDEBECEFBCD?

Snekse 2010-09-30 20:34:41

Sorry, all the letters are place holders for more complex groups (I'll update the question) - so I can't just use literal exclusions. The string CBDAEDEBECEFBCD you suggest shouldn't match at all -- there's just a bunch of E's between A and the first (B|C|D), and a bunch of E's immediately before the F. Again, in my app, they're not just E's, they're just text that I don't need.

Jimmy 2010-09-30 21:07:10

If that's the case, then look-arounds are probably your only option.

Snekse 2010-09-30 21:15:18

Are you able to suggest a look around that works? Even with look arounds I still can't get it to work.

Jimmy 2010-09-30 22:04:35

Answer 2

+1 A:

Yeah, forget the lookarounds, they just complicate things needlessly. But I suspect your final regex will work if you make that first .+ reluctant:

(?<start>A).+?(?<content>B|C|D)+.+(?<end>F)

EDIT: yep:

string s = "CBDAEDBCEFBCD";
Regex r = new Regex(@"(?<start>A).+?(?<content>B|C|D)+.+(?<end>F)");

foreach (Match m in r.Matches(s))
{
  Console.WriteLine(@"Groups[""start""] = {0}", m.Groups["start"]);
  foreach (Capture c in m.Groups["content"].Captures)
  {
    Console.WriteLine(@"Capture[""content""] = {0}", c.Value);
  }
  Console.WriteLine(@"Groups[""end""] = {0}", m.Groups["end"]);
}

output:

Groups["start"] = A
Capture["content"] = D
Capture["content"] = B
Capture["content"] = C
Groups["end"] = F

Alan Moore 2010-09-30 23:22:57

Really! So lookarounds are not needed huh... amazing.

Jimmy 2010-10-01 00:44:17

@Jimmy: In this case, yes, lookarounds are not needed. But they do have their uses. :P

Alan Moore 2010-10-01 09:04:45

ansaurus

tags:

views:

answers:

Regex: captures, groups, confusion

related questions