ansaurus

Question

Difficulty with a simple regex for .obj files

Answer 1

+1 A:

Grouping in java regex is a little strange. Group 0 gives you the complete match of your regex - this is the same in all regex implementations I know. But group n (for n >= 1) will give you the last match of the n th declared group, not the n th match found.

Your second match gives you ' weasels' with a leading blank, because your pattern contains that blank. You declared your 2nd group (( \\S+)+) and this group gives you the second match.

If you apply your pattern to the string a b c d, your group 0 will be a b c d, group 1 will be a, group 2 will be b c d and group 3 will be d, because this is the last match of your 3rd declared (inner) group ( \\S+).

tangens 2010-09-25 22:07:55

Hm. From my perspective, *"the last match of the n'th declared group"* is the only logical thing. What regex engine gives you the n'th match found? This makes no sense at all.

Tomalak 2010-09-25 22:33:03

OK, perhaps it was only me who was confused about this, because I expected to be able to reference all matches even if a group matched multiple times.

tangens 2010-09-25 22:52:07

Following your logic, would that mean that in `(a)*(b)` the `b` would be represented by different numbers depending on how often `a` matched? That's just not right. ;-) The .NET framework [supports `CaptureCollection`](http://msdn.microsoft.com/en-us/library/system.text.regularexpressions.capturecollection(v=VS.90\).aspx) which lets you do this kind of thing. However, that's a rather unusual feature with regex engines.

Tomalak 2010-09-26 07:02:48

ansaurus

tags:

views:

answers:

Difficulty with a simple regex for .obj files

related questions