ansaurus

Question

Answer 1

A:

In which language are your trying to do this? This is perl-comaptible regular expression to match such case: ,(?!(\s|\d{3}[^\d])) (it will match commas not followed by space or exact 3 digits, so if string matches this regexp it is not valid)

krcko 2009-10-31 17:30:33

This one matches ,233 which it should not match

Andomar 2009-10-31 17:36:05

I'm using C#.Using your regex, Regex CommaError = new Regex(@",(?!(\s|\d{3}[^\d]))");It fails test case #2 for some reason.

gw 2009-10-31 17:36:23

It's failing because the `[^\d]` is saying there has to be a non-digit after the 3 digits. Since the 233 (or 234 in case #2) is at the end of the string, there is no non-digit after the 3 digits.

Laurence Gonsalves 2009-10-31 18:02:11

Instead of `[^\d]` it should be another lookahead: `(?!\d)`. @Laurence, your regex should have that, too. Currently, it fails to flag a comma that's followed by four or more digits, e.g. `1,2345`.

Alan Moore 2009-11-01 01:35:25

Answer 2

+5 A:

You can only use ^ to mean not inside of a character class (eg: [^a-b]) in most regex syntaxes.

The simplest thing for you to do wuld be to invert the condition in your if statement.

If you can't do that for whatever reason you can use a negative lookahead in some regex syntaxes. eg:

,(?!\d\d\d(?!\d)|\s)

In regex syntaxes that don't support negative assertions you can still do what you want, but the bigger the negative match the more complicated the regex gets. eg:

,($|[^ \d]|\d$|\d[^\d]|\d\d$|\d\d[^\d]|\d\d\d\d)

Essentially you have to enumerate all of the bad cases.

Laurence Gonsalves 2009-10-31 17:31:09

You don't need the non-capturing group when doing alternation inside a lookahead, using `,(?!\d\d\d|\s)` will work.

Peter Boughton 2009-10-31 17:36:54

Peter: good point. It was harmless, but unnecessary. I've removed it.

Laurence Gonsalves 2009-10-31 17:39:35

Peter, thank you, your regex works the way I want it to. :)

gw 2009-10-31 17:41:51

+1 Learning something every day on SO :) haha

Andomar 2009-10-31 17:43:38

The lookahead version is not quite right. See my comment to @krcko's answer.

Alan Moore 2009-11-01 01:38:38

Alan: thanks for pinting that out I hadn't noticed the "exactly" bit of the question (or the `,\d\d\d\d` testcase). I've updated both regexes.

Laurence Gonsalves 2009-11-01 17:00:45

ansaurus

tags:

views:

answers:

regex to test for correct use of commas

related questions