ansaurus

Question

regular expression for bit strings with even number of 1s

Answer 1

+5 A:

A if false. It doesn't get matched by 0110 (or any zeros-only non-empty string)

B represents OK. I won't bother proving it here since the page margins are too small.

C doesn't get matched by 010101010 (zero in the middle is not matched)

D as you said doesn't get matched by 00 or any other # with no ones.

So only B

DVK 2010-04-24 05:22:30

+1 for the Fermat reference!

Jim Lewis 2010-04-24 05:37:58

A also fails to match `0*`

BCS 2010-04-24 16:53:25

The string "`000`" has an even number of 1s (zero 1s) but the A regex doesn't match it. (I guess I should have said that the A regex doesn't match `0+` as it does get the empty string). --- I pointed it out because It's an important corner case that hadn't been brought up and I did so *here* because I didn't think it was worth it's own answer.

BCS 2010-04-24 17:57:49

Ah. OK, gotcha... Updated! Thanks!

DVK 2010-04-24 18:24:18

Answer 2

A:

Look for examples that should match but don't. 0, 11011, and 1100 should all match, but each one fails for one of those four

Michael Mrozek 2010-04-24 05:25:34

Answer 3

A:

C is incorrect because it does not allow any 0s between the second 1 of one group and the first 1 of the next group.

Ignacio Vazquez-Abrams 2010-04-24 05:26:26

Answer 4

A:

a quick python script actually eliminated all the possibilities:

import re

a = re.compile("(0*10*1)*")
b = re.compile("0*(10*10*)*")
c = re.compile("0*(10*1)* 0*")
d = re.compile("0*1(10*1)* 10*")

candidates = [('a',a),('b',b),('c',c),('d',d)]
tests = ['0110', '1100', '0011', '11011']
for test in tests:
    for candidate in candidates:
        if not candidate[1].match(test):
            candidates.remove(candidate)
            print "removed %s because it failed on %s" % (candidate[0], test)

ntests = ['1', '10', '01', '010', '10101']
for test in ntests:
    for candidate in candidates:
        if candidate[1].match(test):
            candidates.remove(candidate)
            print "removed %s because it matched on %s" % (candidate[0], test)

the output:

removed c because it failed on 0110
removed d because it failed on 0110
removed a because it matched on 1
removed b because it matched on 10

Igor 2010-04-24 05:40:57

Just because you haven't disproven B, doesn't mean that you have proven B. Nice effort, though, just fallacious logic.

polygenelubricants 2010-04-24 05:54:23

oops, my bad. when anchoring the expressions (putting each one between a ^ and a $), the only one that survives is B. of course, you'd still have to prove it...

Igor 2010-04-24 05:56:31

I don't think the whitespaces within the regular expressions are supposed to count. You should rerun it with whitespace ignored.

Gabe 2010-04-24 06:33:57

Answer 5

+2 A:

To solve such a problem you should

Supply counterexample patterns to all "incorrect" regexps. This will be either a string in L that is not matched, or a matched string out of L.
To prove the remaining "correct" pattern, you should answer two questions:
- Does every string that matches the pattern belong to L? This can be done by devising properties each of matched strings should satisfy--for example, number of occurrences of some character...
- Is every string in L matched by the regexp? This is done by dividing L into easily analyzable subclasses, and showing that each of them matches pattern in its own way.

(No concrete answers due to [homework]).

Pavel Shved 2010-04-24 06:44:52

Answer 6

A:

Examining the pattern B:

^0*(10*10*)*$

^          # match beginning of string
0*         # match zero or more '0'
(          # start group 1
 10*       # match '1' followed by zero or more '0'
 10*       # match '1' followed by zero or more '0'
)*         # end group 1 - match zero or more times
$          # end of string

Its pretty obvious that this pattern will only match strings who have 0,2,4,... 1's.

gnarf 2010-04-24 06:54:37

ansaurus

tags:

views:

answers:

regular expression for bit strings with even number of 1s

related questions