ansaurus

Question

Determining the unmatched portion of a string using a regex in Python

Answer 1

+1 A:

Use re.sub:

import re
s = "87 foo 87 bar"
r = re.compile(r"87\s*")
s = r.sub('', s)
print(s)

Result:

foo bar

Mark Byers 2010-02-03 20:51:50

Exactly what I was looking for. I knew there was a simple way. Thanks!

Art 2010-02-03 21:17:37

You can also merge `r = re.compile(); s = r.sub()` into `s = re.sub()`.

EOL 2010-02-04 08:53:56

Answer 2

+1 A:

>>> import re
>>> re.sub(r"87\s*", "", "87 foo 87 bar")
'foo bar'

Greg Bacon 2010-02-03 21:05:19

Answer 3

+1 A:

Instead of splitting or separating, you can use re.sub to replace every match of the pattern with an empty string (""). For example:

>>> import re
>>> re.sub(r"^a\s*", "", "a foobar")
'foobar'
>>> re.sub(r"a\s*", "", "a foobar a foobar")
'foobr foobr'
>>> re.sub(r"87\s*", "", "87 foo 87 bar")
'foo bar'

VMDX 2010-02-03 21:05:29

Answer 4

+1 A:

From http://docs.python.org/library/re.html#re.split:

>>> re.split(r'(\W+)', 'Words, words, words.')
['Words', ', ', 'words', ', ', 'words', '.', '']

so your example would be

>>> re.split(r'(^a\s*)', "a foobar")
['', 'a ', 'foobar']

at which point you can separate the odd items (your match) from the even items (the rest).

>>> l = re.split(r'(^a\s*)', "a foobar")
>>> l[1::2] # matching strings
['a ']
>>> l[::2] # non-matching strings
['', 'foobar']

This has the advantage over re.sub that you can tell whether, where, and how many times the pattern matched.

cobbal 2010-02-03 21:14:13
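
The re.split approach from Answer 4 can be wrapped in a small helper. This is only a sketch: `split_matched` is a hypothetical name (not part of the re module), and it assumes the pattern passed in contains no capturing groups of its own, since those would add extra items to the list returned by re.split.

```python
import re

def split_matched(pattern, text):
    """Return (matched, unmatched) lists of substrings of text.

    Hypothetical helper for illustration; assumes `pattern` has no
    capturing groups of its own.
    """
    # Wrapping the pattern in one capturing group makes re.split keep
    # the matched text in the result instead of discarding it.
    parts = re.split("(" + pattern + ")", text)
    matched = parts[1::2]    # odd indices: text the pattern matched
    unmatched = parts[::2]   # even indices: text between the matches
    return matched, unmatched

matched, unmatched = split_matched(r"87\s*", "87 foo 87 bar")
print(matched)               # ['87 ', '87 ']
print("".join(unmatched))    # foo bar
print(len(matched))          # 2
```

If exact character offsets are needed rather than list positions, re.finditer returns match objects whose start() and end() methods give them.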
