ansaurus

Question

Answer 1

A:

Assuming you want to split between the "Foo" and the number, you'd want something like:

r/(?<=\D)(?=\d)/

Which will match at a point between a nondigit and a digit, without consuming any characters in the split.

Anon. 2010-01-28 21:03:52

Great idea, but won't work, at least in Python. It ignores lookarounds in regexes that do not match any characters.

Max Shawabkeh 2010-01-28 21:09:20

...seriously? I wonder what the rationale for that behaviour is.

Anon. 2010-01-28 21:10:12

@Max S. see my edit...it appears to have some viability in python after all..

AJ 2010-01-28 21:12:18

Hmm, it seems to ignore them for `split()` but it does work for `search()`. Strange.

Max Shawabkeh 2010-01-28 21:14:40

Answer 2

+1 A:

Using groups:

import re

m=re.match('^(?P<first>[A-Za-z]+)(?P<second>[0-9]+)$',"Foo9")
print m.group('first')
print m.group('second')

Using search:

import re

s='Foo9'
m=re.search('(?<=\D)(?=\d)',s)
first=s[:m.start()]
second=s[m.end():]

print first, second

AJ 2010-01-28 21:07:24

Answer 3

+4 A:

You can't use split() since that has to consume some characters, but you can use normal matching to do it.

>>> import re
>>> r = re.compile(r'(\D+)(\d+)')
>>> r.match('abc444').groups()
('abc', '444')

Max Shawabkeh 2010-01-28 21:11:23

Answer 4

A:

>>> import re
>>> s="gnibbler1234"
>>> re.findall(r'(\D+)(\d+)',s)[0]
('gnibbler', '1234')

In the regex, \D means anything that is not a digit, so \D+ matches one or more things that are not digits.

Likewise \d means anything that is a digit, so \d+ matches one or more digits

gnibbler 2010-01-28 21:19:08

Answer 5

+1 A:

Keeping it simple:

>>> import re
>>> a = "Foo1String12345"
>>> re.split(r'(\d+)$', a)[0:2]
['Foo1String', '12345']

MikeyB 2010-01-28 21:25:22

simple... and allowing for digits in the "arbitrary string" :-p

MikeyB 2010-01-28 21:31:26

Python regex split, integer of arbitrary length