ansaurus

Question

Answer 1

+2 A:

s[len(start):-len(end)]

Tim McNamara 2010-07-30 05:56:47

This is very nice, assuming start and end are always at the start and end of the string. Otherwise, I would probably use a regex.

jeremiahd 2010-07-30 06:01:02

I went the most Pythonic answer to the original question I could think of. Testing using the `in` operator would probably be faster than regexp.

Tim McNamara 2010-07-30 06:13:14

Answer 2

+1 A:

My method will be to do something like,

find index of start string in s => i
find index of end string in s => j

substring = substring(i+len(start) to j-1)

Prabhu Jayaraman 2010-07-30 05:56:47

Answer 3

A:

s = "123123STRINGabcabc"

def find_between( s, first, last ):
    try:
        start = s.index( first ) + len( first )
        end = s.index( last, start )
        return s[start:end]
    except ValueError:
        return ""

def find_between_r( s, first, last ):
    try:
        start = s.rindex( first ) + len( first )
        end = s.rindex( last, start )
        return s[start:end]
    except ValueError:
        return ""


print find_between( s, "123", "abc" )
print find_between_r( s, "123", "abc" )

gives:

123STRING
STRINGabc

I thought it should be noted - depending on what behavior you need, you can mix index and rindex calls or go with one of the above versions (it's equivalent of regex (.*) and (.*?) groups).

cji 2010-07-30 05:58:16

He said that he wanted a way that was more Pythonic, and this is decidedly less so. I'm not sure why this answer was picked, even OP's own solution is better.

Jesse Dhillon 2010-07-30 06:37:34

Agreed. I'd use the solution by @Tim McNamara , or the suggestion by the same of something like `start+test+end in substring`

jeremiahd 2010-07-30 12:31:38

Right, so it's less pythonic, ok. Is it less efficient than regexps too? And there's also @Prabhu answer you need to downvote, as it suggest the same solution.

cji 2010-07-30 19:42:21

Answer 4

+1 A:

Here is one way to do it

_,_,rest = s.partition(start)
result,_,_ = rest.partition(end)
print result

Another way using regexp

import re
print re.findall(re.escape(start)+"(.*)"+re.escape(end),s)[0]

or

print re.search(re.escape(start)+"(.*)"+re.escape(end),s).group(1)

gnibbler 2010-07-30 05:58:26

Answer 5

+2 A:

import re

s = 'asdf=5;iwantthis123jasd'
result = re.search('asdf=5;(.*)123jasd', s)
print result.group(1)

Nikolaus Gradwohl 2010-07-30 05:59:57

OP added additional information that makes this one the best solution, IMO.

Jesse Dhillon 2010-07-30 06:39:33

@Jesse Dhillon -- what about @Tim McNamara's suggestion of something like `''.join(start,test,end) in a_string`?

jeremiahd 2010-07-30 13:13:25

Answer 6

A:

This I posted before as code snippet in Daniweb:

# picking up piece of string between separators
# function using partition, like partition, but drops the separators
def between(left,right,s):
    before,_,a = s.partition(left)
    a,_,after = a.partition(right)
    return before,a,after

s = "bla bla blaa <a>data</a> lsdjfasdjöf (important notice) 'Daniweb forum' tcha tcha tchaa"
print between('<a>','</a>',s)
print between('(',')',s)
print between("'","'",s)

""" Output:
('bla bla blaa ', 'data', " lsdjfasdj\xc3\xb6f (important notice) 'Daniweb forum' tcha tcha tchaa")
('bla bla blaa <a>data</a> lsdjfasdj\xc3\xb6f ', 'important notice', " 'Daniweb forum' tcha tcha tchaa")
('bla bla blaa <a>data</a> lsdjfasdj\xc3\xb6f (important notice) ', 'Daniweb forum', ' tcha tcha tchaa')
"""

Tony Veijalainen 2010-07-30 07:16:36

Answer 7

A:

String formatting adds some flexibility to what NG suggested. start and end can now be amended as desired.

import re

s = 'asdf=5;iwantthis123jasd'
start = 'asdf=5;'
end = '123jasd'

result = re.search('%s(.*)%s' % (start, end), s).group(1)
print(result)

Tim McNamara 2010-07-30 07:47:56

ansaurus

tags:

views:

answers:

Find string between two substrings

related questions