ansaurus

Question

How do I check for if an exact string exists in another string?

Answer 1

+5 A:

There are ways to do it without a regex, but most of those ways are so convoluted that you'll wish you had spent the time learning the simple regex sequence that you need for it.

Ignacio Vazquez-Abrams 2010-10-22 05:01:49

That's fair, and what I figured. I was just making sure that there weren't any not convoluted solutions.

Mana 2010-10-22 05:30:09

Answer 2

A:

A little lengthy but gives an idea / of course regex is there to make it simple

>>> indicators = ["therefore", "for", "since"]
>>> phrase = "... therefore, I conclude I am awesome."
>>> phrase_list = phrase.split()
>>> phrase_list
['...', 'therefore,', 'I', 'conclude', 'I', 'am', 'awesome.']
>>> phrase_list = [ k.rstrip(',') for k in phrase_list]
>>> indicators_in_phrase = [indicator for indicator in indicators if indicator in phrase_list]
>>> indicators_in_phrase 
['therefore']

pyfunc 2010-10-22 05:05:06

Answer 3

+1 A:

I think what you are trying to do is something more like this:

import string

words_in_phrase = string.split(phrase)

Now you'll have the words in a list like this:

['...', 'therefore,', 'I', 'conclude', 'I', 'am', 'awesome.']

Then compare the lists like so:

indicators_in_phrase = []
for word in words_in_phrase:
  if word in indicators:
    indicators_in_phrase.append(word)

There's probably several ways to make this less verbose, but I prefer clarity. Also, you might have to think about removing punctuation as in "awesome." and "therefore,"

For that use rstrip as in the other answer

jgritty 2010-10-22 05:08:57

Answer 4

+1 A:

Is the problem with "for" that it's inside "therefore" or that it's not a word? For example, if one of your indicators was "awe", would you want it to be included in indicators_in_phrase?

How would you want the following situation to be handled? indicators = ["abc", "cde"] phrase = "One abcde two"

Francis Potter 2010-10-22 05:09:39

If it was "awe", I would not want it to be included in indicators_in_phrase. In the example you gave, indicators_in_phrase would be the empty list.

Mana 2010-10-22 05:26:47

Answer 5

A:

You can strip off punctuations from your phrase, then do split on it so that all words are individual. Then you can do your string comparison

>>> indicators = ["therefore", "for", "since"]
>>> phrase = "... therefore, I conclude I am awesome."
>>> ''.join([ i for i in phrase.lower() if i not in string.punctuation]).strip().split()
['therefore', 'I', 'conclude', 'I', 'am', 'awesome']
>>> p = ''.join([ i for i in phrase.lower() if i not in string.punctuation]).strip().split()
>>> indicators_in_phrase = [indicator for indicator in indicators if indicator in p ]
>>> indicators_in_phrase
['therefore']

ghostdog74 2010-10-22 05:09:43

Answer 6

+1 A:

It is one line with regex...

import re

indicators = ["therefore", "for", "since"]
phrase = "... therefore, I conclude I am awesome."

indicators_in_phrase = set(re.findall(r'\b(%s)\b' % '|'.join(indicators), phrase.lower()))

Paulo Scardine 2010-10-22 05:13:02

This is awesome, but can you please explain how the regex here works? I'm struggling to understand what's going on.

Mana 2010-10-22 05:28:00

The regex is `\b(therefore|for|since)\b` which looks for either a word of the three, surround by *word boundaries* (`\b`). So you can be sure that those words are separate words like that.

poke 2010-10-22 05:47:23

Ahh, wow. That's great. Definitely looking into learning Regex then.

Mana 2010-10-22 07:22:28

Answer 7

+1 A:

Create set of indicators
Create set of phrases
Find intersection

Code:

indicators = ["therefore", "for", "since"]
phrase = "... therefore, I conclude I am awesome."
print list(set(indicators).intersection(set( [ each.strip('.,') for each in phrase.split(' ')])))

Cheers:)

ShyamLovesToCode 2010-10-22 05:18:42

You can replace `each.strip('.').strip(',')` with `each.strip('.,')` see also http://docs.python.org/library/stdtypes.html#str.strip

rubik 2010-10-22 14:21:30

Thanks for the information, I will make the change :)

ShyamLovesToCode 2010-10-23 04:40:32

Answer 8

+1 A:

The regex are the simplest way! Hint:

re.compile(r'\btherefore\b')

Then you can change the word in the middle!

EDIT: I wrote this for you:

import re

indicators = ["therefore", "for", "since"]

phrase = "... therefore, I conclude I am awesome. "

def find(phrase, indicators):
    def _match(i):
        return re.compile(r'\b%s\b' % (i)).search(phrase)
    return [ind for ind in indicators if _match(ind)]

>>> find(phrase, indicators)
['therefore']

rubik 2010-10-22 05:32:31

ansaurus

tags:

views:

answers:

How do I check for if an exact string exists in another string?

related questions