ansaurus

Question

Python: Replace string with prefixStringSuffix keeping original case, but ignoring case when searching for match

Answer 1

+2 A:

Is this OK?

>>> import re
>>> myString = "HI there. You should higher that person for the job. Hi hi."
>>> keyword = "hi"
>>> search = re.compile(r'\b(%s)\b' % keyword, re.I)
>>> search.sub('<b>\\1</b>', myString)
'<b>HI</b> there. You should higher that person for the job. <b>Hi</b> <b>hi</b>.'

The key to the whole thing is using word boundaries, groups and the re.I flag.
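To see what each of those three ingredients contributes, here is a small sketch that removes them one at a time. The names `full`, `no_boundary`, `case_sensitive` and `bold` are made up for this illustration and are not part of the answer above.

```python
import re

text = "HI there. You should higher that person for the job. Hi hi."
keyword = "hi"

# All three ingredients: \b boundaries, a capturing group, and re.I.
full = re.compile(r'\b(%s)\b' % keyword, re.I)

# Without \b, the "hi" inside "higher" is matched as well.
no_boundary = re.compile(r'(%s)' % keyword, re.I)

# Without re.I, only the exact-case "hi" is matched.
case_sensitive = re.compile(r'\b(%s)\b' % keyword)

def bold(pattern, s):
    # \1 re-inserts the text exactly as it was matched, so the
    # original casing survives the substitution.
    return pattern.sub(r'<b>\1</b>', s)

print(bold(full, text))
print(bold(no_boundary, text))
print(bold(case_sensitive, text))
```

The group is what lets the replacement reuse the matched text instead of the keyword as typed, which is why "HI" and "Hi" keep their case.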

Paolo Bergantino 2009-05-04 04:01:01

This is pretty much what I wanted. I might have to change what constitutes a word boundary, as Dave B noted, but that should be easy to edit; I'll have to look through the data and figure that out later (if I need to). Otherwise this is exactly what I needed, and I'm sure it covers all the cases I could come up with. Thanks.

Johnny4000 2009-05-04 20:49:31

Answer 2

A:

You should be able to do this very easily with re.sub using the word boundary assertion \b, which only matches at a word boundary:

import re

def SurroundWith(text, keyword, before, after):
  regex = re.compile(r'\b%s\b' % keyword, re.IGNORECASE)
  return regex.sub(r'%s\0%s' % (before, after), text)

Then you get:

>>> SurroundWith('HI there. You should hire that person for the job. '
...              'Hi hi.', 'hi', '<b>', '</b>')
'<b>HI</b> there. You should hire that person for the job. <b>Hi</b> <b>hi</b>.'

If you have more complicated criteria for what constitutes a "word boundary," you'll have to do something like:

def SurroundWith2(text, keyword, before, after):
  regex = re.compile(r'([^a-zA-Z0-9])(%s)([^a-zA-Z0-9])' % keyword,
                     re.IGNORECASE)
  return regex.sub(r'\1%s\2%s\3' % (before, after), text)

You can modify the [^a-zA-Z0-9] groups to match anything you consider a "non-word."
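One caveat with consuming groups like these: the pattern needs an actual character on each side of the keyword, so a keyword at the very start or end of the text is skipped, and so is the second of two matches separated by a single character. A possible alternative, sketched here under the assumption that the same [a-zA-Z0-9] definition of a "word" character is wanted, is to use lookaround assertions, which check the neighbours without consuming them. The function name `surround_with_lookaround` is hypothetical.

```python
import re

def surround_with_lookaround(text, keyword, before, after):
    # (?<![a-zA-Z0-9]) : no "word" character immediately before the match
    # (?![a-zA-Z0-9])  : no "word" character immediately after the match
    # Neither assertion consumes text, so matches at the start or end of
    # the string, or separated by one character, are still found.
    regex = re.compile(
        r'(?<![a-zA-Z0-9])%s(?![a-zA-Z0-9])' % re.escape(keyword),
        re.IGNORECASE)
    # A function replacement sidesteps backslash handling in templates.
    return regex.sub(lambda m: before + m.group(0) + after, text)

print(surround_with_lookaround('HI there. Hi hi', 'hi', '<b>', '</b>'))
print(surround_with_lookaround('hi_there', 'hi', '<b>', '</b>'))
```

With this character class an underscore counts as a non-word character, so the keyword in 'hi_there' is wrapped, whereas \b would not match there. Adjust the class to whatever fits the data.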

2009-05-04 04:01:59

I ran

before = '<b>'
after = '</b>'
text = "HI there. You should higher that person for the job. Hi hi."
keyword = 'hi'
print 'result = ', SurroundWith(text, keyword, before, after)

and got result = <b>

Johnny4000 2009-05-04 20:50:06
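The truncated result reported in this comment is consistent with how Python treats \0 in a replacement template: as far as I can tell it is read as an octal escape for the NUL character rather than as "the whole match", and many terminals stop printing at a NUL. The documented way to refer to the entire match is \g<0>. A minimal sketch of the function with that one change (the lowercase name `surround_with` is only used here to keep it apart from the original):

```python
import re

def surround_with(text, keyword, before, after):
    regex = re.compile(r'\b%s\b' % keyword, re.IGNORECASE)
    # \g<0> stands for the entire matched substring; \0 would instead be
    # interpreted as an octal escape (the NUL character).
    return regex.sub(r'%s\g<0>%s' % (before, after), text)

text = "HI there. You should higher that person for the job. Hi hi."
print(surround_with(text, 'hi', '<b>', '</b>'))
```

Note that `before` and `after` are still interpolated into the template, so a backslash in either of them would be interpreted by re.sub; passing a function as the replacement avoids that.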

Answer 3

A:

I think the best solution would be a regular expression:

import re
def reg(keyword, myString) :
   regx = re.compile(r'\b(' + keyword + r')\b', re.IGNORECASE)
   return regx.sub(r'<b>\1</b>', myString)

Of course, you must first make your keyword "regular expression safe" (quote any regex special characters).
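One way to do that quoting is the standard library's re.escape. The sketch below is the same function with the keyword escaped; the name `reg_escaped` and the sample strings are invented for the illustration.

```python
import re

def reg_escaped(keyword, my_string):
    # re.escape backslash-quotes regex metacharacters in the keyword,
    # so "a.b" matches a literal dot rather than any character.
    regx = re.compile(r'\b(' + re.escape(keyword) + r')\b', re.IGNORECASE)
    return regx.sub(r'<b>\1</b>', my_string)

print(reg_escaped('a.b', 'a.b axb A.B'))
```

Without the escaping, the keyword 'a.b' would also match 'axb'. Keep in mind that \b only fires between a word and a non-word character, so a keyword that begins or ends with punctuation (for example 'c++') may need a different boundary test.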

Francis 2009-05-04 04:06:50

Answer 4

+1 A:

Here's one suggestion, from the nitpicking committee. :-)

myString = "HI there. You should higher that person for the job. Hi hi."

myString = myString.replace('higher', 'hire')

2009-07-20 19:08:31
