tags:

views:

47

answers:

2

Hey all,

related question: http://stackoverflow.com/questions/919056/python-case-insensitive-replace

What's the best way to do a case insensitive replace WITHOUT HURTING THE CACHE in the re module? I'm monitoring carefully the cache to make sure my favorite regexes stay there (speed, of course).

I just notice that my code:

ner_token_result = re.sub('(?i)'+leftover, corrected_word, ner_token_result)

is re.compiling every time it is run. leftover is dynamic (based on user input).

I like regular expressions (fast, I can read them) but I don't want to hurt my cache.

I don't want to use a caseless string class...

I don't want the ugliness of converting to lowercase, replacing and restoring case...

Please help?

+2  A: 

If your other expressions are pre-compiled it means you did something like this:

regex = re.compile(leftover, re.I)

Which means you will be able to refer to regex regardless of cache overloading. If you didn't do this, do it for those regexes that need to be re-used throughout your code.

SilentGhost
+2  A: 

Obviously the dynamic regex needs to be compiled each time leftover changes. Are you worried that this is pushing your other regexs out of the cache?

If so, simply compile the other regexs you are using with re.compile

gnibbler
hmmm... Compile uses the cache as well... What am I missing?
Tal Weiss
I get it. Silly of me...
Tal Weiss