python regex match and replace | ansaurus

tags:

python
regex

views:

52

answers:

1

+1 Q:

python regex match and replace

I need to find, process and remove (one by one) any substrings that match a rather long regex:

# p is a compiled regex
# s is a string  
while 1:
    m = p.match(s)
    if m is None:
        break
    process(m.group(0)) #do something with the matched pattern
    s = re.sub(m.group(0), '', s) #remove it from string s

The code above is not good for 2 reasons:

It doesn't work if m.group(0) happens to contain any regex-special characters (like *, +, etc.).
It feels like I'm duplicating the work: first I search the string for the regular expression, and then I have to kinda go look for it again to remove it.

What's a good way to do this?

+3 A:

The re.sub function can take a function as an argument so you can combine the replacement and processing steps if you wish:

def process_match(m):
    # Process the match here.
    return ''

s = p.sub(process_match, s)

Mark Byers 2010-08-22 21:55:02

Thanks, forgot about that..

max 2010-08-22 22:42:00

Ah and I figured out what to do about if I do want to replace a string that may contain regex symbols in it.. re.escape(s) takes care of that.

max 2010-08-23 06:05:19

related questions

Programmatically talking to a Serial Port in OS X or Linux

Best ways to teach a beginner to program?

Calling a Function From a String With the Function's Name in Python

An executable Python app

Text Editor For Linux (Besides Vi)?

What Hosting Service is best for Django applications?

File size differences after copying a file to a server vía FTP

Python: what is the difference between (1,2,3) and [1,2,3], and when should I use each?

Python: What OS am I running on?

How do I make a menu in python that does not require the user to press (enter) to make a selection?

How do you express binary literals in Python?

What is the most efficient graph data structure in Python?

Adding a Method to an Existing Object

How to learn Python: Good Example Code?

How do I use Python's itertools.groupby()?

Python and MySQL

Class views in Django

Is there an IDE that provides code completion for Python

Using 'in' to match an attribute of Python objects in an array

cx_Oracle - what is the best way to iterate over a result set?

cx_Oracle - How do I access Oracle from Python?

Continuous Integration System for a Python Codebase

Get a preview jpeg of a pdf on windows?

How can I find the full path to a font from its display name on a Mac?

XML Processing in Python