Python RegEx Matching Newline | ansaurus

tags:

python
regex

Q:

Python RegEx Matching Newline

I have the following regular expression:

[0-9]{8}.*\n.*\n.*\n.*\n.*

I have tested it in Expresso against the file I am working with, and the match is successful.

I want to match the following:

Reference number 8 numbers long
Any character, any number of times
New Line
Any character, any number of times
New Line
Any character, any number of times
New Line
Any character, any number of times
New Line
Any character, any number of times

My python code is:

for m in re.findall('[0-9]{8}.*\n.*\n.*\n.*\n.*', l, re.DOTALL):
    print(m)

But no matches are produced. As I said, Expresso finds 400+ matches, which is what I would expect.

What am I missing here?

+3 A:

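Don't use re.DOTALL, or the dot will match newlines too, and the first greedy .* will run on to the end of the input instead of stopping at the end of its line. Also use raw strings (r"...") for regexes: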

for m in re.findall(r'[0-9]{8}.*\n.*\n.*\n.*\n.*', l):
    print(m)

However, your version still should have worked (although very inefficiently) if you have read the entire file in memory as one large string.
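
To make the difference concrete, here is a small sketch on made-up data (the sample string and variable names are my own assumptions, not taken from your file). Without re.DOTALL each record matches separately; with it, the greedy dots run across line boundaries and everything collapses into one big match:

```python
import re

# Hypothetical sample data: two records of five lines each.
sample = (
    "12345678 first\na\nb\nc\nd\n"
    "87654321 second\ne\nf\ng\nh\n"
)

pattern = r'[0-9]{8}.*\n.*\n.*\n.*\n.*'

# Default mode: '.' stops at newlines, so each record is matched on its own.
without_dotall = re.findall(pattern, sample)

# DOTALL: '.' also matches newlines, so the first '.*' grabs as much as it can.
with_dotall = re.findall(pattern, sample, re.DOTALL)

print(len(without_dotall))
print(len(with_dotall))
```

So DOTALL would give you too few matches rather than none at all, which is why I suspect the way the file is being read.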

So the question is, are you reading the file like this:

import re

with open("filename", "r") as myfile:
    mydata = myfile.read()  # read the whole file into one string
    for m in re.findall(r'[0-9]{8}.*\n.*\n.*\n.*\n.*', mydata):
        print(m)

Or are you working with single lines (for line in myfile: or myfile.readlines())? In that case, the regex can't work, because each string then contains at most one newline while the pattern needs four.
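
A rough illustration of the two cases, assuming made-up contents and using io.StringIO as a stand-in for the real file so the snippet runs on its own:

```python
import io
import re

pattern = re.compile(r'[0-9]{8}.*\n.*\n.*\n.*\n.*')

# Hypothetical file contents, held in memory instead of a real file.
text = "12345678 first\na\nb\nc\nd\n87654321 second\ne\nf\ng\nh\n"

# Line by line: every string ends at its first newline, so nothing can match.
per_line = [m for line in io.StringIO(text) for m in pattern.findall(line)]

# Whole content at once: the pattern can span the five lines it needs.
whole = pattern.findall(io.StringIO(text).read())

print(per_line)
print(len(whole))
```

If your loop looks like the first variant, that would explain getting no matches at all.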

Tim Pietzcker 2010-09-17 09:23:18

Hi, yes, I am running Python on Windows, but the file is from a Unix environment.

humira 2010-09-17 09:29:40

The origin of the file is unlikely to matter. The question was whether you were reading the whole file at once or iterating over it. Iterating over a file object in Python yields one line at a time, so each string you get contains at most one newline.

Tim McNamara 2010-09-17 10:24:17
