I am trying to determine the best way to strip newlines when reading in newline-delimited files in Python.

What I've come up with is the following code, including some throwaway code for testing.

def getfile(filename, results):
    # Strip the trailing newline from each line and collect into results
    f = open(filename)
    filecontents = f.readlines()
    for line in filecontents:
        foo = line.strip('\n')
        results.append(foo)
    f.close()
    return results

blahblah = []

getfile('/tmp/foo', blahblah)

for x in blahblah:
    print x

Suggestions?

+14  A: 

lines = open(filename).read().splitlines()

Curt Hagenlocher
This answer does what I was going for. I'm sure I'll need to add some error checking and such, but for this specific need it's great. Thank you all for providing answers!
solarce
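
For reference, splitlines() already removes the line endings, so no further stripping is needed. A quick check, assuming throwaway contents in /tmp/foo:

open('/tmp/foo', 'w').write('one\ntwo\n')
print open('/tmp/foo').read().splitlines()   # prints ['one', 'two']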
+4  A: 
for line in open('/tmp/foo'):
    print line.strip('\n')
David Zaslavsky
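
The same idea, collecting the stripped lines into a list the way the question's getfile does (path taken from the question):

# Build the list of newline-stripped lines in one pass
results = [line.strip('\n') for line in open('/tmp/foo')]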
A: 

I'd do it like this (note that it also drops blank lines):

f = open('test.txt')
lines = [line.strip() for line in f if line.strip()]
f.close()
print lines
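
A variant that closes the file automatically with a with block (Python 2.6 and later; filename taken from the answer above):

with open('test.txt') as f:
    lines = [line.strip() for line in f if line.strip()]
print lines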
+3  A: 

Here's a generator expression that does what you requested. In this case, using rstrip is sufficient and slightly faster than strip.

lines = (line.rstrip('\n') for line in open(filename))

However, you'll most likely want this instead, to get rid of trailing whitespace too.

lines = (line.rstrip() for line in open(filename))
TimoLinna
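
A quick illustration of the difference between the two, on a made-up line:

print repr('data\t \n'.rstrip('\n'))   # 'data\t '  (only the newline removed)
print repr('data\t \n'.rstrip())       # 'data'     (all trailing whitespace removed)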
+2  A: 

I use this:

def cleaned( aFile ):
    for line in aFile:
        yield line.strip()

Then I can do things like this:

lines = list( cleaned( open("file","r") ) )

Or I can extend cleaned to, for example, drop blank lines or skip comment lines (sketched below).

S.Lott
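
A sketch of one such extension, assuming lines starting with '#' are comments:

def cleaned( aFile ):
    # Strip each line; skip blank lines and (assumed) '#' comment lines
    for line in aFile:
        line = line.strip()
        if line and not line.startswith('#'):
            yield line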
+1  A: 

Just use generator expressions:

blahblah = (l.rstrip() for l in open(filename))
for x in blahblah:
    print x

Also, I want to advise against reading the whole file into memory; looping over a generator is far more memory-efficient on big files.
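
For example, counting non-blank lines this way streams the file rather than slurping it (filename as above):

# Lazily count non-blank lines; only one line is held in memory at a time
count = sum(1 for line in open(filename) if line.strip())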