ansaurus

Question

Answer 1

A:

Not really sure what you mean, but you can always check whether a row is non-empty with if:

>>> import csv
>>> reader = csv.reader(open("file"))
>>> for r in reader:
...   if r: print r
...

If this is not what you want, describe your problem more clearly by showing examples of things that don't work for you, including a sample of the file format and the output you want.

ghostdog74 2010-02-11 10:19:15

Answer 2

+1 A:

The Django community has addressed Python CSV import issues, so it might be worth searching for CSV import there, or posting a question. Also, you could edit the offending line directly in the CSV file before trying the import.

Dave Everitt 2010-02-11 10:38:25

Answer 3

+1 A:

Maybe you could catch the exception raised by the CSV reader. Something like this:

import csv
import sys

filename = "my.csv"
reader = csv.reader(open(filename))
try:
    for row in reader:
        print 'Row read with success!', row
except csv.Error as e:
    sys.exit('file %s, line %d: %s' % (filename, reader.line_num, e))

Or you could use next():

while True:
    try: 
        print reader.next()
    except csv.Error:
        print "Error"
    except StopIteration:
        print "Iteration End"
        break
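For readers on Python 3, here is a minimal sketch of the same next()-based loop. The function name, the sample data and the use of strict=True to provoke an error are assumptions made for illustration; they are not from the question. Whether reading can usefully continue after an error depends on what went wrong, but in this sample the reader moves on to the following line.

```python
import csv

def read_rows_skipping_errors(lines, **reader_kwargs):
    """Return (good_rows, errors); errors holds (line_num, message) pairs."""
    reader = csv.reader(lines, **reader_kwargs)
    good_rows, errors = [], []
    while True:
        try:
            good_rows.append(next(reader))
        except csv.Error as exc:
            # In CPython, csv.Error is re-exported from the _csv extension
            # module, which is why tracebacks show it as _csv.Error.
            errors.append((reader.line_num, str(exc)))
        except StopIteration:
            break
    return good_rows, errors

# Hypothetical input: the second line has text after a closing quote,
# which strict=True treats as an error.
sample = ['a,b\n', 'c,"d"e\n', 'f,g\n']
good_rows, errors = read_rows_skipping_errors(sample, strict=True)
```

Here good_rows keeps the first and third rows, and errors records the line number of the bad one.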

Pedro Ghilardi 2010-02-11 11:22:54

Hi Pedro! Great! This is what I was looking for! AFG

Abruzzo Forte e Gentile 2010-02-11 12:25:47

You cannot catch this exception (test it).

telliott99 2010-02-11 12:53:22

@Pedro: What @telliott99 means (if I'm channeling him correctly) is that you need to do `import _csv` then `except _csv.Error:`

John Machin 2010-02-11 13:54:22

Ah... <whacks self on head> The issue is that import csv does not import _csv, and that's why I couldn't catch the exception? Doesn't seem to work.

telliott99 2010-02-11 14:15:57

@telliott99: WHAT "doesn't seem to work"?? Show code that demonstrates "couldn't catch the exception". If you're having difficulty, try asking a separate question!

John Machin 2010-02-11 21:03:20

Answer 4

A:

If your problem is specific to the last line being empty, you can use numpy.genfromtxt (or the old matplotlib.mlab.csv2rec):

$: cat >csv_file.txt
foo,bar,baz
yes,no,0
x,y,z



$:
$: ipython
>>> from numpy import genfromtxt
>>> genfromtxt("csv_file.txt", dtype=None, delimiter=',')
array([['foo', 'bar', 'baz'],
       ['yes', 'no', '0'],
       ['x', 'y', 'z']], 
      dtype='|S3')

dalloliogm 2010-02-11 11:55:54

Answer 5

+1 A:

You (always) need to say EXACTLY what error message you got. Please edit your question.

Probably this:

>>> import csv; csv.reader("\x00").next()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
_csv.Error: line contains NULL byte
>>>

The csv module is not 8-bit clean; see the docs: """Also, there are currently some issues regarding ASCII NUL characters."""

The error message is itself in error: it should be "NUL", not "NULL" :-(

If the last line in the file is empty, you won't get an exception, you'll merely get row == [].
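A small sketch of that behaviour, written for Python 3; the sample data and the helper name non_empty_rows are made up for illustration:

```python
import csv

def non_empty_rows(lines):
    """Yield only the rows that contain at least one field."""
    for row in csv.reader(lines):
        if row:  # a blank input line comes back as []
            yield row

# Hypothetical sample data with a blank line in the middle and at the end.
sample = ["foo,bar,baz\n", "\n", "x,y,z\n", "\n"]
all_rows = list(csv.reader(sample))
kept = list(non_empty_rows(sample))
```

all_rows contains an empty list for each blank line, while kept holds only the two data rows.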

Assuming the problem is one or more NULs in your file(s), you'll need to (1) speak earnestly to the creator(s) of your file(s) (2) failing that, read the whole file in (mode="rb"), strip out the NUL(s), and feed fixed_text.splitlines() to the csv reader.
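Option (2) could look roughly like this in Python 3. This is a sketch under assumptions: the function names are invented, the file is assumed to be UTF-8, and io.StringIO with newline="" is used in place of splitlines() so that line endings inside quoted fields reach the csv reader unchanged.

```python
import csv
import io

def parse_without_nuls(raw, encoding="utf-8"):
    """Remove NUL bytes from raw CSV bytes and return the parsed rows."""
    text = raw.replace(b"\x00", b"").decode(encoding)
    # newline="" leaves line endings untouched so csv can handle them itself.
    return list(csv.reader(io.StringIO(text, newline="")))

def read_csv_without_nuls(path, encoding="utf-8"):
    """Read the whole file in binary mode, then parse it without NULs."""
    with open(path, "rb") as handle:
        return parse_without_nuls(handle.read(), encoding)

# In-memory demo: two good lines followed by a stray NUL byte.
rows = parse_without_nuls(b"a,b,c\nd,e,f\n\x00")
```

Reading the whole file at once keeps the code short but costs memory on large files; a line-by-line filter avoids that.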

John Machin 2010-02-11 12:30:55

Hi! You are right! For the sake of clarity I added the traceback to my question. In the end I solved it with a try/except solution.

Abruzzo Forte e Gentile 2010-02-11 12:35:36

Answer 6

A:

I don't have an answer, but I can confirm the problem, and that most of the answers posted don't work. You cannot catch this exception. Testing with if line doesn't help either. Maybe you could check for the NULL byte directly, but I'm not swift enough to do that... If it is always on the last line, you could of course skip that.

import csv
FH = open('data.csv','wb')
line1 = [97,44,98,44,99,10]
line2 = [100,44,101,44,102,10]
for n in line1 + line2:
    FH.write(chr(n))
FH.write(chr(0))
FH.close()
FH = open('data.csv')
reader = csv.reader(FH)
for line in reader:
    if '\0' in line:  continue
    if not line:  continue
    print line

$ python script.py 
['a', 'b', 'c']
['d', 'e', 'f']
Traceback (most recent call last):
  File "script.py", line 11, in <module>
    for line in reader:
_csv.Error: line contains NULL byte

telliott99 2010-02-11 12:32:14

Hi! I tried the simple try/except approach. In my case it was easy and good enough, since the error is on the last line of the CSV file, where there is no valid data. I hope this helps.

Abruzzo Forte e Gentile 2010-02-11 12:34:46

Answer 7

A:

You could try cleaning the file as you read it:

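import csv

def nonull(stream):
    # Strip NUL characters from each line before csv sees it.
    for line in stream:
        yield line.replace('\x00', '')

f = open(filename)  # filename: path to your CSV file
reader = csv.reader(nonull(f))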

Assuming, of course, that simply ignoring NULL characters will work for you!
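A Python 3 sketch of the same generator, for what it's worth. The read_clean_rows helper, its path and encoding parameters, and the demo data are assumptions for illustration. The file is opened with newline="" (as the csv documentation recommends) so that quoted fields containing newlines are passed to the reader intact.

```python
import csv
import io

def nonull(stream):
    """Yield each line of stream with NUL characters removed."""
    for line in stream:
        yield line.replace("\x00", "")

def read_clean_rows(path, encoding="utf-8"):
    """Parse a CSV file, dropping NULs line by line (path/encoding assumed)."""
    with open(path, newline="", encoding=encoding) as handle:
        return list(csv.reader(nonull(handle)))

# In-memory demo: a quoted field containing both a NUL and a newline.
demo = io.StringIO('a,"x\x00\ny",c\n', newline="")
demo_rows = list(csv.reader(nonull(demo)))
```

In the demo the NUL disappears while the newline inside the quoted field is kept as part of the second field.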

John Fouhy 2010-03-15 20:58:00

@John Fouhy: Without testing, I wouldn't rely on that approach working if fields had embedded newlines, ESPECIALLY as you don't ensure binary mode ('rb').

John Machin 2010-03-15 21:32:54

A fair point, although I'd imagine a lot of software chokes on CSV files with newlines embedded inside fields..

John Fouhy 2010-03-16 04:36:26

Answer 8

A:

@John Fouhy: Your simple cleaning approach worked well for reading txt files created with the statistical software JMP. About every line of the txt files contained NUL characters.

Travis N 2010-10-29 16:14:28
