ansaurus

Question

Answer 1

+5 A:

You're importing the csv module but never use it. Why?

If you do

import csv
reader = csv.reader(open(file, "rb"), dialect="excel") # Python 2.x
# Python 3: reader = csv.reader(open(file, newline=""), dialect="excel")

you get a reader object that will contain all you need; the first row will contain the headers, and the subsequent rows will contain the data in the corresponding places.

Even better might be (if I understand you correctly):

import csv
reader = csv.DictReader(open(file, "rb"), dialect="excel") # Python 2.x
# Python 3: reader = csv.DictReader(open(file, newline=""), dialect="excel")

This DictReader can be iterated over, returning a sequence of dicts that use the column header as keys and the following data as values, so

for row in reader:
    print(row)

will output

{'Name': 'Nick', 'Designation': 'F4321', 'Type': 'Subject', 'Total': '29', 'First-term assessment': '10', 'Second-term assessment': '19', 'Description': 'D1234'}
{'Name': 'HTML', 'Designation': 'F4321', 'Type': 'Unit', 'Total': '', 'First-term assessment': '18', 'Second-term assessment': '', 'Description': 'D1234-1'}
{'Name': 'Tags', 'Designation': 'F4321', 'Type': 'Topic', 'Total': '', 'First-term assessment': '18', 'Second-term assessment': '', 'Description': 'First Term'}
{'Name': 'Review of representation of HTML', 'Designation': '', 'Type': 'Subtopic', 'Total': '', 'First-term assessment': '', 'Second-term assessment': '', 'Description': ''}

Tim Pietzcker 2010-06-24 07:00:58

i have fixed the indentation

Hulk 2010-06-24 07:03:14

In Python 2.x, *ALWAYS* open the file in binary mode ('rb' or 'wb', as appropriate).

John Machin 2010-06-24 11:07:49

@John Machin: Why? The csv module docs say nothing about this, and I've never had problems opening files without the `b` flag. Some examples use it, some don't. You may be very right, but I'd like to know the rationale behind this.

Tim Pietzcker 2010-06-24 11:47:51

@Tim: 2.x docs http://docs.python.org/library/csv.html#csv.reader say something: "If csvfile is a file object, it must be opened with the ‘b’ flag on platforms where that makes a difference." i.e. Windows platforms. So for platform independence, one should use 'rb' always. The same applies when writing even though the docs don't say so. CSV records are terminated by CRLF independent of platform -- it's in essence a BINARY format. If you don't supply 'wb' on Windows, you get CRCRLF.

John Machin 2010-06-24 12:45:43

@John: Must have been blind (I just grepped the page for `rb`), thanks. Will edit.

Tim Pietzcker 2010-06-24 12:52:55

Nice one..Thanks

Hulk 2010-06-25 07:26:10

ansaurus

tags:

views:

answers:

Python script reading from a csv file

related questions