ansaurus

Question

python csv help

Answer 1

+1 A:

You can get the csv module to tell you, just feed your desired output into the writer

In [1]: import sys,csv

In [2]: csv.writer(sys.stdout).writerow(['one", f"', 'two', 'three'])  
"one"", f""",two,three

In [3]: csv.reader(['"one"", f""",two,three']).next()  
Out[3]: ['one", f"', 'two', 'three']

gnibbler 2009-10-14 11:02:12

when I try this with my real input I don't get desired outputthis is strigNAME: "2801 chassis", DESCR: "2801 chassis, Hw Serial#: xxxxxxx, Hw Revision: 6.0",PID: CISCO2801 , VID: V03 , SN: xxxxxxxxx

Ib33X 2009-10-14 12:53:02

So actually your data is not CSV, but in some kind of dictionary format? A comma-separated list of key-value pairs?

Ferdinand Beyer 2009-10-14 13:37:21

Answer 2

+6 A:

Actually the result you get is correct—your CSV syntax is wrong.

If you want to quote commas or other characters in a CSV value, you have to use quotes surrounding the whole value, not parts of it. If a value does not start with the quote character, Python's CSV implementation does not assume the value is quoted.

So, instead of using

one",f",two,three

you should be using

"one,f",two,three

Ferdinand Beyer 2009-10-14 11:02:44

unfortunately I don't have control on input string

Ib33X 2009-10-14 12:53:48

Then I'm afraid you cannot use the `csv` module out of the box but have to write your own data reader.

Ferdinand Beyer 2009-10-14 13:35:40

Answer 3

+1 A:

Your input string is not really CSV. Instead your input contains the column name in each row. If your input looks like this:

NAME: "2801 chassis", DESCR: "2801 chassis, Hw Serial#: xxxxxxx, Hw Revision: 6.0",PID: CISCO2801 , VID: V03 , SN: xxxxxxxxx
NAME: "2802 wroomer", DESCR: "2802 wroomer, Hw Serial#: xxxxxxx, Hw Revision: 6.0",PID: CISCO2801 , VID: V03 , SN: xxxxxxxxx
NAME: "2803 foobars", DESCR: "2803 foobars, Hw Serial#: xxxxxxx, Hw Revision: 6.0",PID: CISCO2801 , VID: V03 , SN: xxxxxxxxx

The simplest you can do is probably to filter out the column names first, in the whole file. That would then give you a CSV file you can parse. But that assumes each line has the same columns in the same order.

However, if the data is not that consistent, you might want to parse it based on the names. Perhaps it looks like this:

NAME: "2801 chassis", PID: CISCO2801 , VID: V03 , SN: xxxxxxxxx, DESCR: "2801 chassis, Hw Serial#: xxxxxxx, Hw Revision: 6.0"
NAME: "2802 wroomer", DESCR: "2802 wroomer, Hw Serial#: xxxxxxx, Hw Revision: 6.0",PID: CISCO2801 , VID: V03 , SN: xxxxxxxxx
NAME: "2803 foobars",  VID: V03 ,PID: CISCO2801 ,SN: xxxxxxxxx

Or something. In that case I'd parse each line by looking for the first ':', split out the column head from that, then parse the value (including looking for quotes), and then continue with the rest of the line. Something like this (completely untested code):

def parseline(line):
    result = {}
    while ':' in line:
        column, rest = line.split(':',1)
        column = column.strip()
        rest = rest.strip()
        if rest[0] in ('"', '"'): # It's quoted.
            quotechar = rest[0]
            end = rest.find(quotechar, 1) # Find the end of the quote
            value = rest[1:end]
            end = rest.find(',', end) # Find the next comma
        else: #Not quoted, just find the next comma:
            end = rest.find(',', 1) # Find the end of the value
            value = rest[0:end]
        result[column] = value
        line = rest[end+1:]
        line.strip()
    return result

Lennart Regebro 2009-10-14 13:39:53

Your function will fail since ':' can be part of the (quoted) value (see DESCR). It might be easier to use a regular expression here!

Ferdinand Beyer 2009-10-14 15:55:08

It will not fail because of that, as it's treats quoted values separately. It never looks in the quoted value for a :

Lennart Regebro 2009-10-14 17:33:38

But it would fail because I forgot the ",1" in the split, had [0, end] instead of [0:end] in one place, and return value instead of result. With those three changes it the works. Pretty good for code I didn't even try to run. :)

Lennart Regebro 2009-10-14 17:38:53

ansaurus

tags:

views:

answers:

python csv help

related questions