ansaurus

Question

Python creating a dictionary and swapping these into another file

Answer 1

+2 A:

It looks like you could more usefully use the Python standard library csv module here. rather than perform the text processing parts youself "manually". E.g.:

import csv
with open("one.csv", "r") as f:
  rows_one = list(csv.reader(f, delimiter='\t'))
with open("second.csv", "r") as g:
  rows_two = list(csv.reader(g, delimiter='\t'))
rows_totl = [r + s[1:] for r, s in zip(rows_one, rows_two)]
with open("total.csv", "w") as h:
  csv.writer(h, delimiter='\t').writerows(rows_totl)

The with statement is one of the jewels of Python 2.6 (it's also usable in 2.5, but only if you from __future__ import with_statement!-) -- as used here, it gives you an open file and ensures it gets closed as soon at the with body's done... plus, it has a zillion more uses, e.g. for locks and all sorts of your own custom-coded "context managers".

Alex Martelli 2010-04-01 20:06:44

Nice solution!!

systempuntoout 2010-04-01 20:50:23

Alex, In addition to using the csv module, you are using the "with" statement. It might be nice to point out why you made that change as well.

John Mulder 2010-04-02 04:23:27

@John, OK, editing to point that out.

Alex Martelli 2010-04-02 04:48:04

Answer 2

A:

May I suggest, instead of hand-parsing CSV file use csv that is built-in. It takes care of delimiters, character escaping etc. It's API is simple, too:

import csv

# Auto-detector of this particular CSV dialect (delimiters and such)
dialect = csv.Sniffer().sniff(open('one.csv').read())

# csv.reader yields every row found in the file using the given dialect
rows = csv.reader(open('one.csv'), dialect = dialect)

# [list comprehension][2]
resulting_dict = dict((row[0], row[1:]) for row in rows)

You can refactor the code into a function and use it for both files (coding from memory, though, beware for errors).

Now you've got two dicts for two files, let's say dict1 and dict2, you can combine them:

combined_dict = dict((key, dict1[key] + dict2[key]) for key in dict2)

Writing it to a .csv file is also straightforward:

writer = csv.writer(open('second.csv', 'w'), delimiter = '\t')
for key, values in combined_dict:
    writer.writerow(key, *values)

Definitely check out the docs for the more detailed reference.

Edit: My solution doesn't take line ordering into account (dict is unordered). There are two solutions:

if you're running Python 3 or Python 2.7, use collections.OrderedDict
otherwise, you need to store the order of the lines - for example, while reading the second file break the list comprehension into standard for statement and store the headers in a list.

Mike Hordecki 2010-04-01 20:10:56

Just wondered what's up with those unicorns and saw my own one..

Mike Hordecki 2010-04-01 20:12:36

Answer 3

A:

Have a look to csv module:

import csv
reader1 = csv.reader(open('input1.csv'), delimiter = '\t')
reader2 = csv.reader(open('input2.csv'), delimiter = '\t')
csvwriter = csv.writer(open('output.csv', 'w'),delimiter = '\t')
while True:
    row1 = reader1.next()
    if row1:
       row2 = reader2.next()
       new_row = row2 + row1[1:]
       csvwriter.writerow(new_row)
    else:
        break

systempuntoout 2010-04-01 20:23:34

ansaurus

tags:

views:

answers:

Python creating a dictionary and swapping these into another file

related questions