ansaurus

Question

How to use csv.reader in python with french character like é,à,ç,ê,ë,...

Answer 1

+1 A:

Although I am not familiar with csv.reader or writer, I have been dealing with utf-8 file reading recently and perhaps using the codecs module might help you out.

Instead of,

data = open(..., "wb")

try,

import codecs

and then for all your utf-8 files, use,

data = codecs.open(..., "rb", "utf-8")

This automatically reads your files in as unicode (utf-8) and might write them to your file correctly.

emish 2010-07-28 19:08:50

thanks, you solved my issue with "foreign characters"

Uku Loskit 2010-07-28 20:37:14

@sheepz: Glad it helped you -- it's certainly not a solution to the OP's problem !-)

John Machin 2010-07-28 20:40:52

Answer 2

A:

The problem is in this line:

resultat.writerow( [ `info1`,`info2` ,`line[1]`,`line[2]`,`line[3]`,`line[4]`,`line[5]`,`line[6]`,`line[7]`,`line[8]`,`line[9]`,`line[10]`,`line[11]`,`line[12]`,`line[13]`,`line[14]`,`line[15]`,`line[16]`,`line[17]` ] )

Wrapping an expression in "back-ticks" aka "grave accents" is an old-fashioned and deprecated way of saying repr(expression).

Please consider the following:

>>> s = "Montréal"
>>> print s
Montréal
>>> print repr(s)
'Montr\xe9al'
>>> ord(s[5])
233
>>> hex(233)
'0xe9'
>>> s == "Montr\xe9al"
True
>>> `s` == repr(s)
True

The offending (in 3 ways) line should be simply replaced by

resultat.writerow([info1, info2] + [line[1:18]]) # WRONG (sorry!)
resultat.writerow([info1, info2] + line[1:18]) # RIGHT

John Machin 2010-07-28 20:27:10

that's not working!! the : line[1:18] write all info in one column !! and still wrong accent

francois 2010-07-29 12:59:14

@francois: Fixed; please try again.

John Machin 2010-07-29 19:34:27

Thanks you John, you just save my holidays.With you help, I will be able to import all those file before tonight.regardsfrancois

francois 2010-07-30 13:16:01

ansaurus

tags:

views:

answers:

How to use csv.reader in python with french character like é,à,ç,ê,ë,...

related questions