views:

57

answers:

2
f = open("go.txt", "w")
f.write(title)
f.close()

What if "title" is in japanese/utf-8? How do I modify this code to be able to write "title" without having the ascii error?

Edit: Then, how do I read this file in UTF-8?

+2  A: 

How to use UTF-8:

import codecs

# ...
# title is a unicode string
# ...

f = codecs.open("go.txt", "w", "utf-8")
f.write(title)

# ...

fileObj = codecs.open("go.txt", "r", "utf-8")
u = fileObj.read() # Returns a Unicode string from the UTF-8 bytes in the file
Jared Updike
+2  A: 

It depends on whether you want to insert a Unicode UTF-8 byte order mark, of which the only way I know of is to open a normal file and write:

import codecs

f = open('go.txt', 'wb')
f.write(codecs.BOM_UTF8)
f.write(title.encode('utf-8')
f.close()

Generally though, I don't want to add a UTF-8 BOM and the following will suffice though:

import codecs

f = codecs.open('go.txt', 'w', 'utf-8')
f.write(title)
f.close()
David Morrissey
You can also use the 'utf-8-sig' codec. It writes a BOM when writing and removes it (if necessary) when reading.
Mark Tolonen