ansaurus

Question

Answer 1

+5 A:

You can use the python codecs for these that are known by their names 'mac-roman', 'mac-turkish', etc.

>>> 'foo'.decode('mac-turkish')
u'foo'

You'll have to refer to them by their names, these numbers you've got in your question don't appear in the source files. For more information look at $pylib/encodings/mac_*.py.

Jerub 2009-10-20 07:09:54

Also, those Mac encodings date back to classic MacOS days and are largely obsolete in Mac OS X.

Ned Deily 2009-10-20 07:41:58

Answer 2

+3 A:

It seems that at least Mac Roman and Mac Turkish encodings exist in Python stdlib, under names macroman and macturkish. See http://svn.python.org/projects/python/trunk/Lib/encodings/aliases.py for a complete list of encoding aliases in the most up-to-date Python.

Tuure Laurinolli 2009-10-20 07:10:02

Answer 3

+2 A:

No.

However, unicode.org provides codec description files that you can use to generate modules that will parse those codecs. Included with python source distributions is a script that will convert these files: Python-x.x/Tools/unicode/gencodec.py.

Aaron Gallagher 2009-10-20 07:10:50

ansaurus

tags:

views:

answers:

Decoding Mac OS text in Python

related questions