ansaurus

Question

Answer 1

+1 A:

You never decoded the RTF file. RTF files are not plain text files. A file containing "äöü", for example, looks like this:

{\rtf1\ansi\ansicpg1252\deff0\deflang1031{\fonttbl{\f0\fswiss\fcharset0 Arial;}}
{\*\generator Msftedit 5.41.15.1507;}\viewkind4\uc1\pard\f0\fs20\'e4\'f6\'fc\par
}

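That is how the file appears when opened in a text editor. The characters "äöü" are stored as the hex escapes \'e4\'f6\'fc, whose byte values are Windows-1252 code points, as declared by \ansicpg1252 at the beginning of the file (äöü = 0xE4 0xF6 0xFC).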
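To make the encoding concrete, here is a minimal sketch of how those hex escapes map back to characters. The helper name decode_hex_escapes and the default code page argument are assumptions made for illustration. It handles only \'hh escapes and is not an RTF parser, so it would not strip control words or groups from a real file.

```python
import re

# Hypothetical helper for illustration only: it decodes \'hh hex escapes
# and nothing else. Real RTF needs a proper RTF-to-text converter.
HEX_ESCAPE = re.compile(r"\\'([0-9a-fA-F]{2})")

def decode_hex_escapes(rtf_fragment, codepage="cp1252"):
    """Replace each \\'hh escape with the character it encodes in `codepage`."""
    def replace(match):
        # Turn the two hex digits into a single byte, then decode that byte.
        byte = bytes([int(match.group(1), 16)])
        return byte.decode(codepage)
    return HEX_ESCAPE.sub(replace, rtf_fragment)

fragment = r"\'e4\'f6\'fc"   # the escaped run from the sample file
print(decode_hex_escapes(fragment))
```

The code page is passed as a parameter because the \ansicpg control word in the header decides how the bytes should be interpreted. 1252 applies to this sample only.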

To read RTF, you'll first need something that converts RTF to text (this has already been asked here).

AndiDog 2010-02-03 14:08:10

OK, I didn't know that. Thank you.

AP257 2010-02-03 22:19:22


Python: convert RTF file to unicode?
