ansaurus

Question

Unrecognized extra characters in file parsed with PHP

Answer 1

+5 A:

I may be wrong, but this smells like a UTF-16 encoded file. Can you try

$f = iconv("utf-16", "utf-8", $f);

?

Pekka 2009-12-27 21:29:35

The character spacing almost certainly indicates it is a Unicode file. UTF-16 is a very likely guess too.

Goyuix 2009-12-27 21:31:03

In particular, it is UTF-16LE (little-endian) encoding, the UTF-16 variant Windows misleadingly describes as just “Unicode”. The two bytes at the start are a Byte Order Mark that will allow `utf-16`-with-unspecified-endianness to work by automatically detecting the little-endianness.

bobince 2009-12-27 21:45:06
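
To make the Byte Order Mark point concrete, here is a minimal sketch that checks the first two bytes and converts with explicit endianness rather than relying on auto-detection. The function name `utf16_to_utf8` and the sample string are hypothetical, not from the original question, and the sketch assumes the `iconv` extension is available and that input without a UTF-16 BOM can be returned unchanged.

```php
<?php
// Hypothetical helper (not from the thread): detect a UTF-16 BOM,
// strip it, and convert the remainder to UTF-8.
function utf16_to_utf8(string $raw): string
{
    $bom = substr($raw, 0, 2);

    if ($bom === "\xFF\xFE") {
        $from = 'UTF-16LE';   // FF FE: little-endian, what Windows calls "Unicode"
    } elseif ($bom === "\xFE\xFF") {
        $from = 'UTF-16BE';   // FE FF: big-endian
    } else {
        return $raw;          // Assumption: no UTF-16 BOM means leave the data alone
    }

    // Drop the two BOM bytes so they do not end up in the output.
    $converted = iconv($from, 'UTF-8', substr($raw, 2));
    if ($converted === false) {
        throw new RuntimeException("Could not convert from $from to UTF-8");
    }
    return $converted;
}

// "Hi" as a UTF-16LE file: BOM, then each ASCII character followed by a NUL byte.
$sample = "\xFF\xFEH\x00i\x00";
echo utf16_to_utf8($sample), "\n"; // prints "Hi"
```

Naming the endianness explicitly and stripping the BOM by hand avoids depending on how a particular iconv implementation treats plain `utf-16`. In practice `$raw` would come from something like `file_get_contents()` on the problem file.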
