ansaurus

Question

Using Wide Character Constants with clang Gets "extraneous characters in wide character constant ignored" Error

Answer 1

A:

At the heart of the problem is the interpretation of the source file. You know that it's UTF-8 encoded, which is why the 6 bytes of L'﹤' are meant to be read as 4 Unicode characters. But how would clang know? It sees 6 bytes and assumes an 8-bit encoding. Thus it sees L'xyz' (the precise characters depend on the assumed 8-bit character set). clang is telling you that it interprets L'xyz' as L'x', ignoring y and z. It's extremely unlikely that this works as intended.

MSalters 2010-07-27 14:00:29
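
To make the byte counting in the answer above concrete, here is a minimal sketch in C. It spells out the source text L'﹤' byte by byte (U+FE64 encodes as EF B9 A4 in UTF-8) and compares the raw byte count with the number of UTF-8 characters. The helper name utf8_length is hypothetical, introduced only for this illustration, and it assumes its input is valid UTF-8.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper: counts UTF-8 characters by skipping continuation
   bytes (bit pattern 10xxxxxx). Assumes the input is valid UTF-8. */
static size_t utf8_length(const char *s) {
    size_t n = 0;
    for (; *s != '\0'; ++s) {
        if (((unsigned char)*s & 0xC0) != 0x80) {
            ++n;
        }
    }
    return n;
}

int main(void) {
    /* The source text L'﹤' written out as raw bytes, so this file's own
       encoding does not matter. */
    const char source_text[] = "L'\xEF\xB9\xA4'";

    /* What a compiler assuming an 8-bit encoding sees: 6 characters. */
    assert(strlen(source_text) == 6);
    /* What the author meant: 4 characters. */
    assert(utf8_length(source_text) == 4);

    printf("bytes: %zu, UTF-8 characters: %zu\n",
           strlen(source_text), utf8_length(source_text));
    return 0;
}
```

Between the two quotes there are three bytes, which is why a compiler reading one character per byte reports extra characters in the constant.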

Hmm, gcc never had any problem here. Is there a way to tell clang to handle UTF-8 source files properly, or alternatively to write the wide characters in a form that clang understands?

Ventzi Zhechev 2010-07-27 14:07:01

http://github.com/bratsche/clang suggests not. Under "IV. Missing Functionality / Improvements", the Lexer section lists: "Source character mapping. GCC supports ASCII and UTF-8."

MSalters 2010-07-27 14:30:50
