ansaurus

Question

3 byte UTF-8 String replacement in .NET (Convert 3-byte UTF-8 to String or Char)

Answer 1

+2 A:

You're confusing Unicode code points with their UTF-8 encoding. Internally, all .NET strings use UTF-16, so you only need to specify the Unicode code point (here U+2007, the figure space), not the UTF-8 byte data:

Const FigureSpaceChar As Char = ChrW(&H2007)

Codepoint from www.fileformats.info.
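
To illustrate the difference, here is a minimal sketch (the module and variable names are made up for this example) that decodes the three UTF-8 bytes for the figure space, E2 80 87, and compares the result with the character built directly from its code point. It assumes a console application with the usual Microsoft.VisualBasic runtime available for ChrW and AscW.

```vb
Imports System
Imports System.Text
Imports Microsoft.VisualBasic

Module FigureSpaceDemo
    Sub Main()
        ' The three bytes that encode U+2007 (FIGURE SPACE) in UTF-8.
        Dim utf8Bytes As Byte() = {&HE2, &H80, &H87}

        ' Decoding the byte sequence yields a string of exactly one Char.
        Dim decoded As String = Encoding.UTF8.GetString(utf8Bytes)

        ' The same character, specified directly by its code point.
        Dim figureSpace As Char = ChrW(&H2007)

        Console.WriteLine(decoded.Length)                    ' 1
        Console.WriteLine(decoded(0) = figureSpace)          ' True
        Console.WriteLine(AscW(decoded(0)).ToString("X4"))   ' 2007

        ' Encoding the Char again gives back the original three bytes.
        Dim encoded As Byte() = Encoding.UTF8.GetBytes(figureSpace.ToString())
        Console.WriteLine(BitConverter.ToString(encoded))    ' E2-80-87
    End Sub
End Module
```

The byte sequence only matters when text is read from or written to a UTF-8 stream. Inside the program, the Char built from the code point is all a call such as String.Replace needs.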

Konrad Rudolph 2009-08-10 17:30:35

.NET uses UTF-16, not UTF-32. (Each char is a UTF-16 code unit.)

Jon Skeet 2009-08-10 18:02:49

Jon: of course. Typo. Thanks for spotting it.

Konrad Rudolph 2009-08-10 18:04:43

TJ 2009-08-10 18:08:13
