ansaurus

Question

tchar safe functions -- count parameter for UTF-8 constants

Answer 1

+3 A:

Yes, you get it right.

The question, however, is why do you port it to TCHAR - something that is sensitive to _UNICODE define.

Why not use UTF8 and char*?

Pavel Radzivilovsky 2010-06-07 21:58:11

doesn't that defeat the point?I'm porting to tchar only because that's what the surrounding code is using. do i have a choice here?

Dustin Getz 2010-06-07 22:04:40

there's a serious belief that TCHAR is a misguided effort that should be abandoned. See http://stackoverflow.com/questions/1049947/should-utf-16-be-considered-harmful

Pavel Radzivilovsky 2010-06-07 22:43:34

Im porting a parsing method, written in 1995, to operate on basic_string<TCHAR>. i'm starting to think this could be a as parsing logic will be sensitive to multi-byte characters. i don't think passing in UTF-8 byte arrays to this function will be very pretty.

Dustin Getz 2010-06-08 14:15:30

If you have a method operating on char*, it's much easier to make it UTF-8 compliant than TCHAR-compliant. And the result is better.

Pavel Radzivilovsky 2010-06-08 14:22:04

Answer 2

+1 A:

TCHAR is a type that's either 8 or 16 bits depending on whether _UNICODE is defined. But UTF-8 always uses 8-bit code units, so using TCHAR is silly. Just use char.

TCHAR is tied to the existence of two versions of the Windows API: "A" functions that use legacy 8-bit code pages, and "W" functions that use UTF-16. UTF-8 is not supported. You can use UTF-8 on Windows by explicitly converting your UTF-8 strings to UTF-16 for API calls, but you won't get any help from _UNICODE or TCHAR.

dan04 2010-06-09 00:11:12

ansaurus

tags:

views:

answers:

tchar safe functions -- count parameter for UTF-8 constants

related questions