ansaurus

Question

how to convert char * to uchar16 in JNI C++

Answer 1

A:

Your code will work, as long as str is ASCII; calling strlen() in the loop condition is probably a bad idea, though. It might be easier to just use swprintf() if it's available on your system:

uchar16_t buf[32];
char *str = "Test";
swprintf(buf, sizeof buf, "%s", str);

Carl Norum 2010-06-11 18:59:23

Carl, what can I use in place of strlen instead - do I check for '\0'? My code does not work - when I print it, it prints all blank chars :(

Sagar Hatekar 2010-06-11 19:01:29

@Sagar, What prints all blank chars?

Carl Norum 2010-06-11 19:01:57

@Sagar: It is probably empty

0A0D 2010-06-11 19:02:50

@Carl, after using swprintf, here's what I get:Cannot convert from uchar16_t to wchar_t.

Sagar Hatekar 2010-06-11 19:26:28

Answer 2

+3 A:

Strlen? buf[32]? Trying to destroy the universe?

You want to use a wstringstream.

std::wstringstream lols;
lols << "Test";
std::wstring cakes;
lols >> cakes;

Edit@Comment: You shouldn't use strlen because any decent string system allows embedded zeros, and strlen is seriously slow. In addition, you didn't resize your buffer as needed, so if you had a string of size > 31 you would get a buffer overflow. In addition, you would have to (if you did dynamically size your buffer) manually free it afterwards. Both of these things are serious failings of the C string system. My example code makes your standard library writer do all the work and avoid all these problems for you.

DeadMG 2010-06-11 19:00:09

Well, I am learning - how will I learn unless you told me why is it wrong to do an strlen and what could be used in place of that? :)

Sagar Hatekar 2010-06-11 19:41:44

@Sagar: For learning purposes, I suggest looking at this paper where Bjarne Stroustrup does a side-by-side analysis of "the C way" vs "the C++ way" of doing things (often the reason people think C is faster code is simply because you are omitting things that are required in order to be correct): http://www2.research.att.com/~bs/new_learning.pdf

Hostile Fork 2010-06-11 19:53:05

@Hostile Fork: Thanks, that's really some valuable piece of information.

Sagar Hatekar 2010-06-11 20:01:16

@DeadMG great info - thanks!

Sagar Hatekar 2010-06-14 14:26:41

Answer 3

A:

Have a look here.

Also, is there a good reason you are defining your own type?

If you have a (narrow) char string, you cannot convert it to a wchar_t string by setting your locale to "C" and then passing the string through mbstowcs(). That's because the "C" locale specifies a -particular- character encoding, and that encoding might not match the encoding of the execution character set, so mbstowcs() might map the characters to something unexpected, or could even fail (if the execution character set happened to use encodings that were incompatible with the encoding structure for the C locale character set.)

Thus, in order to convert a char string into a wider string, you have to copy the chars one by one into an array of wchar_t . If you need to work with Unicode or utf-16 or whatever after that, then wcstombs() is what you should look at.

Serapth 2010-06-11 19:01:23

@Serapth Thanks for explanation. The typedef has been done in an existing library so I have to use it 'as-is'.

Sagar Hatekar 2010-06-11 19:12:36

wcstombs outputs a char* - I need a uint16_t So when I do that, it gives me an error saying "cannot convert from char* to uchar16_t*"

Sagar Hatekar 2010-06-11 19:34:02

Answer 4

+1 A:

That's actually OK if your string will always be ASCII. To do it correctly, the portable function is mbstowcs which assumes you're converting from the default locale or if you're on Windows then there's API functions that let you specify the source code page explicitly.

Rup 2010-06-11 19:03:16

ansaurus

tags:

views:

answers:

how to convert char * to uchar16 in JNI C++

related questions