ansaurus

Question

Problem converting C/C++ unsigned char to JAVA

Answer 1

A:

IIRC Java uses a 16-bit representation for chars (UNICODE?) and C++ normally doesn't unless you use wchars.

I think you'd be better off trying to get C++ to use the UNICODE characters that Java uses rather than the other way around.

Timo Geusch 2010-09-02 20:33:44

Hi Timo,Thank you for the prompt reply.I'm trying to write my app in JAVA. So I need a way to get 160 out of the char † . :(

metalhawk 2010-09-02 20:38:42

"UNICODE?" UTF-16 to be more precise.

R. Bemrose 2010-09-02 21:01:55

Answer 2

+3 A:

In C++ you are using "narrow" characters in some specific encoding that happens to define character '†' as 160. In other encodings 160 may mean something else, and character '†' may be missing altogether.

In Java, you are always dealing with Unicode. 8660 = 0x2020 = U+2020 "DAGGER".

To get "160", you need to convert your string to the same encoding you are using with C++. See String.getBytes(charset).

atzz 2010-09-02 20:41:05

Thanks atzz, that is great explanation.I'm now trying to get what charset is being used in C++.Thank you ! :)

metalhawk 2010-09-02 20:57:10

@ravikumar1: Try US-ASCII. If that doesn't work, try ISO-8859-1.

R. Bemrose 2010-09-02 21:04:05

Thank you Bemrose. I wrote a small fn to get the charset. I found a hit for -96 (256-96=160). Thank you all for the support. :) Below is my test fn:

metalhawk 2010-09-02 21:24:52

Here it is . public void findCharsets() { Map charSets = Charset.availableCharsets(); Iterator it = charSets.keySet().iterator(); String str = Character.toString('†'); while (it.hasNext()) { try { String csName = (String) it.next(); byte b[] = str.getBytes(Charset.forName(csName)); if (b[0] == -96) { System.out.println("Found: " + csName); } } catch (Exception e) { // do nothing; go to next Charset } } }

metalhawk 2010-09-02 21:25:07

This is the output of the programFound: MacRomanFound: x-MacCentralEuropeFound: x-MacCroatianFound: x-MacCyrillicFound: x-MacGreekFound: x-MacRomaniaFound: x-MacTurkishFound: x-MacUkraine

metalhawk 2010-09-02 21:27:02

Answer 3

A:

If you write out the unsigned char 160 in C++ as a single byte, and use InputStream.read() you will get 160. Which character this means depends on the assumed encoding but the value 160 is unchanged.

Peter Lawrey 2010-09-02 20:53:29

Thanks Peter, I'm trying to write in JAVA only. I dont have a program in C++ which runs first. Simply, I'm decoding in JAVA only, for which I need 160 for char †

metalhawk 2010-09-02 20:59:14

ansaurus

tags:

views:

answers:

Problem converting C/C++ unsigned char to JAVA

related questions