Hi,

I'm reading some data that has already been converted to HTML-style &#965; code.

I now need to convert this back to UTF-8 characters for viewing. Unfortunately I can't use a browser to view the string.

I've read around about conversion in Java, and it seems that if you have a string literal containing \uxxxx the compiler will convert it for you. However, that won't work here, of course, because I want to read in dynamic strings.

So can this be done?

Many thanks! Dan

A: 

native2ascii

Use the "-reverse" option.

ksuralta
A: 

You need to use:

String StringEscapeUtils.unescapeJava(String str)

from the Apache Commons Lang library.

It will find \uxxxx sequences in the input string and convert them to a normal Java String.
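If adding the Commons dependency is not an option, the same idea can be sketched in plain Java. This is only a rough illustration and not the Commons implementation: the class and method names (UnicodeEscapes, unescape) are made up for the example, and unlike unescapeJava it handles only \uXXXX sequences, not other escapes such as \n or an escaped backslash.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of any library.
final class UnicodeEscapes {
    // Matches a backslash, one or more 'u' characters, then exactly four hex digits.
    private static final Pattern ESCAPE = Pattern.compile("\\\\u+([0-9a-fA-F]{4})");

    static String unescape(String input) {
        Matcher m = ESCAPE.matcher(input);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            // Parse the four hex digits into a single UTF-16 code unit.
            char c = (char) Integer.parseInt(m.group(1), 16);
            // quoteReplacement keeps '$' and backslashes in the result literal.
            m.appendReplacement(out, Matcher.quoteReplacement(String.valueOf(c)));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        // The doubled backslashes make this a runtime string with escapes in it,
        // the same as text read from a file or socket.
        String raw = "\\u03c5 and \\u00e9";
        System.out.println(unescape(raw));
    }
}
```

The result is an ordinary Java String. To get actual UTF-8 bytes out of it, call getBytes(StandardCharsets.UTF_8) or write it through a Writer opened with the UTF-8 charset.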

Alnitak
Excellent, thanks. Works a treat.
Codek
A: 

converted to html style &#965; code.

Which of those? \uXXXX is a Java convention and has nothing to do with HTML. HTML entities look like &#175;

Michael Borgwardt
Both. The way to do it is to convert &#xx; to \uxxxx and then convert that to UTF-8 using Apache Commons.
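For the HTML side, the intermediate \uxxxx step can probably be skipped by decoding the numeric references directly. A minimal sketch, assuming the input only contains numeric references of the form &#NNN; (decimal) or &#xHHH; (hex); named entities such as &amp; are left alone. The class and method names (NumericEntities, decode) are invented for this example.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of any library.
final class NumericEntities {
    // Group 1: hex digits of a hex reference; group 2: digits of a decimal reference.
    private static final Pattern ENTITY =
            Pattern.compile("&#(?:[xX]([0-9a-fA-F]{1,6})|([0-9]{1,7}));");

    static String decode(String input) {
        Matcher m = ENTITY.matcher(input);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            int codePoint = m.group(1) != null
                    ? Integer.parseInt(m.group(1), 16)
                    : Integer.parseInt(m.group(2), 10);
            // toChars also covers code points above U+FFFF (surrogate pairs);
            // references outside the Unicode range are left untouched.
            String replacement = Character.isValidCodePoint(codePoint)
                    ? new String(Character.toChars(codePoint))
                    : m.group();
            m.appendReplacement(out, Matcher.quoteReplacement(replacement));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(decode("&#965; and &#x3c5; are the same letter"));
    }
}
```

As with any Java String, the decoded text is held internally as UTF-16; it only becomes UTF-8 when it is encoded on output, for example with getBytes(StandardCharsets.UTF_8).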
Codek