ansaurus

Question

Perl - Unicode::String sub need to add/convert for Latin-9 support

Answer 1


As you have been told already, Unicode::String is not an appropriate choice of module. Perl ships with a module called 'Encode' which can do everything you need.

If you have a character string in Perl like this:

my $euro = "\x{20ac}";

You can convert it to a string of bytes in Latin-9 like this:

use Encode qw(encode);
my $bytes = encode("iso8859-15", $euro);

The $bytes variable will now contain the single byte \xA4, which is the euro sign in Latin-9.
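For reference, here is a minimal self-contained sketch of that conversion, assuming only the core Encode module. The decode step and the variable name $chars are my own additions for illustration and are not part of the original snippet.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

# A Perl character string holding the euro sign (U+20AC).
my $euro = "\x{20ac}";

# Convert the character string to Latin-9 (ISO-8859-15) bytes.
my $bytes = encode("iso-8859-15", $euro);
printf "Latin-9 byte: %02X\n", ord($bytes);

# Going the other way: decode the Latin-9 bytes back into characters.
my $chars = decode("iso-8859-15", $bytes);
print $chars eq $euro ? "round trip ok\n" : "round trip failed\n";
```

Note that the same byte \xA4 decoded as Latin-1 would give the currency sign (U+00A4) rather than the euro, which is the main practical difference between the two encodings.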

Or you can have Perl convert it automatically on output to a filehandle like this:

binmode(STDOUT, ":encoding(iso8859-15)");
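To see what the encoding layer actually writes, here is a small sketch that applies the same layer to an in-memory filehandle instead of STDOUT. The in-memory handle and the $buffer variable are assumptions made purely so the result can be inspected; in real code you would apply binmode to STDOUT or a file as shown above.

```perl
use strict;
use warnings;

# A Perl character string holding the euro sign (U+20AC).
my $euro = "\x{20ac}";

# Open an in-memory filehandle, then add the Latin-9 encoding layer to it.
my $buffer = '';
open(my $out, '>', \$buffer) or die "open failed: $!";
binmode($out, ":encoding(iso-8859-15)");

# Characters printed to the handle are converted to Latin-9 bytes on the way out.
print {$out} $euro;
close($out) or die "close failed: $!";

printf "wrote %d byte(s), first byte is %02X\n", length($buffer), ord($buffer);
```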

For details, see the documentation for the Encode module; the PerlIO documentation describes the encoding layer.

I know you are determined to ignore this final piece of advice, but I'll offer it one last time. Latin-9 is a legacy encoding. Perl can quite happily read Latin-9 data and convert it to UTF-8 on the fly (using binmode). You should not be writing more software that generates Latin-9 data; you should be migrating away from it.
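As a rough sketch of that on-the-fly conversion, the following reads Latin-9 bytes through one encoding layer and writes them out through a UTF-8 layer. The sample data and the in-memory filehandles are assumptions for illustration only; with real files you would open them by name and apply the same binmode calls.

```perl
use strict;
use warnings;

# Hypothetical sample input: Latin-9 bytes, where \xA4 is the euro sign.
my $latin9 = "Price: 10 \xA4\n";

# Read the bytes as Latin-9; lines come back as Perl character strings.
open(my $in, '<', \$latin9) or die "open failed: $!";
binmode($in, ":encoding(iso-8859-15)");

# Write the characters back out as UTF-8.
my $utf8 = '';
open(my $out, '>', \$utf8) or die "open failed: $!";
binmode($out, ":encoding(UTF-8)");

while (my $line = <$in>) {
    print {$out} $line;
}

close($in);
close($out) or die "close failed: $!";

# $utf8 now holds the same text, with the euro sign as a three-byte UTF-8 sequence.
printf "%d Latin-9 bytes became %d UTF-8 bytes\n", length($latin9), length($utf8);
```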

Grant McLean 2010-06-18 19:29:28

@grant mclean, thanks again for your input on Latin-9 being legacy, and on Unicode::String. However, my job requires me to use Latin-9 in this project, and the code I have been given also uses the Unicode::String Perl module. I understand it would be better to use the newer encode() and remove Unicode::String, but the project requirements don't allow that, so I'm stuck with it. What I was trying to do (without success) was to use encode() instead of latin1() in the sub. Thanks again for your advice and the knowledge gained.

Phill Pafford 2010-06-18 19:51:07
