Hi, I have an xml-file I want to parse:
<?xml version="1.0" encoding="UTF-8" ?>
<tag>û</tag>
It's perfectly parsed by firefox. But XML::Simple corrupts some data. I have a perl-program like this:
my $content = "<?xml version=\"1.0\" encoding=\"UTF-8\" ?>\n";
$content .= "<tag>\x{c3}\x{bb}</tag>\n";
print "input:\n$content\n";
my $xml = new XML::Simple;
my $data = $xml->XMLin($content, KeepRoot => 1);
print "data:\n";
print Dumper $data;
and get:
input:
<?xml version="1.0" encoding="UTF-8" ?>
<tag>û</tag>
data:
$VAR1 = {
'tag' => "\x{fb}"
};
it doesn't seem to be what I expected. I think there some encoding issues. Am I doing something wrong?
UPD: I thought that XMLin returned text in utf-8 (as the input). Just added
encode_utf8($data->{'tag'});
and it worked