Hello,
We have HTML source files which contain special characters encoded as &#nnnn;
like in the word:
außergewöhnlich
We would like to convert them into plain UTF-8:
außergewöhnlich
Is there any small tool to do that?
Hello,
We have HTML source files which contain special characters encoded as &#nnnn;
like in the word:
außergewöhnlich
We would like to convert them into plain UTF-8:
außergewöhnlich
Is there any small tool to do that?
You can do this with perl, and HTML::Entities
if you wish.
echo 'echo 'außergewöhnlich' |
perl -MHTML::Entities -pe'binmode STDOUT, ":utf8"; HTML::Entities::decode_entities($_)'