How can I convert a string like Žvaigždės aukštybėj užges or äüöÖÜÄ to Zvaigzdes aukstybej uzges or auoOUA, respectively, using Bash?

Basically, I just want to replace every character that isn't a plain ASCII Latin letter with its closest unaccented equivalent.

Thanks

+5  A: 

Depending on the iconv implementation installed on your machine, you can try piping your strings through

iconv -f utf-8 -t ascii//translit

(replace utf-8 after -f with your actual input encoding if it's something else)
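
As a minimal sketch of that pipe, the command can be wrapped in a small function. The name to_ascii is made up for this example and is not part of iconv or Bash. The exact result depends on the iconv implementation and, with glibc, on the active locale; characters that can't be transliterated typically come out as ? there.

```shell
# to_ascii: hypothetical helper name, used only for illustration.
# Reads UTF-8 text on stdin and writes an ASCII approximation to stdout.
to_ascii() {
    iconv -f utf-8 -t ascii//TRANSLIT
}

# Usage (output depends on your iconv and locale):
#   printf '%s\n' 'äüöÖÜÄ' | to_ascii
```

Using printf instead of echo avoids surprises with backslashes or leading dashes in the input string.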

Michael Krelin - hacker
+1  A: 
echo 'Hej på dig, du den dära' | iconv -f utf-8 -t us-ascii//TRANSLIT

gives:

Hej pa dig, du den dara
Emil Vikström
+1  A: 

You might be able to use iconv...

For example, suppose the string

Žvaigždės aukštybėj užges or äüöÖÜÄ

is stored in the file testutf8.txt, encoded as UTF-8.

Running the command

iconv -f UTF-8 -t US-ASCII//TRANSLIT testutf8.txt

results in

Zvaigzdes aukstybej uzges or auoOUA
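
If you want the result in a file instead of on the terminal, a rough sketch could look like the following. The function name ascii_copy and the output file name in the usage line are assumptions for illustration only.

```shell
# ascii_copy: hypothetical helper name, used only for this sketch.
# Usage: ascii_copy INPUT OUTPUT
ascii_copy() {
    # Write to a separate file: redirecting onto the input file itself
    # would truncate it before iconv gets to read it.
    iconv -f UTF-8 -t US-ASCII//TRANSLIT "$1" > "$2"
}

# Usage:
#   ascii_copy testutf8.txt testascii.txt
```

The function returns iconv's exit status, so a caller can check whether the conversion was reported as clean.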

Steve De Caux