views:

50

answers:

3

I want to know the standard algorithm for converting unicode characters into lowercase as proposed by unicode.org.

Also, do most programming languages follow this proposed standard?

A: 

Programming languages vary in how well they support unicode. Most do not have unicode characters as a built-in type. Typically it is either handled in a library, or by OS calls.

For instance, C++ doesn't have a native unicode character type, but does have locale support in the stl (which is defined as part of the language). Ada does have a native type Wide_Character, as well as library support for manipulating it.

T.E.D.
"most do not have unicode characters as built-in type": that's no longer true for more modern languages.
Joachim Sauer
Perhaps, but many of those "older" languages (eg: The C family) are still in immensely heavy use. A lot of those "more modern languages" get more press than use. Still, they are available if native unicode support is important to you.
T.E.D.
Thanks for the info!
Albert
Even "modern" languages like Java and C# don't actually have a Unicode character type; `char` means a UTF-16 code unit, which could be only half of a character.
dan04
A: 

.NET does have unicode support and offers built-in functions to switch between upper and lower case. This is probably true with some other languages, as well.

Russ
.NET is not a language.
mickeyf
.NET is a platform, not a language. Win32 has unicode support as well.
T.E.D.
But it applies to all languages using the .NET Framework, including C#, VB.NET, F#, etc.
Russ
+1  A: 
dan04
How about javascript? Does it follow the standard?
Albert
JavaScript implements the basic casing rules, but not the special ones.
dan04
That's great info. Thanks!
Albert