What is the standard algorithm for converting unicode characters into lowercase? | ansaurus

tags:

views:

50

answers:

3

+2 Q:

What is the standard algorithm for converting unicode characters into lowercase?

I want to know the standard algorithm for converting unicode characters into lowercase as proposed by unicode.org.

Also, do most programming languages follow this proposed standard?

A:

Programming languages vary in how well they support unicode. Most do not have unicode characters as a built-in type. Typically it is either handled in a library, or by OS calls.

For instance, C++ doesn't have a native unicode character type, but does have locale support in the stl (which is defined as part of the language). Ada does have a native type Wide_Character, as well as library support for manipulating it.

T.E.D. 2010-08-19 13:56:10

"most do not have unicode characters as built-in type": that's no longer true for more modern languages.

Joachim Sauer 2010-08-19 14:08:32

Perhaps, but many of those "older" languages (eg: The C family) are still in immensely heavy use. A lot of those "more modern languages" get more press than use. Still, they are available if native unicode support is important to you.

T.E.D. 2010-08-19 14:15:13

Thanks for the info!

Albert 2010-08-21 04:47:35

Even "modern" languages like Java and C# don't actually have a Unicode character type; `char` means a UTF-16 code unit, which could be only half of a character.

dan04 2010-08-21 21:51:22

A:

.NET does have unicode support and offers built-in functions to switch between upper and lower case. This is probably true with some other languages, as well.

Russ 2010-08-19 13:57:47

.NET is not a language.

mickeyf 2010-08-19 14:11:15

.NET is a platform, not a language. Win32 has unicode support as well.

T.E.D. 2010-08-19 14:16:41

But it applies to all languages using the .NET Framework, including C#, VB.NET, F#, etc.

Russ 2010-08-19 14:25:51

+1 A:

dan04 2010-08-20 06:20:24

How about javascript? Does it follow the standard?

Albert 2010-08-20 12:59:57

JavaScript implements the basic casing rules, but not the special ones.

dan04 2010-08-20 13:29:41

That's great info. Thanks!

Albert 2010-08-21 04:46:28

related questions

What causes java.io.CharConversionException with EOF or isHexDigit messages in Tomcat?

C# VB.NET Conversion

Tabs and spaces conversion

Converting large ASP.NET VB.NET project to C# - incrementally?

How to migrate SVN with history to a new Git repository?

Best way to convert pdf files to tiff files

How to convert numbers between Hex and Decimal in C#

Using C#, what is the most efficient method of converting a string containing binary data to an array of bytes

How do I programmatically convert mp3 to an itunes-playable aac/m4a file?

Using Java JAR file in .NET

Best way to convert text files between character sets?

Convert from scientific notation string to float in C#

BufferedImage in IKVM

What tools exist to convert a Delphi 7 application to C# and the .Net framework?

Is there a tool that can convert common image formats (.bmp, jpg,..) to .emf files?

XAML to SVG?

Backward Converting SQL Databases

Converting latitude/longitude to Alberta 10 TM Projection

Migrating from ASP Classic to .NET and pain mitigation

PHP ToString() equivalent

Java: Best way of converting List<Integer> to List<String>

Easy way for Crystal Reports to MS SQL Server Reporting Services conversion

CVS to SVN conversion and reorganizing branches

C# Convert Integers into Written Numbers

Are there any conversion tools for porting Visual J# code to C#?