which is the best collation for European + English language | ansaurus

tags:

views:

65

answers:

1

+2 Q:

which is the best collation for European + English language

HI There,

i am developing for European languages and also for English, the string are stored as NVARCHAR in sql server 2005.

so, which is the best collation to be used is "Latin1_General_CI_AS" covers all? there are variations as well like Latin1_General_CP1_CI_AS,Latin1_General_BIN,Latin1_General_BIN2 etc

comments\suggestions appreciated.

Regards DEE

+1 A:

For general purpose sorting "General Latin1" is probably the best choice for western European and English languages.

I believe that if the code page (e.g., CP1) is not specified, then it defaults to code page 1252 (which is also what CP1 signifies). So my understanding is that Latin1_General_CI_AS and Latin1_General_CP1_CI_AS are equivalent. Given that, my opinion is that Latin1_General_CP1_CI_AS would be the better choice for clarity reasons. Whether you use CI_AS, CS_AS, or CI_AI is purely a usability issue based on whether you want case sensitivity and/or accent sensitivity. With CI, "a" == "A" and with AI, "á" == "â".

The _BIN and _BIN2 options signify that the collation will be binary based on the code point values. For sorting purposes, you probably do not want that because the order would not necessarily match any kind of dictionary order. However, if you are only using the index for searching for data, then one of those might be appropriate because it could be faster. Relatively little computation is necessary to convert a character value to the associated key value.

Edit As Martin points out in the comment, the code page will not matter unless you are using char, memo, or varchar. If you stick completely with Unicode (nchar, nvarchar, nmemo), then the code page will not come into play. If you translate a Unicode character to a single-byte character, though, it will be used.

Mark Wilkins 2010-08-12 15:50:39

+1 Just to mention to avoid any confusion that the code page applies to `CHAR` representations rather than `NVARCHAR`.

Martin Smith 2010-08-12 15:54:28

@Martin, That's a good point. I'll add that.

Mark Wilkins 2010-08-12 15:58:13

Thanks Mark , this was really helpfull

DEE 2010-08-17 09:04:34

related questions

SQL Server, convert a named instance to default instance?

Natural (human alpha-numeric) sort in Microsoft SQL 2005

In MS SQL Server 2005, is there a way to export, the complete maintenance plan of a database as a SQL Script?

Does ReadUncommitted imply NoLock

Query to list all tables that contain a specific column with SQL Server 2005.

Can I maintain state between calls to a SQL Server UDF?

What are the benefits of using partitions with the Enterprise edition of SQL 2005

Upgrading Sharepoint 3.0 to SQL 2005 Backend?

What's the simplest way to execute a query in Visual C++

Can you perform an AND search of keywords using FREETEXT() on SQL Server 2005?

Create a database from another database?

What's the fastest way to bulk insert a lot of data in SQL Server (C# client)

SQL 2005 Book For Optimization Techniques

How do I create a mapping table in SQL Server Management Studio?

Cannot Add a Sql Server Login

SQL Server 2005 - Export table programatically (run a .sql file to rebuild it)

Diagnosing Deadlocks in SQL Server 2005

SQL2005: Linking a table to multiple tables and retaining Ref Integrity?

How to create a new instance of Sql Server 2005

SQL Server 2008 vs 2005 Linq integration

How do you kill all current connections to a SQL Server 2005 database?

Conditional Visibility and Page Breaks with SQL Server 2005 Reporting Services

SQL Server 2005 and 2008 on same developer machine?

Paging SQL Server 2005 Results

SQL 2005 For XML Explicit - Need help formatting