ansaurus

Question

Storing regional (slovenian) characters in the database

Answer 1

+2 A:

I prefer to use the NVARCHAR() datatype instead. NVARCHAR uses Unicode, which is a lot friendlier when it comes to localization.

Anyway, it's definitely a material issue that the database was initially created with a different collation. When you set the collation on a database, what you are actually doing is setting the default collation for newly created objects. Take a look at the tables themselves. I'm willing to bet that they are still set to the old collation. You may have to recreate or ALTER the tables and indices in order for the new collation to take effect.

http://msdn.microsoft.com/en-us/library/ms175835.aspx

Dave Markle 2010-08-05 12:23:18

Well that I know, but it isn't really relevant... As far as I can see varchar is perfectly capable of storing characters that I am interested in - it successfully does so on the very same server!

Rekreativc 2010-08-05 12:32:25

True enough, but always using Unicode eliminates the need to use default code page and such. Sometimes Slovenian isn't the only language which will fill those columns. IMO by not using Unicode, you open yourself up to pain. Just my opinion.

Dave Markle 2010-08-05 13:20:35

+1 for MSDN link which explains that collation change of DB does not affect user-defined tables which already exists

pastacool 2010-08-05 13:24:30

Answer 2

+1 A:

Have you definitely changed the collation on the database itself? Not just the column? When I try the following script on a test database and switch the database collation back and forth between slovenian and latin I get different results for the č character (the N prefixed version always works)

SET NOCOUNT ON

DECLARE @testtable TABLE
(
A VARCHAR(5) COLLATE Slovenian_CI_AS,
B  VARCHAR(5) COLLATE Slovenian_CI_AI
)

INSERT INTO @testtable
VALUES ('čžš','čžš')

INSERT INTO @testtable
VALUES (N'čžš',N'čžš')

SELECT *,CAST(A AS VARBINARY(6)) ,CAST(B AS VARBINARY(6))  
FROM @testtable

Slovenian_CI_AS

A     B                    
----- ----- -------------- --------------
čžš   čžš   0xE89E9A       0xE89E9A
čžš   čžš   0xE89E9A       0xE89E9A

Latin1_General_CI_AS

A     B                    
----- ----- -------------- --------------
cžš   cžš   0x639E9A       0x639E9A
čžš   čžš   0xE89E9A       0xE89E9A

Martin Smith 2010-08-05 13:15:12

You are correct. The database and the table collation ware correctly set to Slovenian_CI_AS , however the **column** collation remained at the previous setting (from before the collation was changed on the database). Thank you!

Rekreativc 2010-08-05 13:38:20

@Rekreativc - That was actually Dave Markle's suggestion I thought you might have the opposite problem!

Martin Smith 2010-08-05 13:40:54

ansaurus

tags:

views:

answers:

Storing regional (slovenian) characters in the database

related questions