collation

Collation STRENGTH and local language relation

I have read the following from Collator's Javadoc. "The exact assignment of strengths to language features is locale dependant. For example, in Czech, "e" and "f" are considered primary differences, while "e" and "ê" are secondary differences, "e" and "E" are tertiary differences and "e" and "e" are identical." Does this mean that I sh...

Sort order in SQLite3 with umlauts

I'm working with a sqlite database, using python/django. I would need to have my results, which contain german umlauts (ä,ö,ü) to be correctly sorted (not with the umlauts at the end). Reading different articles on the web I'm not even sure if it is possible or not. So any advise/instructions on that are appreciated. I already studied th...

Collation Conflict

i always got the message "Cannot resolve the collation conflict between "SQL_Latin1_General_CP1_CI_AS" and "Latin1_General_CI_AI" in the equal to operation." when i run the script that came from MSSQL 2005 server. btw i used MSSQL 2008. here is the script USE [database1] GO SET ANSI_NULLS ON GO SET QUOTED_IDENTIFIER ON GO --exec rsp_S...

mysql check collation of a table

Simple question, how can I see what collation a table has? I.E. I want to see: +-----------------------------+ | table | collation | |-----------------------------| | t_name | latin_general_ci | +-----------------------------+ Thanks for your help. ...

Collation with multiple languages

Hi, I am installing SQL 2008 for a web project. My question is which Collation should I select for a web site database which can receive all languages and characters? My understanding is that I can use varchar and nvarchar to Insert the data for specific character coming in from the web site. I assume varchar and nvarchar will mange t...

Is there something better than the kakasi library for gojûon collation?

"Better" primarily means accuracy, but I am also interested any other criteria in which other systems excel. I sampled the Perl binding Text::Kakasi for correctness in an admitted limited fashion and it works just fine for our needs. use utf8; use Encode; use Text::Kakasi; use Unicode::Collate; my $k = Text::Kakasi->new(qw(-iutf8 -outf...

"Illegal mix of collations" error from MySql while running rails test suite

I've recently dusted off an old Ruby on Rails project of mine. In the past, I've never had any problems getting all the tests to pass, but now there is one test that gives me the following error: ActiveRecord::StatementInvalid: Mysql::Error: #HY000Illegal mix of collations (latin1_swedish_ci,IMPLICIT) and (utf8_general_ci,COERCIBLE)...

Sorting a column against different collations in SQL Server

I have a Companies table with a multilingual CompanyName column (defined as nvarchar(512)). I also have a stored procedure to search through and return a list of companies, it takes 2 nvarchar parameters - one is the search term, and the other is an ISO language code. What I would like to be able to do is return the results of the sear...

what languages are supported in icu collation?

I was browsing through the ICU source code (http://icu-project.org/), and I couldn't find what languages it supports out of the box for collation. Could someone help me? ...

Can Unicode compression in SQL 2008 R2 cause GUIDs to become corrupt?

We recently had some trouble with our production SQL 2008 R2 databases and had to failover to log-shipped warm spares for most of our databases. This morning I found some very odd GUID values scattered throughout our set of databases. For background, we have a main clients database, that holds various information about installed clients,...

i get an error when creating a variable and specifing the collation in sql server 2005

when i tried this: DECLARE @var nvarchar(500) collate Arabic_BIN i got that: Msg 156, Level 15, State 1, Line 1 Incorrect syntax near the keyword 'collate'. that is the full code, it works, i do not know how but the person who give it to me have used it successfully CREATE FUNCTION fn_RemoveTashkeel (@InputString nvarchar(2300) ) ...

Converting utf8_general_ci tables and fields to utf8_unicode_ci

Hi Everyone, I have made a mistake when designing my application database several years ago and collation settings of my tables and table fields are mixed. Some of them are utf8_general_ci and some of them are utf8_unicode_ci. This causes problems when joining tables with different collations. Now, I am planning to change collation set...

Issues with stored JSON encoded POSTs in MySQL

I have a situation where after several years of use we are suddenly have some JSON-encoded values that are giving our Perl script fits due to backslashes. The issues are with accented characters like í and é. An example is Matí encoded as Mat\ud873. It is unclear what may have changed in the environment. PHP, Perl, and MySQL are involv...

What issues to anticipate having different collations between development and production SQL Servers?

Subquestioning "Unable to update sys.columns - any other approach?" vaguely mentioning the problems on deployment to server with a different collation. The problem is that by default SQL Server collation is determined (during setup) by Windows Regional and Language Options --> Advanced --> "Select a language to match the language vers...

How is Kana sensitivy applied to non-Japanese letters?

Why is Kana Sensitivity applied to Latin and Cyrillic collations? i.e. "_KS" in, for ex., Latin1_General_100_CS_AI_KS I tried to search for Kana and get exactly what I expected: When Japanese kana characters Hiragana and Katakana are treated differently, it is called Kana sensitive. How is it applied to Latin letters? What am I l...

how to script non-default collation and skip explicit scripting for default collation?

In SSMS 2008 R2 I create the table aTest(Albnian varchar(10), Dflt varchar(10)) In SSMS table designer both column properties have Collation: < database default > (under Column Properties--> "Table designer") I change collation of column Albnian to a non-default, for ex., to Albanian_CI_AS. If I script the table in SSMS (right-cli...

Changing MS SQL Collation To Store Different Types Of Characters

Hi everyone, I am working on a website where the content has to be displayed in English, Polish, Slovak and Czech. Currently, everytime a special character used in the non-english languages (such as Á or â), it displays as just a normal A or a. I've traced the problem down to the database. Currently the collation is "Latin1_General_CI_A...