ansaurus

Question

Sql trying to change case letter and group similar nvarchar values

Answer 1

A:

Try replacing ı and such with english equivalent after lowercasing

Ray 2009-08-05 13:29:37

Well, I'm searching for a way that could resolve that automatically, because it could be another letter apart form "i" so you can't just write down every possible situation.

Izabela 2009-08-05 13:34:01

Answer 2

+2 A:

When you group, you should use an Accent Insensitive collation. You can add this directly to your group by clause. The following is an example:

Declare @Temp Table(Data nvarchar(100))

Insert Into @Temp Values(N'izla')
Insert Into @Temp Values(N'İZLA')
Insert Into @Temp Values(N'IZLA')
Insert Into @Temp Values(N'Izla')

Select  Data, 
     Count(*) 
From    @Temp 
Group By Data

Select  Data Collate Latin1_General_CI_AI, 
     Count(*) 
From    @Temp 
Group By Data Collate Latin1_General_CI_AI

When you run this example, you will see that the first query creates two rows (with count 3 and count 1). The second example uses an accent insensitve collation for the grouping, so all 4 items are grouped together.

I used Latin1_General_CI_AI in my example. I suggest you examine the collation of the column you are using and then use a collation that most closely matches by changing the AS on the end to AI.

G Mastros 2009-08-05 13:41:43

Thank you! It helps a lot.

Izabela 2009-08-05 13:48:56

Don't you mean Case Intensitive collation? Or am I missing something about accents?

pjp 2009-08-05 13:56:16

@pjp, you're right. I changed the explanation. Thanks for pointing this out.

G Mastros 2009-08-05 13:59:09

Answer 3

A:

This all comes down to collation, which is the way that the system sorts string data.

You could say something like:

SELECT *, COUNT(*) OVER (PARTITION BY fieldname COLLATE Latin1_General_CI_AI), COUNT(*) OVER (PARTITION BY fieldname COLLATE Latin1_General_CI_AS)
FROM yourtable

This will provide some nice figures for you around how many times each name appeared in the various formats. There are many collations, and you can search in Books Online for a complete list. You may also be interested in Latin1_General_BIN for example.

Rob

Rob Farley 2009-08-05 13:53:10

ansaurus

tags:

views:

answers:

Sql trying to change case letter and group similar nvarchar values

related questions