ansaurus

Question

Is there a function that returns the root letter for special chars?

Answer 1

+9 A:

See if this works out for you:

http://weblogs.asp.net/fmarguerie/archive/2006/10/30/removing-diacritics-accents-from-strings.aspx

Chetan Sastry 2009-02-09 19:24:34

Answer 2

+2 A:

By the way (completely unrelated to the question), your code operates on strings. This isn't only less efficient, it actually doesn't really make sense since you're interested in individual characters rather than strings, and these are distinct data types in .NET.

To get a single-character literal rather than a string literal, append c to your literal:

Select Case c
  Case "á"c, "à"c, "ã"c, "â"c, "ä"c, "ª"c : x = "a"c
  ' … and so on. '
End Select

Konrad Rudolph 2009-02-09 19:28:18

Answer 3

+1 A:

taken from Chetan Sastry response, here I give you the VB.NET code and the C# one copied from his GREAT answer :)

VB:

Imports System.Text
Imports System.Globalization

''' <summary>
''' Removes the special attributes of the letters passed in the word
''' </summary>
''' <param name="word">Word to be normalized</param>
Function RemoveDiacritics(ByRef word As String) As String
    Dim normalizedString As String = word.Normalize(NormalizationForm.FormD)
    Dim r As StringBuilder = New StringBuilder()
    Dim i As Integer
    Dim c As Char

    For i = 0 To i < normalizedString.Length
        c = normalizedString(i)
        If (CharUnicodeInfo.GetUnicodeCategory(c) <> UnicodeCategory.NonSpacingMark) Then
            r.Append(c)
        End If
    Next

    RemoveDiacritics = r.ToString
End Function

C#

using System.Text;
using System.Globalization;

/// <summary>
/// Removes the special attributes of the letters passed in the word
/// </summary>
/// <param name="word">Word to be normalized</param>
public String RemoveDiacritics(String word)
{
  String normalizedString = word.Normalize(NormalizationForm.FormD);
  StringBuilder stringBuilder = new StringBuilder();
  int i;
  Char c;

  for (i = 0; i < normalizedString.Length; i++)
  {
    c = normalizedString[i];
    if (CharUnicodeInfo.GetUnicodeCategory(c) != UnicodeCategory.NonSpacingMark)
  stringBuilder.Append(c);
  }

  return stringBuilder.ToString();
}

I hope it helps someone like me :)

balexandre 2009-02-09 19:48:39

Answer 4

A:

hi guys, there is as simple method compare string in .NET

public static string NormalizeString(string value) { string nameFormatted = value.Normalize(System.Text.NormalizationForm.FormKD); Regex reg = new Regex("[^a-zA-Z0-9 ]"); return reg.Replace(nameFormatted, ""); }

Javier Mateos 2010-01-29 23:22:46

ansaurus

tags:

views:

answers:

Is there a function that returns the root letter for special chars?

related questions