ansaurus

Question

C#: Issues using dictionary with languages other than english

Answer 1

+1 A:

The problem is with the enconding you are using when opening the file to read. Looks like you may be using ASCIIEncoding.

.NET handles strings internally as UTF-8, so this kind of issue would not happen internally.

Oded 2010-01-06 12:04:56

I wonder if encoding comes into it at all until you try to serialize/deserialize string/char data. How .net handles strings internally should be free of such encoding quandries and of no concern to the developer.

spender 2010-01-06 12:09:35

@spender: Reading a text file *is* deserializing character data. The encoding used for this has to be right, or the data will be corrupt.

Jon Skeet 2010-01-06 12:26:44

@Jon: I didn't make it clear that it's the second para of this answer I was commenting on.

spender 2010-01-06 12:33:37

@Oded: I'm pretty sure that `string` uses UTF-16 encoding internally.

LukeH 2010-01-06 12:47:42

Answer 2

+6 A:

I assume you're trying to get case insensitivity for the dictionary. Instead of calling ToLower, use the constructor of Dictionary which takes an equality comparer - and use StringComparer.Create(culture, true) to construct a suitable comparer.

I don't know what your second problem is about - we'd need more detail to diagnose it, including the code you're using, ideally.

EDIT: UTF-7 is almost certainly not the correct encoding. Don't just guess at the encoding; find out what it's really meant to be. Where did this text file come from? What can you open it successfully in?

I suspect that at least some of your problems are due to using UTF-7.

Jon Skeet 2010-01-06 12:05:25

Many thanks, adding the StringComparer.Create(culture, true) solved my first problem.Second one still remains, im using UTF-7 since neither UTF-8 or ASCII encodings recognized the accents.

brokencoding 2010-01-06 12:20:00

ansaurus

tags:

views:

answers:

C#: Issues using dictionary with languages other than english

related questions