views: 1957
answers: 5

We recently came across some sample code from a vendor for hashing a secret key for a web service call. Their sample was in VB.NET, which we converted to C#. The conversion caused the hash to be computed over different input bytes. It turns out they were generating the encryption key by converting a char array to a string and back to a byte array. This led me to the discovery that the default encoders in VB.NET and C# appear to work differently for some characters.

C#:

Console.Write(Encoding.Default.GetBytes(new char[] { (char)149 })[0]);

VB:

Dim b As Char() = {Chr(149)}
Console.WriteLine(Encoding.Default.GetBytes(b)(0))

The C# output is 63, while VB gives the expected byte value of 149. If you use other values, such as 145, the outputs match.

Stepping through in the debugger, the default encoder in both VB and C# is SBCSCodePageEncoding.

Does anyone know why this is?

I have corrected the sample code by initializing a byte array directly, which is what it should have done in the first place, but I still want to know why the encoder, which should not be language specific, appears to be just that.
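For reference, the fix is simply something like this (a sketch; the key bytes shown are placeholders, not the vendor's actual key):

// Placeholder key bytes; the real values come from the vendor.
// Initializing the byte array directly avoids the char/string round
// trip through a code-page-dependent encoding.
byte[] key = new byte[] { 0x95, 0x91, 0x12, 0x34 };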

A: 

The default encoding is machine dependent, as well as thread dependent, because it uses the current code page. You should generally use something like Encoding.UTF8 so that you don't have to worry about what happens when one machine is using Unicode and another is using the 1252 ANSI code page.
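For example, a round trip through Encoding.UTF8 is lossless for (char)149, whatever code page the machine happens to use (a minimal sketch, not the vendor's code):

char[] chars = { (char)149 };
// UTF-8 can represent U+0095 directly, so nothing gets replaced with '?'
byte[] bytes = Encoding.UTF8.GetBytes(chars);              // 0xC2 0x95
Console.WriteLine((int)Encoding.UTF8.GetChars(bytes)[0]);  // 149 on any machine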

JasonRShaver
A: 

Different operating systems might use different encodings as the default. Therefore, data streamed from one operating system to another might be translated incorrectly. To ensure that the encoded bytes are decoded properly, your application should use a Unicode encoding, that is, UTF8Encoding, UnicodeEncoding, or UTF32Encoding, with a preamble. Another option is to use a higher-level protocol to ensure that the same format is used for encoding and decoding.

from http://msdn.microsoft.com/en-us/library/system.text.encoding.default.aspx

Can you check what each language produces when you explicitly encode using UTF-8?
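In C#, for instance, the check could look like this (a sketch; the VB side would need ChrW(149) to start from the same char value):

byte[] utf8 = Encoding.UTF8.GetBytes(new char[] { (char)149 });
Console.WriteLine(BitConverter.ToString(utf8)); // C2-95, regardless of code page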

marduk
+11  A: 

If you use ChrW(149) you will get a different result: 63, the same as the C# code.

Dim b As Char() = {ChrW(149)}
Console.WriteLine(Encoding.Default.GetBytes(b)(0))

Read the documentation to see the difference; that will explain the answer.

RichardOD
Here's a link to the documentation: http://msdn.microsoft.com/en-us/library/613dxh46(VS.80).aspx
Jon B
Cheers Jon- I was just in the process of adding a link.
RichardOD
Thanks! I was thinking it had something to do with the Chr() bit, but I wasn't sure how to avoid using Chr() in VB.NET.
Coderuckus
Glad I solved the mystery for you.
RichardOD
+4  A: 

The VB Chr function takes an argument in the range 0 to 255, and converts it to a character using the current default code page. It will throw an exception if you pass an argument outside this range.

ChrW takes a 16-bit value and returns the corresponding System.Char value without using an encoding, and hence gives the same result as the C# code you posted.

The approximate equivalent of your VB code in C#, without using the VB Strings class (the class that contains Chr and ChrW), would be:

// Decode byte 149 using the current ANSI code page, as VB's Chr does;
// on Windows-1252 this yields U+2022 (the bullet character).
char[] chars = Encoding.Default.GetChars(new byte[] { 149 });
// Encoding that char back gives 149 again, matching the VB output.
Console.Write(Encoding.Default.GetBytes(chars)[0]);
Joe
A: 

I believe the equivalent in VB is ChrW(149).

So, this VB code...

    Dim c As Char() = New Char() { ChrW(149) }
    'Dim c As Char() = New Char() { Chr(149) } ' Chr maps through the code page: prints 8226 then 149 on Windows-1252
    Dim b As Byte() = System.Text.Encoding.Default.GetBytes(c)
    Console.WriteLine("{0}", Convert.ToInt32(c(0)))
    Console.WriteLine("{0}", CInt(b(0)))

produces the same output as this C# code...

    var c = new char[] { (char)149 };
    var b = System.Text.Encoding.Default.GetBytes(c);
    Console.WriteLine("{0}", (int)c[0]);
    Console.WriteLine("{0}", (int)b[0]);
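On a Windows-1252 machine, both snippets print 149 and then 63: ChrW(149) and (char)149 are both U+0095, which has no mapping in code page 1252, so Encoding.Default encodes it as '?' (byte 63).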
Cheeso