ansaurus

Question

Base64 String throwing invalid character error.

Answer 1

+4 A:

You say

The string is exactly what was written to the file (with the addition of a "\0" at the end, but I don't think that even does anything).

In fact, it does do something (it causes your code to throw a FormatException:"Invalid character in a Base-64 string") because the Convert.FromBase64String does not consider "\0" to be a valid Base64 character.

  byte[] data1 = Convert.FromBase64String("AAAA\0"); // Throws exception
  byte[] data2 = Convert.FromBase64String("AAAA");   // Works

Solution: Get rid of the zero termination. (Maybe call .Trim("\0"))

Notes:

The MSDN docs for Convert.FromBase64String say it will throw a FormatException when

The length of s, ignoring white space characters, is not zero or a multiple of 4.

-or-

The format of s is invalid. s contains a non-base 64 character, more than two padding characters, or a non-white space character among the padding characters.

and that

The base 64 digits in ascending order from zero are the uppercase characters 'A' to 'Z', lowercase characters 'a' to 'z', numerals '0' to '9', and the symbols '+' and '/'.

Daniel LeCheminant 2009-04-02 18:00:40

I trim the \0 off, it still throws.

Brandon 2009-04-02 19:23:42

It still throws a FormatException, or something else? What is the exact string being passed to FromBase64String?

Daniel LeCheminant 2009-04-02 19:31:17

The exact string is a little bit long to post. Is there a size limit I don't know about? What is there is valid though, I checked it for any characters not allowed in Base64. Maybe I just did the trim wrong, although it doesn't explain why the tests are running fine.

Brandon 2009-04-02 19:36:41

@Brandon: Is the length a multiple of 4? Honestly, even if you posted the first and last 8 bytes, and the string length, that would probably be enough to see that the string is the correct format.

Daniel LeCheminant 2009-04-02 19:39:52

It is a multiple of 4, and I'm assuming the == at the end of the string (see my response to the original post) is there just for padding purposes?

Brandon 2009-04-02 19:45:07

@Brandon: Yeah, it makes the length a multiple of 4

Daniel LeCheminant 2009-04-02 19:47:44

Answer 2

+2 A:

Whether null char is allowed or not really depends on base64 codec in question. Given vagueness of Base64 standard (there is no authoritative exact specification), many implementations would just ignore it as white space. And then others can flag it as a problem. And buggiest ones wouldn't notice and would happily try decoding it... :-/

But it sounds c# implementation does not like it (which is one valid approach) so if removing it helps, that should be done.

One minor additional comment: UTF-8 is not a requirement, ISO-8859-x aka Latin-x, and 7-bit Ascii would work as well. This because Base64 was specifically designed to only use 7-bit subset which works with all 7-bit ascii compatible encodings.

StaxMan 2009-04-02 18:08:58

Answer 3

+1 A:

Daniel is correct in that you can just remove the \0, but I would say that you should attempt to determine why the null character is showing up in the first place. In C strings are a collection of characters with a '\0' at the end to specify the end of the string. C# does not use the '\0' to specify the end of the string, and can contain the null character.

Somewhere in the processing of your data the extra character is getting added. I'm sure you stored and read plenty of strings to a database just as I have, but I've never seen the string become null terminated after doing so. What database engine are you using?

NerdFury 2009-04-02 18:24:10

Answer 4

A:

If removing \0 from the end of string is impossible, you can add your own character for each string you encode, and remove it on decode.

abatishchev 2009-04-02 19:28:41

ansaurus

tags:

views:

answers:

Base64 String throwing invalid character error.

related questions