ansaurus

Question

MD5 of a string in ActionScript returns incorrect results when hex escapes are part of the string (e.g. "abc\xBF\x4E")

Answer 1

A:

Most likely, PHP and ActionScript are using different encodings for strings; one is probably using ISO-8859-1 and the other is using UTF-8.

For abcd\x28\xBF, the values are:

fcfebaeb81afe401c4b608dc684ad08f under ISO-8859-1
47ef883a009ddbe01711ece0a0a8764e under UTF-8

And for abcd\x28\xBF\x4E (your other example), the values are:

ea382d63efca32d8d7861a314a6112e3 under ISO-8859-1
dc11cdbaa05aa41640a821fb8e290eae under UTF-8
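
To see why the digests differ, it helps to look at the bytes that actually get hashed. The following is a minimal sketch in Python (not ActionScript or PHP, chosen only because it runs anywhere); the helper name `md5_hex` is made up for illustration, and it assumes the string holds the characters as code points, the way an ActionScript string literal would.

```python
import hashlib

def md5_hex(data: bytes) -> str:
    """Hypothetical helper: hex MD5 digest of a byte string."""
    return hashlib.md5(data).hexdigest()

text = "abcd\x28\xBF"  # six characters; the last one is U+00BF

latin1_bytes = text.encode("iso-8859-1")  # one byte per character
utf8_bytes = text.encode("utf-8")         # U+00BF becomes two bytes: C2 BF

print(latin1_bytes.hex(" "))  # 61 62 63 64 28 bf
print(utf8_bytes.hex(" "))    # 61 62 63 64 28 c2 bf

# MD5 hashes bytes, not characters, so the two inputs give different digests.
print(md5_hex(latin1_bytes))
print(md5_hex(utf8_bytes))
```

Only characters above \x7F are affected: for pure ASCII input both encodings produce the same bytes, which would explain why the mismatch shows up only when "some hex" is in the string.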

Chris Jester-Young 2010-02-08 22:52:35

This was exactly it. When the string is passed to the MD5 function it is converted to a ByteArray. It was using writeUTFBytes(stringname); switching to writeMultiByte(stringname, "iso-8859-1"); fixed it. I really appreciate the help, Chris.

Outclassed 2010-02-09 04:56:13

Crap, now when \x00 appears in the string it is causing the writeMultiByte to stop and just end. Let me see if I can figure this one out.

Outclassed 2010-02-09 05:59:32

Answer 2

A:

Your second problem is due to strings commonly being defined as NUL-terminated (zero-terminated) buffers, so the first \x00 is probably being treated as the end of the string.

There's a workaround, though. ISO-8859-1 defines 256 possible characters (including the NUL char). The first 256 code points in Unicode are the same as in ISO-8859-1 (the encoded bytes may differ depending on whether you use UTF-8, UTF-16, etc., but the code points are the same regardless of how you encode them).

So, if you know that all of the codepoints in your string will be in the range 0-255 (since it's latin1) and you know it's ok to have embedded NULs, you can manually iterate over your string, get the codepoint of each char and store it as a byte in your buffer. Something like this:

import flash.utils.ByteArray;

var s:String = "abc\x00d\x28\xBF";
var buffer:ByteArray = new ByteArray();
var len:int = s.length;
// Write each code point (assumed to be 0-255) as a single byte; NULs are kept.
for(var i:int = 0; i < len; i++) {
    buffer.writeByte(s.charCodeAt(i));
}

// Trace the bytes to verify the result
buffer.position = 0;
while(buffer.bytesAvailable) {
    trace("0x" + buffer.readUnsignedByte().toString(16));
}
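
The same idea can be checked outside Flash. Below is a rough Python sketch of the loop above; the function name `latin1_bytes_from_codepoints` is hypothetical. Unlike `writeByte`, which keeps only the low 8 bits of its argument, this version raises an error for code points above 255, which is a design choice of the sketch rather than something from the original code.

```python
def latin1_bytes_from_codepoints(s: str) -> bytes:
    """Hypothetical helper: store each code point (0-255) as one byte."""
    out = bytearray()
    for ch in s:
        code = ord(ch)
        if code > 0xFF:
            # The workaround only holds for code points that fit in one byte.
            raise ValueError(f"code point U+{code:04X} does not fit in one byte")
        out.append(code)
    return bytes(out)

s = "abc\x00d\x28\xBF"
buffer = latin1_bytes_from_codepoints(s)

print(len(buffer))               # 7, the embedded NUL does not cut the data short
print([hex(b) for b in buffer])
```

For input in the 0-255 range the result matches a plain ISO-8859-1 encode, embedded NUL included.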

Juan Pablo Califano 2010-02-11 02:44:36

Awesome, this works for the second issue. Thank you Juan, everything is good to go with the MD5 function. I've got one last encoding issue that I may need help with, which I will post about tomorrow: hex values 80-9F are being encoded as 3F, while values above and below that range (e.g. 2E and A0) are fine, when using the FileStream.writeBytes() function.

Outclassed 2010-02-16 21:12:06

It seems the third problem was another encoding issue. The program was using Windows-1252 as the encoding, which was dropping values in the 80-9F range (even though Wikipedia shows Windows-1252 supporting characters in this range). Switching to ISO 8859-1 fixes the issue.
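
One plausible explanation, not confirmed anywhere in this thread: Windows-1252 does use bytes 80-9F, but for printable characters such as the euro sign (U+20AC at byte 80), not for the code points U+0080-U+009F. A string holding those code points has no Windows-1252 representation, and many encoders substitute "?" (3F). The Python sketch below reproduces that pattern with Python's own `cp1252` codec; the helper name `encode_with` is made up, and Flash's encoder may not behave identically.

```python
def encode_with(ch: str, encoding: str) -> bytes:
    """Hypothetical helper: encode one character, substituting '?' if unmappable."""
    return ch.encode(encoding, errors="replace")

for ch in ["\x2E", "\x80", "\x9F", "\xA0"]:
    latin1 = encode_with(ch, "iso-8859-1").hex()
    cp1252 = encode_with(ch, "cp1252").hex()
    print(f"U+{ord(ch):04X}  iso-8859-1={latin1}  cp1252={cp1252}")

# U+002E  iso-8859-1=2e  cp1252=2e
# U+0080  iso-8859-1=80  cp1252=3f
# U+009F  iso-8859-1=9f  cp1252=3f
# U+00A0  iso-8859-1=a0  cp1252=a0
```

ISO-8859-1 maps every code point from 0 to 255 to the byte with the same value, which is why switching to it leaves the 80-9F range intact.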

Outclassed 2010-02-17 19:28:35

Cool. Glad to see you sorted it out.

Juan Pablo Califano 2010-02-17 23:14:12
