ansaurus

Question

Is any substring of an encrypted hash (md5, sha1) more "random" than another?

Answer 1

+6 A:

All substrings of a good hash (and md5 is reasonably good despite being cryptographically unsafe) are equally random, so yes, take any bits you like from the string, they should be equally distributed.

Gintautas Miliauskas 2010-09-29 07:56:22

Answer 2

+2 A:

I was curious myself, so I went ahead and wrote a program to test this. You'll need Crypto++ to compile the code.

Disclaimer: When it comes to cryptography, or even just mathematics in general, I know just enough to shoot myself in the foot. So, take the following results with a grain of salt and keep in mind that I only have a cursory knowledge of the tools I'm using.

I only sampled three substrings: the first 8 bytes, the middle 8 bytes, and the last 8 bytes. Long story short, they're equally random.

However, when using a smaller sample space, it appears as if the last 8 bits are slightly more random. The larger the sampling space, the closer all three substrings approach complete randomness.

1000 iterations:

First:  0.995914
Middle: 0.996546
Last:   0.998104

5000 iterations:

First:  0.998387
Middle: 0.998624
Last:   0.999501

10000 iterations:

First:  0.999614
Middle: 0.999457
Last:   1

30000 iterations:

First:  1
Middle: 1
Last:   1

"Randomness" is measured by Crypto++'s MaurerRandomnessTest class. For reference, the executable compiled from the above code has a randomness value of 0.632411 and a copy of Shakespeare's Macbeth downloaded from Project Gutenburg has a randomness value of 0.566991.

kurige 2010-09-29 10:41:20

I'm marking this accepted as it actually demonstrates the "randomness." Thanks @kurige!

macek 2010-09-29 17:53:23

Answer 3

+2 A:

Nitpick: "random" is the wrong word to use here, since hash functions are deterministic.

As for answering what you mean :), a desirable property of hash functions is achieving the Avalanche effect: basically, to have every bit of input cause drastic changes to the output. So, for a well-designed hash, every substring should be affected equally often ("be as random") as any other.

snemarch 2010-09-29 11:50:05

@snemarch, I put the word random in quotes for this very reason :) +1 for linking to the avalanche effect.

macek 2010-09-29 17:54:47

ansaurus

tags:

views:

answers:

Is any substring of an encrypted hash (md5, sha1) more "random" than another?

related questions