ansaurus

Question

checksum/hash function with reversible property

Answer 1

+1 A:

Just add and subtract your codes. With h(x) being any hash function:

hm2 = hm1 + h(B->Y)
hm3 = hm2 + h(B->Y1)
hm4 = hm3 - h(B->Y1) 
hm5 = hm4 + h(B->Y2)

hb2 = hb1 + h(B->Y)
hb3 = hb2 + h(B->Y2)

As long as hm1 and hb1 start out equal, hm5 and hb3 are equal.

Note that it does not have to be addition and subtraction. Any reversible operation will work (in theory multiplication/division can also work, but overflow is more of a problem, and a hash of 0 cannot be divided back out).
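As a rough sketch of the add/subtract idea in Python: the names h, MOD, add_entry and remove_entry, the sample entries, and the choice of SHA-256 truncated to 64 bits are all illustrative assumptions, not something the question specified. Working modulo 2**64 is one way to keep overflow from being a concern.

```python
import hashlib

MOD = 2 ** 64  # assumed word size; sums wrap around instead of overflowing


def h(key, value):
    """Hash one key->value entry to a 64-bit integer (illustrative choice of hash)."""
    data = f"{key!r}->{value!r}".encode("utf-8")
    return int.from_bytes(hashlib.sha256(data).digest()[:8], "big")


def add_entry(total, key, value):
    """Fold an entry into the running checksum."""
    return (total + h(key, value)) % MOD


def remove_entry(total, key, value):
    """Undo add_entry for the same entry."""
    return (total - h(key, value)) % MOD


# First map: add B->Y1, then replace it with B->Y2.
hm = add_entry(0, "A", "X")
hm = add_entry(hm, "B", "Y1")
hm = remove_entry(hm, "B", "Y1")
hm = add_entry(hm, "B", "Y2")

# Second map: reaches the same contents directly, in a different order.
hb = add_entry(0, "B", "Y2")
hb = add_entry(hb, "A", "X")

assert hm == hb
```

Because modular addition is commutative, the checksum depends only on which entries are present, not on the order in which they were added or removed.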

CookieOfFortune 2009-04-05 18:51:29

Ah, good idea! You may want to use XOR instead of addition/subtraction so you don't need to worry about overflow.

Brian Campbell 2009-04-05 19:03:56
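A minimal sketch of the XOR variant, under the same kind of assumptions (h and toggle are hypothetical names, and SHA-256 truncated to 64 bits stands in for "any hash function"):

```python
import hashlib


def h(entry):
    """64-bit hash of an entry string such as "B->Y1" (illustrative choice of hash)."""
    return int.from_bytes(hashlib.sha256(entry.encode("utf-8")).digest()[:8], "big")


def toggle(total, entry):
    """XOR is its own inverse, so the same call both adds and removes an entry."""
    return total ^ h(entry)


total = 0
total = toggle(total, "B->Y1")  # add B->Y1
total = toggle(total, "B->Y2")  # add B->Y2
total = toggle(total, "B->Y1")  # remove B->Y1

assert total == h("B->Y2")
```

One thing to keep in mind with XOR: adding the same entry twice cancels it out, so this only fits data where each entry appears at most once, as in a map.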

Answer 2

A:

Hmm. I'm not sure about a hash function that does exactly what you are asking for. But it seems that a structure similar to how Git stores its revisions might do what you need (which was inspired by how Monotone stored its revisions).

Git computes the SHA-1 sum of each of the files in the repository. These are used as blob identifiers. It then has a tree, which maps filenames to blob IDs (and other sub-trees, for subdirectories). The identifier of a tree is its SHA-1 sum. (Though I don't believe it's relevant to your usage, the trees are then referred to by revisions, which include things like author, date, and one or more parent revisions.)

This means that you don't have to re-compute the SHA-1 sum for each blob as you update one; you just recompute the SHA-1 for the blob that changes, and recompute the SHA-1 for the tree.

You could do the same with your data. Compute the hash of each of your objects, put all your key->hash(value) mappings into one file, and compute the hash of that. If the file containing key->hash(value) is too big for you to want to re-hash it each time, you could divide it into sections and keep a section->hash(section) mapping, where each section holds its own key->hash(value) entries. One level of branching should be sufficient for most cases, but you can build a deeper tree structure out of these if you really need to.
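The sectioned layout described above could be sketched roughly like this in Python. Everything here is an assumption made for illustration: the class name HashedStore, the use of SHA-1 via hashlib, string keys and values, assigning keys to sections by their hash, and the tab/newline separators (which assume keys contain neither).

```python
import hashlib


def digest(text):
    """SHA-1 of a string, as a hex digest."""
    return hashlib.sha1(text.encode("utf-8")).hexdigest()


class HashedStore:
    """Two-level hash: key -> hash(value) per section, then a hash over section hashes."""

    def __init__(self, num_sections=16):
        self.sections = [{} for _ in range(num_sections)]  # each: key -> hash(value)
        self.section_hashes = [self._hash_section(s) for s in self.sections]

    @staticmethod
    def _hash_section(section):
        # Sort by key so the hash does not depend on insertion order.
        lines = [f"{key}\t{value_hash}" for key, value_hash in sorted(section.items())]
        return digest("\n".join(lines))

    def _index(self, key):
        # Stable assignment of a key to a section.
        return int(digest(key), 16) % len(self.sections)

    def put(self, key, value):
        i = self._index(key)
        self.sections[i][key] = digest(value)
        # Only the section that changed is re-hashed.
        self.section_hashes[i] = self._hash_section(self.sections[i])

    def remove(self, key):
        i = self._index(key)
        self.sections[i].pop(key, None)
        self.section_hashes[i] = self._hash_section(self.sections[i])

    def root(self):
        """Hash over all section hashes; identifies the whole data set."""
        return digest("\n".join(self.section_hashes))


m = HashedStore()
m.put("A", "X")
m.put("B", "Y1")
m.put("B", "Y2")  # overwrite B

b = HashedStore()
b.put("B", "Y2")
b.put("A", "X")

assert m.root() == b.root()
```

An update re-hashes one value, one section, and the short list of section hashes, rather than every entry in the store.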

Brian Campbell 2009-04-05 18:59:53
