Hi there, I'm doing a large data migration between two file systems (let's call them F1 and F2) on a Linux system. The migration necessarily involves copying the data verbatim into a differently structured hierarchy on F2 and renaming the files.

I'd like to write a script to generate a list of the files which are in F1 but not in F2, i.e. the ones which weren't copied by the migration script into the new hierarchy, so that I can go back and migrate them manually. Unfortunately, for reasons not worth going into, the migration script can't be modified to list the files it doesn't migrate. My question differs from this previously answered one because I cannot rely on file names for the comparison.

I know the basic outline of the process would be:

  1. Generate a list of checksums for all files, recursing through F1
  2. Do the same for F2
  3. Compare the two lists, ignoring the file names, and work out which checksums are present in F1's list but missing from F2's, to find the files which are in F1 but not in F2.

I'm kind of stuck getting past that stage, so I'd appreciate any pointers on which tools to use. I think I need to use the 'comm' command to compare the two lists of checksums, but since md5sum, sha512sum and the like put the file name next to the checksum, I can't see a way to get a useful comparison out of them. Maybe awk is the way to go?
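
For concreteness, this is roughly what I've tried so far (the mount points /mnt/f1 and /mnt/f2 are made up for illustration), and it doesn't do what I want because each line still carries a file name that's unique to its own hierarchy:

find /mnt/f1 -type f -exec md5sum {} \; | sort > f1-sums.txt
find /mnt/f2 -type f -exec md5sum {} \; | sort > f2-sums.txt
comm -23 f1-sums.txt f2-sums.txt

The comm output ends up listing practically every line of f1-sums.txt, because the file names never match even when the checksums do.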

I'm using Red Hat Enterprise Linux 5.x.

Thanks.

+1  A: 

Perhaps take a look at the source code of FSLint for pointers: http://code.google.com/p/fslint/source/browse/trunk/fslint/findup

Ash Kim
+2  A: 

On F1:

# find / -type f -exec md5sum {} + > F1

On F2:

# find / -type f -exec md5sum {} + > F2

then:

# diff F1 F2

You might want to check more options for find; this line only finds regular files.
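
For example, if F1 were mounted at a hypothetical /mnt/f1, you could point find at that mount point and add -xdev so it doesn't descend into any other filesystems mounted underneath it:

# find /mnt/f1 -xdev -type f -exec md5sum {} + > F1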

jgr
that's awesome!
Ash Kim
Thanks user362458, that's useful. However, because 'md5sum' puts the name of the file next to the checksum, no line in the file 'F1' will match any line in 'F2', even if the checksums are identical.
grw
Ah, I read your post a bit too fast. If a new hierarchy is created you'd have to go for a solution like Unknown's at the bottom, preferably without the UUOC ;)
jgr
+1  A: 

You can do something like this:

f1# find yourrootdir -type f -exec sha1sum {} >> initial_files \; 
f1# ...copy initial_files to machine f2...
f1# ...start copy...
f2# find yournewrootdir -type f -exec sha1sum {} >> final_files \;
f2# sort initial_files > INITIAL
f2# sort final_files > FINAL
f2# for sha1 in `comm -23 <(cat INITIAL | awk '{print $1}') <(cat FINAL | awk '{print $1}')`; do grep $sha1 INITIAL; done

This will show the lines in "initial_files" whose SHA1 doesn't appear anywhere in final_files.

The last line feeds only the sha1sums into a comm command, then greps initial_files for each sha1sum that's missing from final_files.
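
If you'd rather avoid the extra cat calls (the UUOC mentioned in the comments above), one rough equivalent of that last line, using the same INITIAL and FINAL files, would be something like:

f2# comm -23 <(awk '{print $1}' INITIAL) <(awk '{print $1}' FINAL) | while read sha1; do grep "$sha1" INITIAL; done

It still runs grep once per missing checksum, but the cats and the backticks go away.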

Unknown
That's absolutely great - exactly what I was looking for. Does the job wonderfully :)
grw
Hi Unknown, I've made your solution into a script, and licensed it under the GPL. I hope this is okay with you; it seems like the best way to ensure that anyone can use it. If that's a problem, let me know and I'll take it down. http://github.com/capncodewash/Misc-shell-scripts/blob/master/find_missing_files.sh
grw
Don't worry about it, it's not like it's some top secret algorithm... :)
Unknown