There has been a similar question concerning comparison of file trees, but it was suggested to start a new question.
I think to write a file tree comparison tool for huge directory trees including extended attributes. The available tools like find, diff, xattr seem to be not optimum for that. I want to
compare huge file trees incrementally, i.e. not as a batch but with immediate feedback
select what to look for, what to ignore
compare huge file systems, like Time Machine backups
add a GUI later that
makes it a snatch to browse and compare such huge trees.
refine searches, prune, relax, interactively improve the result
- ...
What to compare and what to ignore will be very general, to such extremes like ignoring even file names and just comparing the existence and value of certain xattrs.
The question:
Does something like that exist, partially? I can write a tool from scratch, but it would be a waste if I could extend a good existing project, provided it is python.
A hint if it makes sense to write a Pathfinder plugin would be welcome as well, although I doubt it would fit it.
While I am at it: please feel free to add your own ideas what the tool should be able to do.
(modulo offline comparisons are a possible later extension, but too far-fetched)
The reason
Actually I have read many posts about Time Machine and xattr, tried them all, but still failed to move a backup between machines and make Time Machine believe they are identical. I want to solve the missing spot, and finally produce a tool for Time Machine. But first I need this tool, which I think will make sense, alone.
Related question
http://stackoverflow.com/questions/3851884/compare-folders-recursively-using-python
cheers -- chris