Hello
I need to write code in python language for comparing the text of document uning fingerprint techniques. I do not know to take fingerprint of a document or to generate fingerprint of a document. Please if anyone know the method or have a source code for generating fingerprints of documents which is stored in bits form.
please guid...
I'm wondering if there's an integrated solution to have a database with versioned records supported by rails (ala version_fu ar_versioned) and a differ thanks!
...
i need to write a module to detect similar documents.
i have read many papers of fingerprints of documents technique and others but i do not know
how to write code or implement. the algorithm should work for Chinese, Japanese, English and German language or language independent.
can you help me. or sending me code for this problem
i am ...
For years, one of my most important tools has been incremental copy utility that compares the contents of two directories and shows me which files are newer / older / added / deleted. Every day I find myself copying folders of source code between my two desktop machines and the server, and such a utility is critical to avoid overwriting...
First off: I'm using a rather obscure implementation of javascript embedded as a scripting engine for Adobe InDesign CS3. This implementation sometimes diverges from "standard" javascript, hence my problem.
I'm using John Resig's jsdiff library (source here) to compare selections of text between two documents. jsdiff uses vanilla object...
I can't find any good ones in sourceforge :( Anyone has any success with open source (or retail) database diff tool?
EDIT: for sqlserver 2005
...
I'm trying to identify differences between a base case and supplied case. Looking for a library to tell me similarity in percentage or something like that.
For Example:
I've 10 different HTML pages.
* All of them are 404 responses with only one 2 lines of random code (such as time or quote of the day).
Now when I supply a new 404 pag...
Are there any libraries (3rd party or built-in) in PHP to calculate text diffs?
...
I really like Araxis Merge for a graphical DIFF program for the PC. I have no idea
what's available for linux, though. We're running SUSE linix on our z800 mainframe.
I'd be most grateful if I could get a few pointers to what programs everyone else likes.
Thanks, Scott
...
We have a lot of spreadsheets (xls) in our subversion repository. These are usually edited with gnumeric or openoffice.org, and are mostly used to populate databases for unit testing with dbUnit. There are no easy ways of doing diffs on xls files that I know of, and this makes merging extremely tedious and error prone.
I've found Spre...
Let me start off with a bit of background.
This morning one of our users reported that Testuff's setup file has been reported as infected with a virus by the CA antivirus. Confident that this was a false positive, I looked on the web and found that users of another program (SpyBot) have reported the same problem.
A now, for the actual ...
Before moving on to use SVN, I used to manage my project by simply keeping a /develop/ directory and editing and testing files there, then moving them to the /main/ directory. When I decided to move to SVN, I needed to be sure that the directories were indeed in sync.
So, what is a good way to write a shell script [ bash ] to recursivel...
I've got two versions of a project with a JUnit test. Looking at the diff of the files there's no obvious reason to this, but the test passes on one project but not on the other.
For a while now I've wondered if there was a program that would record a trace through the whole program when it runs and save it to a file. This would be real...
My project is currently using a svn repository which gains several hundred new revisions per day.
The repository resides on a Win2k3-server and is served through Apache/mod_dav_svn.
I now fear that over time the performance will degrade due to too many revisions.
Is this fear reasonable?
We are already planning to upgrade to 1.5, so h...
I'm running git-diff on a file, but the change is at the end of a long line.
If I use cursor keys to move right it loses colour coding and worse the lines don't line up, making it harder to track the change.
Is there a way to prevent that problem, or to simply make the lines wrap instead?
(running git 1.5.5 via mingw32)
...
Requirements:
free, preferably open-source
implemented in one of the .NET managed langs
Google found these:
A Generic, Reusable Diff
Algorithm on codeproject
An O(ND) Difference Algorithm for C#
Diff/Merge/Patch Library for C#/.NET by Joshua Tauberer
EDIT:
No apps please, only libraries.
...
In continuation of the my previous question, are the any good controls for text diffs visualization?
Something like StackOverflow's revision diff viewer but for WinForms or WPF
Requirements:
free, preferably open-source
based on WPF or WinForms
No apps please, only components.
I'm not interested in OSS diff tools
...
I have a class that processes a 2 xml files and produces a text file.
I would like to write a bunch of unit / integration tests that can individually pass or fail for this class that do the following:
For input A and B, generate the output.
Compare the contents of the generated file to the contents expected output
When the actual co...
Hi, I need an algorithm that can compare two text files and highlight their difference and ( even better!) can compute their difference in a meaningful way ( meaning, two similar files should have a similarity score higher than two dissimilar files, with the word "similar" defined in the normal terms). It sounds easy to implement, but i...
When using mercurial, I'd like to be able to diff the working copy of a file with the tip file in my default remote repository. Is there an easy way to do this?
I know I can do an "hg incoming -p" to see the patch sets of changes coming in, but it'd be nice to just directly see the actual changes for a particular file that I'd get if I...