diff

i need to write code in python for comparing text of two documents uning fingerprint techniques

Hello I need to write code in python language for comparing the text of document uning fingerprint techniques. I do not know to take fingerprint of a document or to generate fingerprint of a document. Please if anyone know the method or have a source code for generating fingerprints of documents which is stored in bits form. please guid...

versioned rails db and differ

I'm wondering if there's an integrated solution to have a database with versioned records supported by rails (ala version_fu ar_versioned) and a differ thanks! ...

Algorithm to detect similar documents in python script

i need to write a module to detect similar documents. i have read many papers of fingerprints of documents technique and others but i do not know how to write code or implement. the algorithm should work for Chinese, Japanese, English and German language or language independent. can you help me. or sending me code for this problem i am ...

Incremental File Copy Tool and NIH

For years, one of my most important tools has been incremental copy utility that compares the contents of two directories and shows me which files are newer / older / added / deleted. Every day I find myself copying folders of source code between my two desktop machines and the server, and such a utility is critical to avoid overwriting...

What is a javascript hash table implementation that avoids object namespace collisions?

First off: I'm using a rather obscure implementation of javascript embedded as a scripting engine for Adobe InDesign CS3. This implementation sometimes diverges from "standard" javascript, hence my problem. I'm using John Resig's jsdiff library (source here) to compare selections of text between two documents. jsdiff uses vanilla object...

Anyone know of any good Database Diff tools?

I can't find any good ones in sourceforge :( Anyone has any success with open source (or retail) database diff tool? EDIT: for sqlserver 2005 ...

Identifying if 2 HTML pages are similar

I'm trying to identify differences between a base case and supplied case. Looking for a library to tell me similarity in percentage or something like that. For Example: I've 10 different HTML pages. * All of them are 404 responses with only one 2 lines of random code (such as time or quote of the day). Now when I supply a new 404 pag...

Calculate text diffs in PHP

Are there any libraries (3rd party or built-in) in PHP to calculate text diffs? ...

Graphical DIFF programs for linux

I really like Araxis Merge for a graphical DIFF program for the PC. I have no idea what's available for linux, though. We're running SUSE linix on our z800 mainframe. I'd be most grateful if I could get a few pointers to what programs everyone else likes. Thanks, Scott ...

How do I diff two spreadsheets?

We have a lot of spreadsheets (xls) in our subversion repository. These are usually edited with gnumeric or openoffice.org, and are mostly used to populate databases for unit testing with dbUnit. There are no easy ways of doing diffs on xls files that I know of, and this makes merging extremely tedious and error prone. I've found Spre...

Find matching sequences in two binary files

Let me start off with a bit of background. This morning one of our users reported that Testuff's setup file has been reported as infected with a virus by the CA antivirus. Confident that this was a false positive, I looked on the web and found that users of another program (SpyBot) have reported the same problem. A now, for the actual ...

How to compare files with same names in two different directories using a shell script

Before moving on to use SVN, I used to manage my project by simply keeping a /develop/ directory and editing and testing files there, then moving them to the /main/ directory. When I decided to move to SVN, I needed to be sure that the directories were indeed in sync. So, what is a good way to write a shell script [ bash ] to recursivel...

Java: Eclipse - Diff Trace

I've got two versions of a project with a JUnit test. Looking at the diff of the files there's no obvious reason to this, but the test passes on one project but not on the other. For a while now I've wondered if there was a program that would record a trace through the whole program when it runs and save it to a file. This would be real...

SVN performance after many revisions

My project is currently using a svn repository which gains several hundred new revisions per day. The repository resides on a Win2k3-server and is served through Apache/mod_dav_svn. I now fear that over time the performance will degrade due to too many revisions. Is this fear reasonable? We are already planning to upgrade to 1.5, so h...

git diff - handling long lines?

I'm running git-diff on a file, but the change is at the end of a long line. If I use cursor keys to move right it loses colour coding and worse the lines don't line up, making it harder to track the change. Is there a way to prevent that problem, or to simply make the lines wrap instead? (running git 1.5.5 via mingw32) ...

Any decent text diff/merge engine for .NET ?

Requirements: free, preferably open-source implemented in one of the .NET managed langs Google found these: A Generic, Reusable Diff Algorithm on codeproject An O(ND) Difference Algorithm for C# Diff/Merge/Patch Library for C#/.NET by Joshua Tauberer EDIT: No apps please, only libraries. ...

Text diff visualization control for WinForms or WPF

In continuation of the my previous question, are the any good controls for text diffs visualization? Something like StackOverflow's revision diff viewer but for WinForms or WPF Requirements: free, preferably open-source based on WPF or WinForms No apps please, only components. I'm not interested in OSS diff tools ...

Unit Tests for comparing text files in NUnit

I have a class that processes a 2 xml files and produces a text file. I would like to write a bunch of unit / integration tests that can individually pass or fail for this class that do the following: For input A and B, generate the output. Compare the contents of the generated file to the contents expected output When the actual co...

Text difference algorithm

Hi, I need an algorithm that can compare two text files and highlight their difference and ( even better!) can compute their difference in a meaningful way ( meaning, two similar files should have a similarity score higher than two dissimilar files, with the word "similar" defined in the normal terms). It sounds easy to implement, but i...

Using Mercurial, is there an easy way to diff my working copy with the tip file in the default remote repository

When using mercurial, I'd like to be able to diff the working copy of a file with the tip file in my default remote repository. Is there an easy way to do this? I know I can do an "hg incoming -p" to see the patch sets of changes coming in, but it'd be nice to just directly see the actual changes for a particular file that I'd get if I...