Binary diff algorithm
I am looking for text and binary diff algorithms. It would be helpful if you point me to some sources - how to implement a one (algorithm), COTS components (with redistribution license) or source code.
I am looking for a recommendation - a human recommendation :)
Uh, I thought google gave human recommendations?
can't go wrong with this:
Check windiff on Sourceforge.
The problem is with what you consider trivial differences. Do you consider tabs to be the same as 4 spaces or not? Do you consider unix end of line characters to be the same as windows or mac ones?
Binary diff are text diff are usually considered to be entirely different problems.
For text diff, the standard is the Levenstein/Levenshtein edit distance algorithm. It's easy to implement, and there are many existing implementations. Here's one discussion, with examples in Java, C++ and VB:
For text, there's also the ancient (but simple) "longest common subsequence" algorithm.
J. Random Hacker
Google for "edit distance".
If you invent a new binary diff algorithm, will you please name it "Biff" ? Fans of "Back to the Future" will thank you...
The version control system Subversion claims to have a diff algorithm that is working equally on text and binary data.
I am going through vcdiff.I am struggling to make it in windows.Since i am having lack of support in windows.And i want to know whether vcdiff is the exact one for binary differencing?Whether it will worked out or not ?
Fog Creek Home