ansaurus

Question

Should I be comparing bytes using struct?

Answer 1

+4 A:

Did you try difflib and filecmp modules?

This module provides classes and functions for comparing sequences. It can be used for example, for comparing files, and can produce difference information in various formats, including HTML and context and unified diffs. For comparing directories and files, see also, the filecmp module.

The filecmp module defines functions to compare files and directories, with various optional time/correctness trade-offs. For comparing files, see also the difflib module

.

Ashish 2010-08-14 17:57:55

Did you try it?

Ashish 2010-08-17 10:06:22

Answer 2

A:

You are probably encountering encoding/decoding problems. Someone may suggest a better solution, but you could try reading the file into a bytearray so you're reading raw bytes instead of decoded characters:

Here's a crude example:

$ od -Ax -tx1 /tmp/aa
000000 e0 b2 aa 0a
$ od -Ax -tx1 /tmp/bb
000000 e0 b2 bb 0a

$ cat /tmp/diff.py 
a = bytearray(open('/tmp/aa', 'rb').read())
b = bytearray(open('/tmp/bb', 'rb').read())
print "%02x, %02x" % (a[2], a[3])
print "%02x, %02x" % (b[2], b[3])

$ python /tmp/diff.py 
aa, 0a
bb, 0a

bstpierre 2010-08-15 03:37:13

ansaurus

tags:

views:

answers:

Should I be comparing bytes using struct?

related questions