views:

286

answers:

3

Suppose I have two lists of numbers in files f1 and f2, one number per line. I want to see how many numbers in the first list are not in the second, and vice versa. Currently I am using grep -f f2 -v f1 and then repeating it in the other direction with a shell script. This is pretty slow (quadratic time hurts). Is there a nicer way of doing this?
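(For reference, the current approach amounts to something like the following. This is only a sketch; the -F and -x flags are an assumption here, added so that a number such as 1 is not also counted as a substring match inside 10.)

grep -v -F -x -f f2 f1 | wc -l    # numbers in f1 that are not in f2
grep -v -F -x -f f1 f2 | wc -l    # numbers in f2 that are not in f1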

+2  A: 

Couldn't you just put each number on its own line and then diff(1) them? You might need to sort the lists beforehand for that to work properly, though.

Joey
Will that actually provide counts?
Casebash
Not as such, but you can get that with `grep`/`wc` afterwards. This was just a suggestion on how to improve the quadratic runtime. You will get a more or less readable (depending on the options to `diff`) list of differences, which you can then simply count.
Joey
Okay, will have to play around with this
Casebash
In diff's output, < marks values that are in the first file but not the second, and > marks values in the second file but not the first. A simple grep and wc should provide the desired counts.
Casebash
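Putting the comments together, a minimal sketch of the diff-based counting (this assumes the lists are sorted first, and relies on diff's normal output format, where lines only in the first file are prefixed with < and lines only in the second with >):

sort f1 > f1.sorted
sort f2 > f2.sorted
diff f1.sorted f2.sorted | grep -c '^<'    # count of numbers only in f1
diff f1.sorted f2.sorted | grep -c '^>'    # count of numbers only in f2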
A: 

In the special case where one file is a subset of the other, the following:

cat f1 f2 | sort | uniq -u

would list the lines only in the larger file. And of course piping to wc -l will show the count.
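For example, with two small hypothetical files where f2 is a subset of f1:

$ cat f1
1
2
3
$ cat f2
1
3
$ cat f1 f2 | sort | uniq -u
2
$ cat f1 f2 | sort | uniq -u | wc -l
1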

However, that isn't exactly what you described.

This one-liner serves my particular needs often, but I'd love to see a more general solution.

pavium
+2  A: 

I like `comm` for this sort of thing (the files need to be sorted).

$ cat f1
1
2
3
$ cat f2
1
4
5
$ comm f1 f2
		1
2
3
	4
	5
$ comm -12 f1 f2
1
$ comm -23 f1 f2
2
3
$ comm -13 f1 f2
4
5
$
Stephen Paul Lesniewski
For numerically sorted data it complained that the input wasn't in sorted order; --nocheck-order will suppress the warning.
Casebash
Again, a simple grep and wc can be used to get the actual counts.
Casebash
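Tying the comments together, the counts can be read straight off comm's single-column outputs (a sketch; the process substitution assumes bash, and --nocheck-order only silences the warning rather than fixing unsorted input):

comm -23 f1 f2 | wc -l    # numbers only in f1
comm -13 f1 f2 | wc -l    # numbers only in f2
# if the files are in numeric rather than lexicographic order, re-sort first:
comm -23 <(sort f1) <(sort f2) | wc -l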