ansaurus

Question

find results piped to zcat and then to head

Answer 1

A:

zcat -r * 2>/dev/null | awk -vRS= -vFS="\n" '{print $1}'

ghostdog74 2010-07-27 02:30:18

Answer 2

+1 A:

It worked as you asked it to.

head did its job, printed one line, and exited. zcat, still running under the auspices of xargs, then tried to write to the closed pipe and received a fatal SIGPIPE for its efforts. Its child having died, xargs reported the whyfor.
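To see that sequence in isolation, here is a minimal sketch (POSIX only). The `WRITER` child is a hypothetical stand-in for zcat, not anything from the question: it writes lines forever, and the parent plays head by reading one line and hanging up. The function name is made up for illustration.

```python
import signal
import subprocess
import sys

# Hypothetical stand-in for zcat: restore the default SIGPIPE action
# (Python ignores SIGPIPE at startup) and write lines endlessly.
WRITER = (
    "import signal, sys\n"
    "signal.signal(signal.SIGPIPE, signal.SIG_DFL)\n"
    "while True:\n"
    "    sys.stdout.write('line\\n')\n"
)


def first_line_then_hang_up():
    """Read one line from the writer, close the pipe, return (line, exit status)."""
    child = subprocess.Popen([sys.executable, "-c", WRITER],
                             stdout=subprocess.PIPE)
    first = child.stdout.readline()  # play the part of head -n 1
    child.stdout.close()             # "head" exits: the read end goes away
    status = child.wait()            # the writer's next write raises SIGPIPE
    return first, status
```

On a POSIX system the returned status should be the negative of `signal.SIGPIPE`, which is how `subprocess` reports death by signal; that is the same fate xargs was reporting for zcat.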

To get the desired behaviour, you'd need a find -exec ... construction or a custom zhead to give to xargs.

added junk code I found behind the fridge:

#!/usr/bin/env python3

"""zhead - poor man's zcat file... | head -n
   no argument error checking, prefers to continue in the face of
   IO errors, with diagnostic to stderr

   sample usage: find ... | xargs zhead.py -1"""

import gzip
import sys

# An optional leading -N sets the line count, as with head; default is 10.
if sys.argv[1].startswith('-'):
    nlines = int(sys.argv[1][1:])
    start = 2
else:
    nlines = 10
    start = 1

for zfile in sys.argv[start:]:
    try:
        # Binary mode: bytes are passed straight through, so odd
        # encodings cannot raise decode errors.
        with gzip.open(zfile, 'rb') as zin:
            for i in range(nlines):
                line = zin.readline()
                if not line:
                    break
                sys.stdout.buffer.write(line)
    except Exception as err:
        # Report the failure and carry on with the next file.
        print(zfile, err, file=sys.stderr)

It processed 10k files in /usr/share/man in about a minute.

msw 2010-07-27 03:21:50

Good explanation. I wish I could upvote you; I'll be back when I have reached 15 rep.

furedde 2010-07-27 04:56:41

Glad to be of help. Don't worry about the vote, that's not why I do it (and Dennis Williamson got my vote because it was better).

msw 2010-07-27 05:09:21

Answer 3

+1 A:

You should find that this will work:

find . -name "*.gz" | while read -r file; do zcat -f "$file" | head -n 1; done
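For comparison, a rough Python sketch of the same idea: walk the tree, open each .gz file on its own, and stop after the first line, so nothing is ever left writing into a closed pipe. The name `first_lines` and its parameters are made up for illustration, and skipping unreadable files with a note on stderr is an assumption, not part of the one-liner above.

```python
import gzip
import os
import sys


def first_lines(root, suffix=".gz"):
    """Yield (path, first_line_as_bytes) for each gzip file under root."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            if not name.endswith(suffix):
                continue
            path = os.path.join(dirpath, name)
            try:
                # Each file gets its own reader, so stopping after one
                # line affects only that file.
                with gzip.open(path, "rb") as zin:
                    yield path, zin.readline()
            except (OSError, EOFError) as err:
                # Unreadable or corrupt file: report it and keep going.
                print(path, err, file=sys.stderr)
```

Usage would be along the lines of `for path, line in first_lines("."): sys.stdout.buffer.write(line)`.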

Dennis Williamson 2010-07-27 03:40:41

Worked flawlessly, thank you. I didn't know you could use while and read like that; I'll remember it.

furedde 2010-07-27 04:52:34

Answer 4

A:

If you have GNU Parallel http://www.gnu.org/software/parallel/ installed:

find . -name '*.gz' | parallel 'zcat {} | head -n1'
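If GNU Parallel is not available, a loosely similar effect can be sketched in Python with a thread pool. This is only an illustration of reading several files concurrently, not a description of how parallel works; `first_line`, `first_lines_parallel`, and the `workers` default are hypothetical names and values.

```python
import gzip
from concurrent.futures import ThreadPoolExecutor


def first_line(path):
    """Return the first line of one gzip file as bytes."""
    with gzip.open(path, "rb") as zin:
        return zin.readline()


def first_lines_parallel(paths, workers=4):
    """Read the first line of each file concurrently.

    Executor.map returns results in the same order as the input paths.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(first_line, paths))
```

Note that an unreadable file raises here instead of being skipped; wrap `first_line` in a try/except if the run should continue past bad files.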

Watch the intro video to GNU Parallel at http://www.youtube.com/watch?v=OpaiGYxkSuQ

Ole Tange 2010-08-09 18:48:54
