ansaurus

Question

Splitting gzipped logfiles without storing the ungzipped splits on disk.

Answer 1

+1 A:

There's zipsplit, but that uses the zip algorithm as opposed to the gzip algorithm.

Tony Miller 2010-10-18 15:47:48

Answer 2

+3 A:

A script like the following might suffice.

#!/usr/bin/perl
use PerlIO::gzip;

$filename = 'out';
$limit = 500000;

$fileno = 1;
$line = 0;

while (<>) {
    if (!$fh || $line >= $limit) { 
        open $fh, '>:gzip', "$filename_$fileno"; 
        $fileno++;
        $line = 0; 
    }
    print $fh $_; $line++;
}

ar 2010-10-18 15:51:54

Thanks, your quick example helped me a lot. With two minor fixes (first line must start with #!/ and after the $fileno++ an additional $line=0 is needed) it worked good enough for my purposes.

Niels Basjes 2010-10-20 08:56:45

Thanks. I've added those to the script for completeness.

ar 2010-10-20 09:59:32

ansaurus

tags:

views:

answers:

Splitting gzipped logfiles without storing the ungzipped splits on disk.

related questions