ansaurus

Question

What is the buffer size to create a .zip archive using Java?

Answer 1

A:

Depends on the hardware you have (disk speed and file search time). I would say if you are not interested in squeezing the last drop of performance pick any size between 4k and 64k. Since it is a short-lived object it will be collected quickly anyway.

ddimitrov 2008-10-14 13:08:45

Answer 2

+2 A:

Short answer: I would pick something like 16k.

Long answer:

ZIP is using the DEFLATE algorithm for compression (http://en.wikipedia.org/wiki/DEFLATE). Deflate is a flavor of Ziv Lempel Welch(search wikipedia for LZW). DEFLATE uses LZ77 and Huffman coding.

This is a dictionary compression, and as far as I know from the algorithm standpoint the buffer size used when feeding the data into the deflater should have almost no impact. The biggest impact for LZ77 are dictionary size and sliding window, which are not controlled by the buffer size in your example.

I think you can experiment with different buffer sizes if you want and plot a graph, but I am sure you will not see any significant changes in compression ratio (3/80000 = 0.00375%).

The biggest impact the buffer size has is on the speed due to the amount of overhead code that is executed when you make the calls to FileInputStream.read and zos.write. From this point of view you should take into account what you gain and what you spend.

When increasing from 1 byte to 1024 bytes, you lose 1023 bytes (in theory) and you gain a ~1024 reduction of the overhead time in the .read and .write methods. However when increasing from 1k to 64k, you are spending 63k which reducing the overhead 64 times.

So this comes with diminishing returns, thus I would choose somewhere in the middle (let's say 16k) and stick with that.

Dan Cristoloveanu 2008-10-14 14:22:23

I accept this answer because it shows that the buffer size don't affect significatively the result size but the dictionary size and sliding window

Telcontar 2008-10-14 15:16:34

Answer 3

A:

I try do download a zip archive. Original size is 88k, downloaded size is 4k. Anyone can help?

Dr.Vet. Cumpanasu Florin 2010-07-30 17:29:07

ansaurus

tags:

views:

answers:

What is the buffer size to create a .zip archive using Java?

related questions