ansaurus

Question

making project on double data compression..please help...

Answer 1

A:

In Ruby you could use the "rubyzip" gem, which does provide lossless compression.

Ethan 2009-07-14 19:51:28

Answer 2

A:

In Python, here's a way to do "double data compression" by compressing a compressed string:

from zlib import compress
data_to_compress = 'double double, toil and trouble'
doublely_compressed_data = compress(compress(data_to_compress))
# but why stop there? why not have triple data compression too?
triplely_compressed_data = compress(compress(compress(data_to_compress)))

Of couse, this can be extended to "n" data compression:

from zlib import compress
def n_compress(data, n):
   for _ in range(n):
      data = compress(data)
   return data

[tongue firmly in cheek]

mhawke 2009-07-15 01:37:19

Answer 3

+2 A:

avinash, why do you want to do double data compression? This will not result in better compression; in fact, it will quite possibly be worse than single compression.

Imagine if you could keep compressing infinite times, and get a smaller file / string each time. Eventually, you would end up with a file / string with a length of 1 byte (or possibly 1 bit). But wait - doesn't that mean that all files are then exactly the same?

Instead, there is a limit as to how much you can losslessly compress data. This is known as the entropy of the data. Note that the entropy of a file / string is ideal - in practice, compression algorithms can not reach this level of compression (usually). This is simply because it would take too long to calculate the best tree to use, so they use a greedy algorithm.

You can calculate the entropy of a string if you know the frequency of each character. The formula for this is
Entropy formula

a_m0d 2009-07-21 03:47:02

ansaurus

tags:

views:

answers:

making project on double data compression..please help...

related questions