views:

579

answers:

2

Hi, I wish to compress my application's network traffic.

According to the (latest?) "Haskell Popularity Rankings", zlib seems to be a pretty popular solution. zlib's interface uses ByteStrings:

compress :: ByteString -> ByteString
decompress :: ByteString -> ByteString

I am using regular Strings, which are also the data types used by read, show, and Network.Socket:

sendTo :: Socket -> String -> SockAddr -> IO Int
recvFrom :: Socket -> Int -> IO (String, Int, SockAddr)

So to compress my strings, I need some way to convert a String to a ByteString and vice-versa. With hoogle's help, I found:

Data.ByteString.Char8 pack :: String -> ByteString

Trying to use it:

Prelude Codec.Compression.Zlib Data.ByteString.Char8> compress (pack "boo")

<interactive>:1:10:
    Couldn't match expected type `Data.ByteString.Lazy.Internal.ByteString'
           against inferred type `ByteString'
    In the first argument of `compress', namely `(pack "boo")'
    In the expression: compress (pack "boo")
In the definition of `it': it = compress (pack "boo")

Fails, because (?) there are different types of ByteString ?

So basically:

  • Are there several types of ByteString? What types, and why?
  • What's "the" way to convert Strings to ByteStrings?

Btw, I found that it does work with Data.ByteString.Lazy.Char8's ByteString, but I'm still intrigued.

+3  A: 

A more efficient mechanism might be to switch to a full bytestring-based layer:

  • network.bytestring for bytestring sockets
  • lazy bytestrings for compressoin
  • binary of bytestring-show to replace Show/Read
Don Stewart
@dons: it appears that I will need to use Network.Socket.ByteString.Lazy for compatibility with zlib, but unlike Network.Socket.ByteString, it doesn't implement sendTo and recvFrom, making it unsuitable for me.
yairchu
I don't think it will be too hard for you to stream chunks of lazy bytestrings into a socket taking individual ones. It's essentially mapM_ send . fromChunks
Don Stewart
@dons: when using UDP it will actually make a difference if you send your packet in whole or in chunks. but are you also saying that conversion between lazy and strict bytestring is easy? that would be useful to know..
yairchu
+3  A: 

There are two kinds of bytestrings: strict (defined in Data.Bytestring.Internal) and lazy (defined in Data.Bytestring.Lazy.Internal). zlib uses lazy bytestrings, as you've discovered.

Alexey Romanov
@Alexey Romanov: is there also a Char8/Word8 distinction?
yairchu
No. `Char8` modules just give a different interface to the same bytestrings.
Alexey Romanov