ansaurus

Question

How to reduce memory usage in a Haskell app?

Answer 1

+2 A:

Does forcing "un-laziness" using $! help? as per this answer.

John Mulder 2009-01-20 06:08:13

Thank you! I tried to evaluate strictly `u`, an argument of `stepEuler`, and all the arguments in `applyBC` (see http://pastebin.com/f56a2079d), but effect is the opposite: the program allocates almost 50% more (according to valgrind), and runs almost 50% longer.

jetxee 2009-01-20 10:54:24

Answer 2

+1 A:

Per Harleqin's request: Have you tried setting optimization flags? For example, with ghc, you can use add "-O2" just like you would with gcc. (Although I'm not sure what optimization levels exist in ghc; the man page doesn't exactly say ...)

In my past experience, setting this flag has made a tremendous difference. As far as I can tell, runhugs and unoptimized ghc use the most basic, obvious implementation of Haskell; unfortunately, this sometimes isn't very efficient.

But that's just a guess. As I said in my comment, I hope that someone answers your question well. I often have problems analyzing and fixing Haskell's memory usage.

A. Rex 2009-01-20 06:29:39

Yes, I did compile with optmization flags: `ghc -O2 -c -prof -auto-all -caf-all -fforce-recomp Euler1D.hs ; ghc -O2 -o eulerhs Main.hs Euler1D.o -prof -auto-all -caf-all -fforce-recomp`

jetxee 2009-01-20 09:16:04

Answer 3

A:

One thing that jumped to my eye now is that the Haskell output is a float, while the C output seems to be integer. I have not yet come to grips with Haskell code, but is there perhaps some place where you have floating point arithmetic in Haskell while C uses integers?

Svante 2009-01-20 09:40:45

No, C output is not an integer. Just printf tries to represent the result nicely on print. C arithmetics is entirely in double.

jetxee 2009-01-20 10:41:38

Answer 4

+1 A:

Use switch -fvia-C also.

Hynek -Pichi- Vychodil 2009-01-20 13:27:57

Answer 5

+7 A:

Lists are not the best datastructure for this type of code (with lots of (++), and (last)). You loose a lot of time constucting and deconstructing lists. I'd use Data.Sequence or arrays, as in C versions.
There is no chance for thunks of makeu0 to be garbage-collected, since you need to retain all of them (well, all of the results of "diffuse", to be exact) all the way till the end of computation in order to be able to do "reverse" in applyBC. Which is very expensive thing, considering that you only need two items from the tail of the list for your "zeroflux".

Here is fast hack of you code that tries to achieve better list fusion and does less list (de)constructing:

module Euler1D
( stepEuler
) where

-- impose zero flux condition
zeroflux mu (boundary:inner:xs) = boundary+mu*2*(inner-boundary)

-- one step of integration
stepEuler mu n = (applyBC . (diffused mu)) $ makeu0 n
    where
          diffused mu (left:x:[]) = []    -- ignore outer points
          diffused mu (left:x:right:xs) = -- integrate inner points
                   let y = (x+mu*(left+right-2*x))
                       in y `seq` y : diffused mu (x:right:xs)
          applyBC inner = lbc + sum inner + rbc -- boundary conditions
               where
                     lbc = zeroflux mu ((f 0 n):inner)             -- left boundary
                     rbc = zeroflux mu ((f n n):(take 2 $ reverse inner)) -- right boundary

-- initial condition
makeu0 n = [ f x n | x <- [0..n]]

f x n = ((^2) . sin . (pi*) . xi) x
    where xi x = fromIntegral x / fromIntegral n

For 200000 points, it completes in 0.8 seconds vs 3.8 seconds for initial version

ADEpt 2009-01-22 09:38:18

Thank you very much! This is exactly kind of answer I was looking for. I didn't know about Data.Sequence and arrays, but suspected, that it is possible to avoid lots of list operations. P.S. y `seq` y:… in diffused is a nice pattern… also take's in applyBC… Thank you!

jetxee 2009-01-22 13:53:23

Though, summation in stepEuler changes the original semantics of the function. I restored the original semantics (http://pastebin.com/f5ca77a7f) and it still runs reasonably fast (0.5 seconds for 2e5 points). I'll consider using Data.Sequence now.

jetxee 2009-01-22 13:56:46

Answer 6

+1 A:

More generally, you can find out where your memory is going using GHC's heap profiling tools. In my experience, they won't necessarily tell you why your data is being leaked, but can at least narrow down the potential causes.

You may also find illuminating this excellent blog post by Don Stewart about understanding strictness, how it interacts with garbage collection, and how to diagnose and fix problems.

daf 2009-01-31 15:41:37

Thank you for the links, daf!

jetxee 2009-01-31 16:34:23

The post by Don Stewart is really, really helpful. Thank you very much!

jetxee 2009-02-02 08:20:37

Answer 7

+1 A:

On my 32-bit x86 system, your program uses only about 40 MB of memory.

Are you perhaps confusing the the "total alloc = 116,835,180 bytes" line from your profiling output with how much memory is actually used by the program at any one time? The total alloc is how much memory was allocated over the entire program run; much of this is freed by the garbage collector as you go along. You can expect that number to get very large in a Haskell program; I have programs that allocate many terrabytes of memory over the course of their entire run, though they actually have a maximum virtual memory size of a hundred megabytes or so.

I wouldn't worry too much about large total allocations over the course of a program run; that's the nature of a pure language, and GHC's runtime has a very good garbage collector to help compensate for this.

Curt Sampson 2009-06-06 08:44:15

Yes, exactly! I did understand "total alloc" like a max memory allocation. Now it is more clear to me. Thank you!

jetxee 2009-06-09 07:46:44

ansaurus

tags:

views:

answers:

How to reduce memory usage in a Haskell app?

related questions