views:

887

answers:

4

I have a Scala data processing tool which is failing with a java.lang.OutOfMemoryError exception. The tool needs to make a couple passes over a large data file (the one I'm working on is over 700MB), so it would be convenient if the entire thing could be stored in memory.

I run the tool from the command line or from a Bash script using the "scala" runner. How do I increase the JVM heap size for this? I've tried passing -Xmx1024m, but it does not recognize this argument. I'm using a nightly build of Scala 2.8.0 (r18678).

+1  A: 

As a hack you can edit the scala/bin/scala file to edit the parameters for the java command. Not pretty.

Joe
That... could work.
Jay Conrod
Yes but I'm waiting for someone else to answer. I want to know too...
Joe
+2  A: 

Edit JAVA_OPTS.

Daniel
+8  A: 

Based on Joe's suggestion, I looked at the scala program, which is a Bash script that calls java with various Scala specific arguments. It allows extra command line arguments to be passed to Java through the JAVA_OPTS environment variable. So you can increase the heap size like this:

JAVA_OPTS="-Xmx2g" scala classname arguments...
Jay Conrod
+5  A: 

The JAVA_OPTS environment variable is used to specify the options passed to the java command. See the Scala manpage for details.

Synesso