Not sure if this would be better suited for ServerFault, but since I am not an admin but a developer I figured I would try SO.

We've been struggling to keep our multi-server configuration stable for quite some time now. At the end of last month we were running CF 7.0.2 on a two-server setup (one instance each). At that point we managed to get our uptime to around one week per instance before they would restart by themselves. At the beginning of this month we upgraded to CF 9, and we're back to square one with multiple restarts a day.

Our current configuration is 2 Win2k3 servers, running a cluster of 4 instances, 2 instances per server. At this point we are pretty certain this is due to improper JVM settings.

We've been toying with them, and while some settings are more stable than others, we never quite got it right.

From the default:

java.args=-server -Xmx512m -Dsun.io.useCanonCaches=false -XX:MaxPermSize=192m -XX:+UseParallelGC -Dcoldfusion.rootDir={application.home}/

To currently:

java.args=-server -Xmx896m -Dsun.io.useCanonCaches=false -XX:MaxPermSize=512m -XX:SurvivorRatio=8 -XX:TargetSurvivorRatio=90 -XX:+UseParallelGC -Dcoldfusion.rootDir={application.home}/ -verbose:gc -Xloggc:c:/Jrun4/logs/gc/gcInstance1b.log

We have determined that we do need more than the default 512 MB simply by monitoring with FusionReactor: on average our memory consumption hovers in the mid-300 MB range and can go up to the low 700 MB range under heavy load.
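(For anyone who wants to sanity-check the same numbers without FusionReactor, a plain-Java sketch against java.lang.management reports the same pools, including the perm gen one. This is illustrative only, not something we run in production, and the class name is made up.)

    import java.lang.management.ManagementFactory;
    import java.lang.management.MemoryPoolMXBean;
    import java.lang.management.MemoryUsage;

    public class MemoryProbe {
        public static void main(String[] args) {
            // Overall heap numbers (used / committed / max), same idea as the hs_err summary.
            MemoryUsage heap = ManagementFactory.getMemoryMXBean().getHeapMemoryUsage();
            System.out.println("heap: " + toMB(heap));
            // Per-pool numbers; with the parallel collector one of the pools is "PS Perm Gen".
            for (MemoryPoolMXBean pool : ManagementFactory.getMemoryPoolMXBeans()) {
                System.out.println(pool.getName() + ": " + toMB(pool.getUsage()));
            }
        }

        private static String toMB(MemoryUsage u) {
            // max can be -1 (undefined) for some pools.
            return (u.getUsed() >> 20) + "MB used, " + (u.getCommitted() >> 20)
                    + "MB committed, " + (u.getMax() >> 20) + "MB max";
        }
    }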

Most of the crashes are logged in jrun4/bin/hs_err_pid*.log, always with an "Out of swap space" error.

I've attached links to the hs_err and garbage collector log files from yesterday at the bottom of the post.

The relevant part is (I think) this:

Heap
 PSYoungGen      total 89856K, used 19025K [0x55490000, 0x5b6f0000, 0x5b810000)
  eden space 79232K, 16% used [0x55490000,0x561a64c0,0x5a1f0000)
  from space 10624K, 52% used [0x5ac90000,0x5b20e2f8,0x5b6f0000)
  to   space 10752K, 0% used [0x5a1f0000,0x5a1f0000,0x5ac70000)
 PSOldGen        total 460416K, used 308422K [0x23810000, 0x3f9b0000, 0x55490000)
  object space 460416K, 66% used [0x23810000,0x36541bb8,0x3f9b0000)
 PSPermGen       total 107520K, used 106079K [0x03810000, 0x0a110000, 0x23810000)
  object space 107520K, 98% used [0x03810000,0x09fa7e40,0x0a110000)

From it, I gather that it's the PSPermGen that is full (most logs show the same before a crash), which is why we increased MaxPermSize, but the total still shows as 107520K!?

No one here is a JRun expert, so any help or even ideas on what to try next would be greatly appreciated!

The log files: sorry, I know Sendspace isn't the friendliest of places - if you have other host suggestions for log files, let me know and I'll update the post (SO doesn't like them inline; it blows up the formatting of the post).

+2  A: 

This is an effect that could have many causes -- anything from the way your application is constructed (unconventional usage of application or server scope? Bad database drivers and connection management? Parsing giant XML files? Use of CFHTTP or other external resources? Problems with native session replication?) to your coding practices (var scoping everywhere?) to the kinds of CPUs in your servers. It's not likely you'll come up with some magic bullet JVM settings without much analysis (and perhaps not even then). But for starters, why do you have such an unusually large PermGen? Seems like a peculiar pattern, but of course I don't know anything about your app.

It seems you have little to lose by trying some different garbage collectors. If appropriate to your JVM version, try:

-XX:+UseConcMarkSweepGC -XX:+UseParNewGC 

and add in:

-XX:+CMSPermGenSweepingEnabled -XX:+CMSClassUnloadingEnabled

which may help manage your large PermGen. Don't forget to take out -XX:+UseParallelGC if you try these.
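For example, the combined line might look something like this (just a sketch based on your current settings with the collector swapped; the sizes are yours, and you should verify each flag against your exact JVM version):

java.args=-server -Xmx896m -Dsun.io.useCanonCaches=false -XX:MaxPermSize=512m -XX:SurvivorRatio=8 -XX:TargetSurvivorRatio=90 -XX:+UseConcMarkSweepGC -XX:+UseParNewGC -XX:+CMSPermGenSweepingEnabled -XX:+CMSClassUnloadingEnabled -Dcoldfusion.rootDir={application.home}/ -verbose:gc -Xloggc:c:/Jrun4/logs/gc/gcInstance1b.log

It's also worth confirming that the larger MaxPermSize is actually being picked up: jstat -gccapacity <jrun pid> reports the perm gen ceiling in the PGCMX column.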

Ken Redler
We did see errors with session replication where the memory usage would shoot up; we disabled it on the cluster while troubleshooting the other issue. I'll try the other GCs and see if they're better. We increased MaxPermSize after seeing it was full in all the hs_err_*.log files --- PSPermGen total 107520K, used 106079K [0x03810000, 0x0a110000, 0x23810000) object space 107520K, 98% used [0x03810000,0x09fa7e40,0x0a110000)
jfrobishow
My comment about the big PermGen doesn't mean it's wrong -- just that it's not a pattern I've commonly seen. Good luck!
Ken Redler
+1  A: 

A little update: I've tried different GCs, and while some stabilized the system for a while, it kept crashing, only less frequently. So I kept digging and eventually found out that the JVM will report "Out of swap space" when the OS itself refuses to allocate the memory requested.

This usually happens when the maximum memory is already assigned to the JVM process: the JRun overhead, the JVM itself, all the libraries, the heap AND the stacks. Since each request lives on a thread's stack, if you have a lot of requests being spawned the total stack space will grow and grow. The stack size of each thread varies according to the OS and the version of the JVM, but it can be controlled using the -Xss argument. I reduced ours to 64k, so our java.args looks like this:

java.args=-server -Xmx768m -Xss64k -Dsun.io.useCanonCaches=false -XX:MaxPermSize=512m -XX:+UseParallelGC -Dcoldfusion.rootDir={application.home}/ -verbose:gc -Xloggc:c:/Jrun4/logs/gc/gcInstance2a.log
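To put rough numbers on it (back-of-the-envelope guesses, not measurements): a 32-bit process on Win2k3 normally gets about 2 GB of user address space. With -Xmx768m plus -XX:MaxPermSize=512m the JVM reserves roughly 1.3 GB up front, and JRun, the JVM code, the JDBC drivers and other native libraries take their share on top of that. Whatever is left has to hold the thread stacks: 300 request threads at, say, a 512 KB default stack would need about 150 MB, while the same 300 threads at -Xss64k need under 20 MB, which is where the extra headroom comes from.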

So far everything has been stable without any noticeable slowdown for 6 days, which is definitely the longest I've ever seen the application stay up. If you reduce the stack size too much, you'll start noticing stack overflow errors in the logs instead of the OOM errors.
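To illustrate that trade-off outside of JRun (a standalone sketch, class name made up; the per-thread stack size passed to Thread is only a hint and is platform-dependent):

    public class StackDepthDemo {
        static int depth;

        static void recurse() {
            depth++;
            recurse(); // recurse until the thread's stack is exhausted
        }

        public static void main(String[] args) throws InterruptedException {
            // Compare how deep a thread can recurse with a small vs. a larger stack.
            for (final long stackBytes : new long[] { 64 * 1024, 512 * 1024 }) {
                depth = 0;
                Thread t = new Thread(null, new Runnable() {
                    public void run() {
                        try {
                            recurse();
                        } catch (StackOverflowError expected) {
                            // the smaller the stack, the sooner this happens
                        }
                    }
                }, "stack-demo", stackBytes);
                t.start();
                t.join();
                System.out.println((stackBytes / 1024) + "k stack -> depth " + depth);
            }
        }
    }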

My next step will be to tweak the MaxPermSize but so far so good!

jfrobishow