I have seen some clusters with more than one tomcat running per machine. The main argument is that the garbage collector is slower on large heaps, so it would better to have 4 tomcats with 1G heap than 1 tomcat with a 4G heap.
Have anyone ever profiled that? what do you think?