We have two Tomcat 6.0.20 servers fronted by Apache, with communication between the two using AJP. Tomcat in turn consumes web services on a JBoss cluster.

This morning, one of the Tomcat machines was using 100% of CPU on 6 of the 8 cores on our machine. We took a heap dump using JConsole, and then tried to connect JVisualVM to get a profile to see what was taking all the CPU, but this caused Tomcat to crash. At least we had the heap dump!

I have loaded the heap dump into Eclipse MAT, where I have found that we have 565 instances of java.lang.Thread. Some of these, obviously, are entirely legitimate, but the vast majority are named "ajp-6009-XXX" where XXX is a number.

I know my way around Eclipse MAT pretty well, but haven't been able to find an explanation for this. If anyone has some pointers as to why Tomcat may be doing this, or some hints on how to track down the cause in Eclipse MAT, that'd be appreciated!

+1  A: 

This isn't a direct answer, I guess, but as a mitigating approach in production you could limit the damage by restricting maxThreads for the AJP connector in your configuration, per http://tomcat.apache.org/tomcat-6.0-doc/config/ajp.html.
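For reference, a minimal sketch of what that might look like in conf/server.xml (port 6009 is only a guess based on the thread names in your dump, and the maxThreads value here is illustrative rather than a recommendation):

    <!-- AJP connector; capping maxThreads bounds how many ajp-6009-* threads can exist -->
    <Connector port="6009" protocol="AJP/1.3" redirectPort="8443" maxThreads="100" />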

The default is 200, which sure is a lot of threads, although that possibly doesn't explain the 565 above. Obviously this has the potential to push the problem elsewhere, but perhaps you'll be better able to debug the problem there, or it will manifest itself in a different way. Is it possible that you're simply under a high amount of load? Is there anything notable in Apache's behaviour in the period leading up to the problem?

Chad
A: 

Impossible to know for sure unless you managed to get a thread dump, but I once experienced a similar problem where all 8 cores were busy at 100% with thousands of threads (it wasn't on Tomcat, however).
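As an aside, if it happens again a thread dump is cheap to take and doesn't need a profiler; assuming a Sun JDK, where <pid> is the Tomcat process id, either of these should work:

    jstack <pid>
    kill -3 <pid>     # SIGQUIT: the JVM prints all thread stacks to stdout, i.e. catalina.out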

In our case, each thread was stuck inside java.util.HashMap in the get() method, spinning tightly in the for loop:

    public V get(Object key) {
        if (key == null)
            return getForNullKey();
        int hash = hash(key.hashCode());
        for (Entry<K,V> e = table[indexFor(hash, table.length)];
             e != null;
             e = e.next) {
            Object k;
            if (e.hash == hash && ((k = e.key) == key || key.equals(k)))
                return e.value;
        }
        return null;
    }

Our theory was that the linked list of entries at a particular bucket had somehow become corrupted and was pointing back to itself, so the loop could never terminate. Since no job ever finished, more and more threads were consumed from the pool as further requests came in.

This can occur if the table has to be resized whilst new entries are being put, but there is unguarded read/write access from several threads: one thread may be extending the linked list at a particular bucket whilst another is busy trying to move it. If access to the hash map is not synchronized then it is quite likely to become corrupted eventually (although the corruption is generally not reproducible).
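To make that concrete, here is a contrived sketch (not your code, and the class and field names are made up) of the kind of unsynchronized shared access that can corrupt a plain HashMap in this way:

    import java.util.HashMap;
    import java.util.Map;

    public class HashMapRaceDemo {
        // Shared, unsynchronized map: this is the bug being illustrated.
        private static final Map<Integer, String> cache = new HashMap<Integer, String>();

        public static void main(String[] args) {
            // Several writer threads hammering the same map. If a resize in one
            // thread interleaves with a put in another, the entry chain at a
            // bucket can end up pointing back at itself, after which get() on
            // that bucket spins forever, exactly as in the loop above.
            for (int t = 0; t < 8; t++) {
                new Thread(new Runnable() {
                    public void run() {
                        for (int i = 0; i < 1000000; i++) {
                            cache.put((int) (Math.random() * 100000), "value");
                        }
                    }
                }).start();
            }
        }
    }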

Check whether there is a shared HashMap (or HashSet) which several threads can access simultaneously. If so, and it is easy to change, either replace it with a ConcurrentHashMap, or use a ReentrantReadWriteLock to guard read/write access to the map. You could of course try Collections.synchronizedMap() too, but that would not be as scalable.
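A rough sketch of the first two options (the class and field names are invented for illustration):

    import java.util.HashMap;
    import java.util.Map;
    import java.util.concurrent.ConcurrentHashMap;
    import java.util.concurrent.locks.ReentrantReadWriteLock;

    public class SafeCaches {
        // Option 1: ConcurrentHashMap is safe for concurrent use and scales well;
        // usually a drop-in replacement if you control where the map is created.
        private final Map<String, Object> concurrentCache = new ConcurrentHashMap<String, Object>();

        // Option 2: keep the plain HashMap but guard every access with a
        // read/write lock (readers proceed in parallel, writers are exclusive).
        private final Map<String, Object> guardedCache = new HashMap<String, Object>();
        private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();

        public Object get(String key) {
            lock.readLock().lock();
            try {
                return guardedCache.get(key);
            } finally {
                lock.readLock().unlock();
            }
        }

        public void put(String key, Object value) {
            lock.writeLock().lock();
            try {
                guardedCache.put(key, value);
            } finally {
                lock.writeLock().unlock();
            }
        }
    }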

Any of these proposed fixes should prevent the issue, if that turns out to be the root cause of your problem.

See also:

http://lightbody.net/blog/2005/07/hashmapget_can_cause_an_infini.html

http://mailinator.blogspot.com/2009/06/beautiful-race-condition.html

rhu