ansaurus

Question

How good is the JVM at parallel processing? When should I create my own Threads and Runnables? Why might threads interfere?

Answer 1

+10 A:

Java Concurrency Tutorial

If you're just spawning a bunch of stuff off to different threads, and it isn't going to be talking back and forth between different threads, it isn't too hard; just write each in a Runnable and pass them off to an ExecutorService.

You should skim the whole tutorial, but for this particular task, start here.

Basically, you do something like this:

ExecutorService executorService = Executors.newFixedThreadPool(n);

where n is the number of things you want running at once (usually the number of CPUs). Each of your tasks should be an object that implements Runnable, and you then execute it on your ExecutorService:

executorService.execute(new SimulationTask(parameters...));

Executors.newFixedThreadPool(n) creates a pool that runs at most n threads, and execute() will insert the tasks into a queue that feeds those threads. When a task finishes, the thread it was running on is no longer busy, and the next task in the queue will start running on it. execute() won't block; it will just put the task into the queue and move on to the next one.
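For illustration, here is a minimal sketch of that pattern. The class name FixedPoolSketch, the runAll helper, and the body of SimulationTask are made up for this example; the real task would do whatever your simulation does. The shared AtomicInteger is only there so the caller can check that every task ran.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

public class FixedPoolSketch {
    // Hypothetical stand-in for the SimulationTask mentioned above.
    static class SimulationTask implements Runnable {
        private final int steps;
        private final AtomicInteger finished;

        SimulationTask(int steps, AtomicInteger finished) {
            this.steps = steps;
            this.finished = finished;
        }

        @Override
        public void run() {
            long state = 0; // local to this task, never shared
            for (int i = 0; i < steps; i++) {
                state += i;
            }
            finished.incrementAndGet(); // thread-safe completion counter
        }
    }

    // Queues taskCount tasks on a fixed pool and returns how many completed.
    static int runAll(int taskCount) throws InterruptedException {
        int n = Runtime.getRuntime().availableProcessors();
        ExecutorService executorService = Executors.newFixedThreadPool(n);
        AtomicInteger finished = new AtomicInteger();
        for (int i = 0; i < taskCount; i++) {
            // execute() only enqueues the task and returns immediately
            executorService.execute(new SimulationTask(1000, finished));
        }
        executorService.shutdown(); // accept no new tasks; queued ones still run
        if (!executorService.awaitTermination(1, TimeUnit.MINUTES)) {
            throw new IllegalStateException("tasks did not finish in time");
        }
        return finished.get();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(runAll(20) + " tasks finished");
    }
}
```

Calling shutdown() followed by awaitTermination() is one common way to wait for everything in the queue to drain before the program moves on.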

The thing to be careful of is that you really AREN'T sharing any mutable state between tasks. Your task classes shouldn't depend on anything mutable that will be shared among them (e.g. static data). There are ways to deal with shared mutable state (locking), but if you can avoid the problem entirely it will be a lot easier.
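If you can't avoid sharing, the locking route might look roughly like the sketch below. The Tally class and the runTasks helper are hypothetical names invented for this example; the point is only that every read and write of the shared field happens while holding the same lock.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.ReentrantLock;

public class SharedTallySketch {
    // Hypothetical shared state: one running total that every task updates.
    static class Tally {
        private final ReentrantLock lock = new ReentrantLock();
        private long total = 0;

        void add(long amount) {
            lock.lock();
            try {
                total += amount; // only one thread at a time gets here
            } finally {
                lock.unlock();
            }
        }

        long total() {
            lock.lock();
            try {
                return total;
            } finally {
                lock.unlock();
            }
        }
    }

    // Runs `tasks` tasks that each add 1 to the tally `incrementsPerTask` times.
    static long runTasks(int tasks, int incrementsPerTask) throws InterruptedException {
        Tally tally = new Tally();
        ExecutorService pool = Executors.newFixedThreadPool(4);
        for (int t = 0; t < tasks; t++) {
            pool.execute(() -> {
                for (int i = 0; i < incrementsPerTask; i++) {
                    tally.add(1);
                }
            });
        }
        pool.shutdown();
        if (!pool.awaitTermination(1, TimeUnit.MINUTES)) {
            throw new IllegalStateException("tasks did not finish in time");
        }
        return tally.total();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("total = " + runTasks(8, 10_000));
    }
}
```

Without the lock, the `total += amount` updates from different threads could overwrite each other and the final number would come out too low on some runs.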

EDIT: Reading your edits to your question, it looks like you really want something a little different. Instead of implementing Runnable, implement Callable. Your call() method should be pretty much the same as your current run(), except it should return getResults();. Then, submit() it to your ExecutorService. You will get a Future in return, which you can use to test if the simulation is done, and, when it is, get your results.
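A rough sketch of the Callable version, assuming a made-up task that just sums squares. CallableSketch, runOne, and the body of call() are hypothetical; the return statement plays the role of `return getResults();`.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class CallableSketch {
    // Hypothetical task: same idea as a Runnable, but call() returns a result.
    static class SimulationTask implements Callable<Long> {
        private final int n;

        SimulationTask(int n) {
            this.n = n;
        }

        @Override
        public Long call() {
            long sumOfSquares = 0;
            for (int i = 1; i <= n; i++) {
                sumOfSquares += (long) i * i;
            }
            return sumOfSquares; // stands in for "return getResults();"
        }
    }

    // Submits one task and waits for its result.
    static long runOne(int n) throws InterruptedException, ExecutionException {
        ExecutorService executorService = Executors.newFixedThreadPool(2);
        try {
            Future<Long> future = executorService.submit(new SimulationTask(n));
            // future.isDone() could be polled here without blocking
            return future.get(); // blocks until the task has finished
        } finally {
            executorService.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println("result = " + runOne(10));
    }
}
```

If call() throws, the exception comes back wrapped in an ExecutionException from future.get(), so that is the place to handle task failures.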

Adam Jaskiewicz 2009-04-23 22:39:38

Thanks, good sir. So when I read about low-level threads, I should think of them as things beyond my concern and control? If I want any multithreading, do I need to implement it myself?

hornairs 2009-04-23 22:51:54

Added a bit more clarification about what is going on. It is multithreading underneath, but you don't deal with it directly. You just say "these are independent tasks; run them at the same time".

Adam Jaskiewicz 2009-04-23 22:56:52

Check the API: http://java.sun.com/j2se/1.5.0/docs/api/java/util/concurrent/ExecutorService.htm You write Callables (invokeAll takes Callables, not plain Runnables) that do your processing, then you can call myResult = executorService.invokeAll(yourListOfCallables) and get a list of Futures holding the results when everything is done. Make sure that the tasks do not use static variables etc.
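A small sketch of the invokeAll approach, assuming Java 8 lambdas and a trivial made-up task (squaring a number). InvokeAllSketch and squares are hypothetical names; invokeAll returns one Future per task, in the same order as the input list.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class InvokeAllSketch {
    // Builds `count` independent tasks, runs them all, and collects the results.
    static List<Integer> squares(int count) throws InterruptedException, ExecutionException {
        List<Callable<Integer>> tasks = new ArrayList<>();
        for (int i = 0; i < count; i++) {
            final int value = i; // each task captures its own immutable input
            tasks.add(() -> value * value);
        }

        ExecutorService executor = Executors.newFixedThreadPool(4);
        try {
            // Blocks until every task has completed (normally or with an exception)
            List<Future<Integer>> futures = executor.invokeAll(tasks);
            List<Integer> results = new ArrayList<>();
            for (Future<Integer> future : futures) {
                results.add(future.get());
            }
            return results;
        } finally {
            executor.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(squares(5));
    }
}
```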

KarlP 2009-04-24 10:07:24

Answer 2

+2 A:

You can also look at the new fork/join framework by Doug Lea. One of the best books on the subject is certainly Java Concurrency in Practice. I would strongly recommend that you take a look at the fork/join model.
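To give a feel for the fork/join model, here is a minimal sketch using the version of the framework that later shipped in java.util.concurrent (Java 7 onward; commonPool() needs Java 8). The SumTask class, the threshold of 1,000, and the array-summing problem are all assumptions chosen for the example.

```java
import java.util.concurrent.ForkJoinPool;
import java.util.concurrent.RecursiveTask;

public class ForkJoinSketch {
    // Hypothetical divide-and-conquer task: sums data[from..to).
    static class SumTask extends RecursiveTask<Long> {
        private static final int THRESHOLD = 1_000; // arbitrary cutoff for this sketch
        private final long[] data;
        private final int from;
        private final int to;

        SumTask(long[] data, int from, int to) {
            this.data = data;
            this.from = from;
            this.to = to;
        }

        @Override
        protected Long compute() {
            if (to - from <= THRESHOLD) {
                // Small enough: just do the work directly
                long sum = 0;
                for (int i = from; i < to; i++) {
                    sum += data[i];
                }
                return sum;
            }
            int mid = (from + to) >>> 1;
            SumTask left = new SumTask(data, from, mid);
            SumTask right = new SumTask(data, mid, to);
            left.fork();                     // schedule the left half asynchronously
            long rightSum = right.compute(); // work on the right half in this thread
            return left.join() + rightSum;   // wait for the left half and combine
        }
    }

    static long sum(long[] data) {
        return ForkJoinPool.commonPool().invoke(new SumTask(data, 0, data.length));
    }

    public static void main(String[] args) {
        long[] data = new long[10_000];
        for (int i = 0; i < data.length; i++) {
            data[i] = i + 1;
        }
        System.out.println("sum = " + sum(data));
    }
}
```

The array is only read, never written, by the subtasks, so no locking is needed even though all of them share it.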

dfa 2009-04-23 23:02:55

Answer 3

A:

If you are doing full-out processing all the time in your threads, you won't benefit from having more threads than processors. If your threads occasionally wait on each other or on the system, then Java scales well up to thousands of threads.

I wrote an app that discovered a class B network (about 65,000 addresses) in a few minutes by pinging each node, and each ping had retries with an increasing delay. When I put each ping on a separate thread (this was before NIO, I could probably improve it now), I could run to about 4000 threads on Windows before things started getting flaky. Linux the number was nearer 1000 (Never figured out why).
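As a rough illustration of why waiting tasks can use many more threads than there are processors, the sketch below fakes the ping with Thread.sleep. Nothing here touches the network: fakePing, countReachable, and the rule that even-numbered hosts "answer" are all invented for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class WaitingTasksSketch {
    // Hypothetical stand-in for a ping: the thread mostly waits instead of computing.
    static boolean fakePing(int host, long waitMillis) throws InterruptedException {
        Thread.sleep(waitMillis);
        return host % 2 == 0; // pretend that even-numbered hosts answer
    }

    // Pings `hosts` fake hosts on a pool of `threads` threads and counts the replies.
    static int countReachable(int hosts, int threads, long waitMillis)
            throws InterruptedException, ExecutionException {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<Boolean>> replies = new ArrayList<>();
            for (int h = 0; h < hosts; h++) {
                final int host = h;
                Callable<Boolean> ping = () -> fakePing(host, waitMillis);
                replies.add(pool.submit(ping));
            }
            int reachable = 0;
            for (Future<Boolean> reply : replies) {
                if (reply.get()) {
                    reachable++;
                }
            }
            return reachable;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        // Far more threads than CPUs is reasonable here because each one just sleeps
        System.out.println(countReachable(200, 200, 100) + " hosts answered");
    }
}
```

With a pool sized to the CPU count the same run would take many times longer, since each thread would spend its time sleeping while the remaining pings sat in the queue.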

No matter what language or toolkit you use, if your data interacts, you will have to pay some attention to those areas where it does. Java uses the synchronized keyword to prevent two threads from executing a section of code guarded by the same lock at the same time. If you write your Java in a more functional manner (making all your members final, so objects never change after construction) you can run without synchronization, but it can be--well let's just say solving problems takes a different approach that way.
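Both ideas can be sketched in a few lines. In the example below, Reading is an immutable value (all fields final) that can be handed between threads freely, while SlowestTracker holds mutable state and guards it with synchronized methods. All of the names and the latency numbers are hypothetical.

```java
public class SynchronizedSketch {
    // Immutable value: every field is final, so no locking is needed to share it.
    static final class Reading {
        final String host;
        final int latencyMs;

        Reading(String host, int latencyMs) {
            this.host = host;
            this.latencyMs = latencyMs;
        }
    }

    // Mutable shared state: all access goes through synchronized methods,
    // which lock on the tracker instance.
    static class SlowestTracker {
        private Reading slowest;

        synchronized void offer(Reading reading) {
            if (slowest == null || reading.latencyMs > slowest.latencyMs) {
                slowest = reading;
            }
        }

        synchronized Reading slowest() {
            return slowest;
        }
    }

    // Starts `threads` threads that each report one reading, then returns the slowest.
    static Reading slowestOf(int threads) throws InterruptedException {
        SlowestTracker tracker = new SlowestTracker();
        Thread[] workers = new Thread[threads];
        for (int i = 0; i < threads; i++) {
            final int latency = (i + 1) * 10;
            workers[i] = new Thread(
                    () -> tracker.offer(new Reading("host-" + latency, latency)));
            workers[i].start();
        }
        for (Thread worker : workers) {
            worker.join(); // wait for every worker before reading the result
        }
        return tracker.slowest();
    }

    public static void main(String[] args) throws InterruptedException {
        Reading slowest = slowestOf(5);
        System.out.println(slowest.host + " took " + slowest.latencyMs + " ms");
    }
}
```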

Java has other tools to manage units of independent work; look in the java.util.concurrent package for more information.

Bill K 2009-04-23 23:08:47

"Linux the number was nearer 1000 (Never figured out why)" - I think I had that same issue once and found out that the Linux kernel has a (configurable) maximum number of OS threads, with a default value of 1024.

Michael Borgwardt 2009-04-27 14:19:44

Answer 4

A:

Java is pretty good at parallel processing, but there are two caveats:

Java threads are relatively heavyweight (compared with e.g. Erlang), so don't start creating them in the hundreds or thousands. Each thread gets its own stack memory (256KB by default on some JVMs; the default varies by platform and can be changed with the -Xss option) and you could run out of memory, among other things.
If you run on a very powerful machine (especially with a lot of CPUs and a large amount of RAM), then the VM's default settings (especially concerning GC) may result in suboptimal performance and you may have to spend some time tuning them via command line options. Unfortunately, this is not a simple task and requires a lot of knowledge.
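On the stack memory point, besides the JVM-wide -Xss option there is a Thread constructor that takes a requested stack size per thread. The sketch below uses it with 256KB purely as an example value; the class and method names are made up, and the JVM is allowed to treat the size as a hint and round it or ignore it.

```java
public class StackSizeSketch {
    // Runs a trivial task on a thread created with a requested stack size.
    // Returns true if the task ran.
    static boolean runWithStackSize(long stackSizeBytes) throws InterruptedException {
        final boolean[] ran = {false};
        Runnable task = () -> ran[0] = true;
        // The last argument is the requested stack size in bytes; it is only a
        // suggestion to the JVM and may be adjusted or ignored on some platforms.
        Thread worker = new Thread(null, task, "small-stack-worker", stackSizeBytes);
        worker.start();
        worker.join(); // join() makes the write to ran[0] visible here
        return ran[0];
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("ran: " + runWithStackSize(256 * 1024));
    }
}
```

A smaller stack lets more threads fit in memory, at the cost of a StackOverflowError sooner in deeply recursive code.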

Michael Borgwardt 2009-04-27 14:28:02
