Hi

We generate PDF files with the monthly financial balance of tens of thousands of clients. At its peak (100,000 files at the end of the year), the process may take as long as 5 days to complete, even with the load distributed across 5 servers. The workload is distributed manually (e.g. server 1 generates the PDFs for clients 1 to 20,000, server 2 for clients 20,001 to 40,000, and so on).

We use Java, so we would like to use a Java tool or framework in a fashion similar to BOINC (BOINC is not coded in Java), which would allow us to have:

  • a server automatically distributing data and code to workers, and managing them
  • workers (plain dual-core desktops with free, or not so free, CPU cycles) receiving the code sent by the server and executing it with the data as input, giving some status feedback, and finally doing something with the output, e.g. sending the generated PDF files to network storage.

For now, we don't want to try heavyweight all-powerful toolkits such as Globus Toolkit.

Do you know of any Java frameworks or tools suited to our needs? Open source would be nice, but we don't rule out commercial solutions.

+1  A: 

There are many:

Google is your best friend here.

Byron Whitlock
Thanks. In fact I had googled a little and found GridGain and JPPF, but nothing quite like BOINC. Cajo looks interesting.
Marco Ramírez
A bit late... but +1 for using the dmoz directory.
Andy
A: 

Google says there is a Java port of BOINC here: http://boincoid.sf.net

It mostly targets Android :-/ but apparently everything except the GUI can run on a regular VM.

ykaganovich
Nice. In fact it says there will be a Java Port and an Android Port soon.
Marco Ramírez
A: 

Another one you can look at is Terracotta.

eugener
A: 

Check out Hazelcast's distributed executor service. It uses Java's standard ExecutorService API.
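
To give an idea of what that looks like, here is a minimal sketch written against the plain java.util.concurrent.ExecutorService interface only. The names PdfBatchSketch, PdfBatchTask and runAll are made up for illustration, the task just counts clients instead of generating PDFs, and a local thread pool stands in for the cluster. With a distributed executor the submitting code should stay the same and only the place the executor comes from changes; check the Hazelcast documentation for the exact call to obtain it.

```java
import java.io.Serializable;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class PdfBatchSketch {

    // Hypothetical task standing in for "generate the PDFs for one range of clients".
    // It is Serializable because a distributed executor has to ship it to another JVM.
    static class PdfBatchTask implements Callable<Integer>, Serializable {
        private static final long serialVersionUID = 1L;
        private final int firstClient;
        private final int lastClient;

        PdfBatchTask(int firstClient, int lastClient) {
            this.firstClient = firstClient;
            this.lastClient = lastClient;
        }

        @Override
        public Integer call() {
            // Real PDF generation would go here; the sketch only reports
            // how many clients the batch covers.
            return lastClient - firstClient + 1;
        }
    }

    // Splits clients 1..totalClients into batches of batchSize, submits every
    // batch to the executor and returns the total number of clients processed.
    static int runAll(ExecutorService executor, int totalClients, int batchSize)
            throws Exception {
        List<Future<Integer>> results = new ArrayList<>();
        for (int first = 1; first <= totalClients; first += batchSize) {
            int last = Math.min(first + batchSize - 1, totalClients);
            results.add(executor.submit(new PdfBatchTask(first, last)));
        }
        int processed = 0;
        for (Future<Integer> result : results) {
            processed += result.get(); // blocks until that batch has finished
        }
        return processed;
    }

    public static void main(String[] args) throws Exception {
        // Local stand-in (assumption): in a cluster this would be the
        // distributed executor rather than a thread pool in this JVM.
        ExecutorService executor = Executors.newFixedThreadPool(4);
        try {
            System.out.println("Clients processed: " + runAll(executor, 100_000, 1_000));
        } finally {
            executor.shutdown();
        }
    }
}
```

Small batches instead of five fixed ranges mean that an idle worker simply picks up the next batch, so nobody has to assign ranges to machines by hand.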

Talip Ozturk