From hard experience I've found it useful to occasionally save the state of my long computations to disk, so that I can restart them later if something fails. Can I do this in a distributed computation package in R (like SNOW or multicore)?
It is not clear how this could be done, since the master collects results from the slaves in a non-transparent way.

A: 

This is (again :-) a hard one.

You could try to dump snapshots on the nodes using save() or save.image(). You could then try to re-organize your code so that the nodes can resume after the last snapshot.
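A minimal sketch of that node-side snapshot idea, assuming a worker that checkpoints with save() every so many iterations and resumes from the last snapshot on restart (the function name and state variables are illustrative, not part of SNOW):

```r
# Hypothetical node-side worker: checkpoints its state every `every`
# iterations via save(), and resumes from the last snapshot if one exists.
run_with_checkpoints <- function(total, every = 10,
                                 file = "node_state.RData") {
  i <- 1; acc <- 0                        # default starting state
  if (file.exists(file)) {
    load(file, envir = environment())     # restores i and acc
    i <- i + 1                            # resume after the last saved step
  }
  while (i <= total) {
    acc <- acc + i                        # stand-in for the real work
    if (i %% every == 0) save(i, acc, file = file)
    i <- i + 1
  }
  acc
}

res <- run_with_checkpoints(100, file = tempfile())  # sum of 1..100
```

Note the explicit `envir = environment()` on load(): by default load() restores into the caller's frame, not the worker function's own.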

Or you could try to re-organize your workflow such that nodes 'take tickets' and return the results. That way the central node keeps tabs on everything and you can log interim results there.
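The ticketing idea can be sketched serially like this: the master owns the task list, hands out one ticket at a time, and logs each interim result to disk so a crashed run only redoes unfinished tickets (function names are illustrative; in SNOW the worker call would run on a remote node):

```r
# Serial sketch of the 'ticketing' workflow: the master keeps tabs on
# everything and logs interim results, so a restart skips finished tasks.
run_tickets <- function(tasks, worker, log = "results.rds") {
  done <- if (file.exists(log)) readRDS(log) else list()
  todo <- setdiff(names(tasks), names(done))  # only unfinished tickets
  for (id in todo) {
    done[[id]] <- worker(tasks[[id]])         # remote call in a real cluster
    saveRDS(done, log)                        # interim results survive a crash
  }
  done
}

tasks <- setNames(as.list(1:5), paste0("t", 1:5))
out <- run_tickets(tasks, function(x) x^2, log = tempfile(fileext = ".rds"))
```

Re-running with the same log file would find all five tickets already done and return immediately.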

Either way, what you desire is not available out of the box (as far as I know).

Dirk Eddelbuettel
Do you think if I transitioned to NWS I could dump the workspace every couple *large number* of iterations? Even though I'm running on multiple cores, I could maybe count through the random-number streams to retrieve the RNG state as well.
James
But if you 'phone home' from the nodes you get all that communication overhead. It's tough -- but ultimately it's your trade-off to make. And RNG state can be dumped easily. But, e.g., in the 'ticketing' approach I mentioned, you could provide the seed from the master for each task, and then you'd control things.
Dirk Eddelbuettel