ansaurus

Question

Running multiple instances of a python program efficiently & economically?

Answer 1

+1 A:

Does Process access the data on the binary files directly or do you cache it in memory? Reducing the usage of I/O operations should help.

Also, isn't it possible to break Process into separate functions running in parallel? How is the data dependency inside the function?

Finally, you could give some cloud computing service like Amazon EC2 a try (don't forget to read this for tools), but it won't be cheap (EC2 starts at $0.085 per hour) - an alternative would be going to an university with a computer cluster (they are pretty common nowadays, but it will be easier if you know someone there).

bnery 2010-08-29 19:01:11

bnery: the binary files are not cached in memory, as they are too big and cannot fit in memory. I read the files using numpy's mmap(), which is very, very fast.

Shalom Rav 2010-08-29 20:35:47

Answer 2

+3 A:

Check out PiCloud: http://www.picloud.com/

import cloud
cloud.call(function)

Maybe it's an easy solution.

niscy 2010-08-29 19:31:52

very interesting. thank you for bringing this to my attention.

Shalom Rav 2010-08-29 20:36:07

+1 for the link to PiCloud...quite interesting :)

elo80ka 2010-08-30 00:08:48

Answer 3

+1 A:

Well, from your description, it sounds like things are IO bound... In which case parallelism (at least on one IO device) isn't going to help much.

Edit: I just realized that you were referring more to full cloud computing, rather than running multiple processes on one machine... My advice below still holds, though.... PyTables is quite nice for out-of-core calculations!

You mentioned that you're using numpy's mmap to access the data. Therefore, your execution time is likely to depend heavily on how your data is structured on the disc.

Memmapping can actually be quite slow in any situation where the physical hardware has to spend most of its time seeking (e.g. reading a slice along a plane of constant Z in a C-ordered 3D array). One way of mitigating this is to change the way your data is ordered to reduce the number of seeks required to access the parts you are most likely to need.

Another option that may help is compressing the data. If your process is extremely IO bound, you can actually get significant speedups by compressing the data on disk (and sometimes even in memory) and decompressing it on-the-fly before doing your calculation.

The good news is that there's a very flexible, numpy-oriented library that's already been put together to help you with both of these. Have a look at pytables.

I would be very surprised if tables.Expr doesn't significantly (~ 1 order of magnitude) outperform your out-of-core calculation using a memmapped array. See here for a nice, (though canned) example. From that example:

PyTables vs Numpy Memmap

Joe Kington 2010-08-30 15:45:20

ansaurus

tags:

views:

answers:

Running multiple instances of a python program efficiently & economically?

related questions