I am currently designing an application that has one module which will load large amounts of data from a database and reduce it to a much smaller set by various calculations depending on the circumstances.
Many of the more intensive operations behave deterministically and would lend themselves to parallel processing.
Assuming I have a loop that iterates over a large number of data chunks arriving from the db and calls a deterministic, side-effect-free function for each one, how would I make it so that the program does not wait for the function to return but instead sets the next calls going, so they can be processed in parallel? A naive approach that demonstrates the principle would do me for now.
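To make that concrete, here is a minimal sketch of the kind of thing I imagine, using Python's multiprocessing module; `reduce_chunk` and the chunk data are placeholders for my actual calculation and db code, not what I really run:

```python
from multiprocessing import Pool

def reduce_chunk(chunk):
    # stand-in for the deterministic, side-effect-free calculation
    return sum(chunk)

def process_parallel(chunks, workers=4):
    # each chunk is handed off to a worker process instead of waiting for
    # the previous call to return; imap_unordered yields results as they finish
    with Pool(processes=workers) as pool:
        return list(pool.imap_unordered(reduce_chunk, chunks))

if __name__ == "__main__":
    chunks = [list(range(i, i + 1000)) for i in range(0, 10000, 1000)]
    print(process_parallel(chunks))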
I have read Google's MapReduce paper, and while I could apply the overall principle in a number of places, I won't target large clusters for now; version 1.0 will run on a single multi-core or multi-CPU machine. So currently I'm not sure whether I can actually use the library or would have to roll a dumbed-down basic version myself.
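For what it's worth, the dumbed-down version I picture would be little more than a parallel map phase followed by a serial reduce phase on one machine; `mapper` and `combiner` below are placeholder names, not anything from an existing library:

```python
from functools import reduce
from multiprocessing import Pool

def mapper(chunk):
    # per-chunk calculation, run in parallel across cores (placeholder)
    return sum(chunk)

def combiner(a, b):
    # folds two partial results into one (placeholder)
    return a + b

def map_reduce(chunks, workers=4):
    with Pool(processes=workers) as pool:
        partials = pool.map(mapper, chunks)  # parallel "map" phase
    return reduce(combiner, partials)        # serial "reduce" phase

if __name__ == "__main__":
    print(map_reduce([list(range(i, i + 100)) for i in range(0, 1000, 100)]))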
I am at an early stage of the design process, and so far I am targeting C-something (for the speed-critical bits) and Python (for the productivity-critical bits) as my languages. If there are compelling reasons, I might switch, but so far I am content with my choice.
Please note that I'm aware it might take longer to retrieve the next chunk from the database than to process the current one, in which case the whole process would be I/O-bound. For now, however, I'll assume that it isn't, and in practice I'd use a db cluster, memory caching, or something similar to avoid being I/O-bound at this point.