clojure pmap/preduce vs fork-join

+4 Q:

It looks like Clojure will get a fork-join implementation that is a functional wrapper over Java's fork-join framework.

What would the difference be between this and pmap/preduce?

+1 A:

From looking at that code, their functionality will be mostly the same. The only difference is that pmap uses Futures running on the Agent thread pool as its underlying primitive, while pvmap uses fork-join.

I'm not in a position to say for sure, but I'd expect that whichever one performs better in the general case would become the standard implementation for pmap, unless there are significant enough tradeoffs to make having both worthwhile.

It also looks like (for now at least) the fork-join framework only supports vectors, so it's not semi-lazy like pmap.
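To illustrate what "semi-lazy" means here, a minimal sketch: pmap returns a lazy sequence, so it can be applied to an infinite input as long as only a finite prefix is consumed. The name first-squares is just for illustration. A vector-based operation could not accept such an input, since the whole vector has to exist up front.

```clojure
;; pmap only runs a bounded number of futures ahead of the consumer,
;; so taking a finite prefix of an infinite input terminates.
(def first-squares
  (doall (take 5 (pmap #(* % %) (range)))))

;; first-squares => (0 1 4 9 16)
```

In a standalone script (as opposed to a REPL), calling (shutdown-agents) at the end avoids waiting for the idle future threads to time out.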

levand 2010-09-17 20:16:33

+2 A:

Fork-join is more general than the sequence-based pmap/preduce, and should allow for more fine-grained control over parallelism. The exact APIs for doing this are still up in the air.

Stuart Sierra 2010-09-17 20:55:42

+1 A:

One difference, as far as I understand it, is that pmap will run only at whatever degree of "chunkiness" it is given. The function is mapped over each member of the sequence given to pmap. If the granularity is too small, the potential benefits of parallelism get swallowed by the overhead of creating and managing too many Futures.

Fork-join enables work stealing so that how much gets run on each thread can be adaptive.
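As a rough sketch of the fork-join style, here is a divide-and-conquer sum written against the java.util.concurrent classes directly through interop. This assumes JDK 7 or later and is not the Clojure wrapper discussed in the question. The names sum-task and threshold are hypothetical. Each task splits its range in half until it is small enough, and the pool's work stealing lets idle threads pick up the forked halves.

```clojure
(import '(java.util.concurrent ForkJoinPool RecursiveTask))

(defn sum-task
  "Hypothetical helper: builds a RecursiveTask that sums the elements of
  vector v in the index range [lo, hi)."
  [v lo hi threshold]
  (proxy [RecursiveTask] []
    (compute []
      (if (<= (- hi lo) threshold)
        ;; Small enough: sum sequentially on the current thread.
        (reduce + (subvec v lo hi))
        ;; Otherwise split in half; fork the left half so another worker
        ;; can steal it, and run the right half on this thread.
        (let [mid (quot (+ lo hi) 2)
              ^RecursiveTask left  (sum-task v lo mid threshold)
              ^RecursiveTask right (sum-task v mid hi threshold)]
          (.fork left)
          (+ (.invoke right) (.join left)))))))

(def total
  (.invoke (ForkJoinPool.) (sum-task (vec (range 1000)) 0 1000 100)))

;; total => 499500
```

The threshold still sets the smallest unit of work, but how those units are spread across threads is decided at run time by the pool rather than fixed in advance.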

Alex Stoddard 2010-09-17 21:48:40

+1 A:

Neither pmap nor pvmap will save us from having to choose the correct chunk size. For my projects that usually means breaking the data into chunks, using map on each chunk, and using pmap to map over the chunks in parallel, then reducing and flattening the results.
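The chunking pattern described above can be sketched roughly as follows. The helper name chunked-pmap and the chunk size are illustrative assumptions; the right size depends on how expensive f is.

```clojure
(defn chunked-pmap
  "Hypothetical helper: applies f to every element of coll, processing
  chunks of chunk-size elements in parallel."
  [f chunk-size coll]
  (->> coll
       (partition-all chunk-size)
       ;; doall forces each chunk inside its own future; without it the
       ;; inner map would stay lazy and run later on the consuming thread.
       (pmap (fn [chunk] (doall (map f chunk))))
       ;; Flatten the per-chunk results back into a single sequence.
       (apply concat)))

(def incremented (chunked-pmap inc 100 (range 1000)))
```

The result matches (map inc (range 1000)), but only ten futures are created instead of one thousand.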

Arthur Ulfeldt 2010-09-17 22:01:00

+1 A:

These slides contain some charts showing comparisons between the two approaches: http://data-sorcery.org/2010/10/23/clojureconj/

rplevy 2010-10-24 20:15:56
