I have a numpy array of 640x480 images, stacked 630 images deep; the total array is thus 630x480x640. I want to generate an average image, as well as compute the standard deviation of each pixel across all 630 images.

This is easily accomplished by

avg_image = numpy.mean(img_array, axis=0)
std_image = numpy.std(img_array, axis=0)

However, since I'm running this for 50 or so such arrays, and have an 8-core/16-thread workstation, I figured I'd get greedy and parallelize things with multiprocessing.Pool.

So I did the following:

def chunk_avg_map(chunk):
    #do the processing
    sig_avg = numpy.mean(chunk, axis=0)
    sig_std = numpy.std(chunk, axis=0)
    return([sig_avg, sig_std])

def chunk_avg(img_data):

    #one chunk per image row: each chunk holds that row from all 630 images (630x640)
    chunks = [img_data[:,i,:] for i in range(len(img_data[0]))]

    pool = multiprocessing.Pool()
    result = pool.map(chunk_avg_map, chunks)
    pool.close()
    pool.join()
    return result

However, I saw only a small speedup. By putting print statements in chunk_avg_map, I was able to determine that only one or two worker processes appear to be active at any given time, rather than 16 (as I would expect).
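
(The print statements were nothing elaborate; something along the lines of the sketch below, which tags each call with the name of the pool worker that ran it, is enough to see how many workers are busy. It assumes the same numpy and multiprocessing imports as the full script further down.)

def chunk_avg_map(chunk):
    #report which pool worker picked this chunk up
    name = multiprocessing.current_process().name
    print("%s: processing chunk of shape %s" % (name, chunk.shape))
    sig_avg = numpy.mean(chunk, axis=0)
    sig_std = numpy.std(chunk, axis=0)
    return([sig_avg, sig_std])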

I then ran my code through cProfile in IPython:

%prun current_image_anal.main()

The result indicated that by far the most time was spent in calls to acquire:

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
     1527  309.755    0.203  309.755    0.203 {built-in method acquire}

I understand this to have something to do with locking, but I don't understand why my code would be spending so much time there. Does anyone have any ideas?

[EDIT] As requested, here is a runnable script which demonstrates the problem. You can profile it by whatever means you like (one self-contained way is shown just after the script), but when I did, I found that the lion's share of the time was taken up by calls to acquire, rather than by mean or std as I would have expected.

#!/usr/bin/python
import numpy
import multiprocessing

def main():
    fake_images = numpy.random.randint(0,2**14,(630,480,640))
    chunk_avg(fake_images)

def chunk_avg_map(chunk):
    #do the processing
    sig_avg = numpy.mean(chunk, axis=0)
    sig_std = numpy.std(chunk, axis=0)
    return([sig_avg, sig_std])

def chunk_avg(img_data):

    #one chunk per image row: each chunk holds that row from all 630 images (630x640)
    chunks = [img_data[:,i,:] for i in range(len(img_data[0]))]

    pool = multiprocessing.Pool()
    result = pool.map(chunk_avg_map, chunks)
    pool.close()
    pool.join()
    return result

if __name__ == "__main__":
    main()
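
For example, swapping the __main__ block for a call to the standard-library profiler (just one way to do it) gives the same kind of per-function table as %prun:

if __name__ == "__main__":
    import cProfile
    cProfile.run("main()")   #prints a per-function profile table to stdout
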
+3  A: 

I believe the problem is that the amount of CPU time it takes to process each chunk is small relative to the amount of time it takes to copy the input and output to and from the worker processes. I modified your example code to split the input into 16 even chunks and to print the difference in CPU time (time.clock()) between when a run of chunk_avg_map() begins and ends. On my system each individual run took slightly under a second of CPU time, but the overall CPU time usage for the process group (system + user time) was more than 38 seconds. At roughly 0.75 seconds of copying overhead per chunk, your program performs its calculations only slightly faster than multiprocessing can deliver the data, so only about two worker processes are ever utilized at once.
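
In outline, the modified chunking and worker looked something like the sketch below; numpy.array_split handles the even 16-way split (480 rows gives slabs of 30 rows each), and the exact print formatting is incidental:

import time
import numpy
import multiprocessing

def chunk_avg_map(chunk):
    start = time.clock()   #CPU time in this worker at entry
    sig_avg = numpy.mean(chunk, axis=0)
    sig_std = numpy.std(chunk, axis=0)
    print("chunk CPU time: %.2f s" % (time.clock() - start))
    return([sig_avg, sig_std])

def chunk_avg(img_data):
    #16 contiguous slabs of rows instead of 480 single-row chunks
    chunks = numpy.array_split(img_data, 16, axis=1)
    pool = multiprocessing.Pool()
    result = pool.map(chunk_avg_map, chunks)
    pool.close()
    pool.join()
    return result

The important point is only that each worker call now does enough computation to be measured against the cost of pickling its chunk back and forth.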

If I modify the code so that the "input data" is just xrange(16) and the random array is built within chunk_avg_map(), then I see the system + user time drop to around 19 seconds, with all 16 worker processes executing at the same time.
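
That version is, again only in sketch form, roughly the following; each worker generates a sixteenth-sized slab itself so the total amount of computation stays the same, and the i argument exists only to give map() something to iterate over (the imports and __main__ guard from the script above are unchanged):

def main():
    chunk_avg()

def chunk_avg_map(i):
    #generate the data inside the worker, so nothing large is pickled in
    chunk = numpy.random.randint(0, 2**14, (630, 30, 640))
    sig_avg = numpy.mean(chunk, axis=0)
    sig_std = numpy.std(chunk, axis=0)
    return([sig_avg, sig_std])

def chunk_avg():
    pool = multiprocessing.Pool()
    result = pool.map(chunk_avg_map, xrange(16))
    pool.close()
    pool.join()
    return result

The mean and std arrays still have to be pickled back to the parent, but they are small compared with the input slabs.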

llasram