views: 409
answers: 2
I'm trying to figure out how to write a program in Python that uses a multiprocessing queue.

I have multiple servers and one of them will provide the queue remotely with this:

from multiprocessing.managers import BaseManager
import Queue
import daemonme

queue = Queue.Queue()

class QueueManager(BaseManager):
    pass

# run as a daemon and expose the queue remotely under the name 'get_job'
daemonme.createDaemon()
QueueManager.register('get_job', callable=lambda: queue)
m = QueueManager(address=('', 50000), authkey='')
s = m.get_server()
s.serve_forever()

Now I want to use my dual Xeon, quad-core server to process jobs off of this remote queue. The jobs are totally independent of one another. So if I have 8 cores, I'd like to start 7 processes that each pick a job off the queue, process it, then go back for the next one. Each of the 7 processes will do this, but I can't quite wrap my head around the structure of this program.

Can anyone provide me some educated ideas about the basic structure of this?

Thank you in advance.

A: 

You should use the master/slave (a.k.a. farmer/worker) pattern. The initial process acts as the master and creates the jobs. It:

  1. creates a Queue
  2. creates 7 slave processes, passing the queue as a parameter
  3. starts writing jobs into the queue

The slave processes continuously read from the queue, and perform the jobs (perhaps until they receive a stop message from the queue). There is no need to use Manager objects in this scenario, AFAICT.
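A minimal sketch of that structure on a single machine (the job values, NUM_WORKERS, the None stop sentinel, and process_job are illustrative placeholders, not from the answer itself):

from multiprocessing import Process, Queue

NUM_WORKERS = 7
STOP = None  # sentinel telling a worker to exit

def process_job(job):
    print(job)  # placeholder for the real work

def worker(queue):
    # keep pulling jobs until the stop message arrives
    while True:
        job = queue.get()
        if job is STOP:
            break
        process_job(job)

if __name__ == '__main__':
    queue = Queue()                                    # 1. create a Queue
    workers = [Process(target=worker, args=(queue,))   # 2. create 7 slave processes
               for _ in range(NUM_WORKERS)]
    for w in workers:
        w.start()
    for job in range(100):                             # 3. write jobs into the queue
        queue.put(job)
    for _ in workers:
        queue.put(STOP)  # one stop message per worker
    for w in workers:
        w.join()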

Martin v. Löwis
How would your implementation manage *remote* queues? I think multiprocessing.managers is a really good choice if he needs to share resources remotely.
AlberT
+2  A: 

Look at the docs for how to retrieve a queue from the manager (paragraph 17.6.2.7), then, with a pool of workers (paragraph 17.6.2.9), launch 7 jobs, passing the queue to each one.
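A rough sketch of that first suggestion, here using 7 plain Process workers rather than a Pool; it assumes the server from the question is already running and registered the queue under 'get_job', and 'servername', the empty authkey, do_work, and the None stop sentinel are placeholders:

from multiprocessing import Process
from multiprocessing.managers import BaseManager

class QueueManager(BaseManager):
    pass

# the server in the question registered the queue as 'get_job'
QueueManager.register('get_job')

def do_work(job):
    print(job)  # placeholder for the real processing

def worker(queue):
    # pull jobs off the shared queue proxy until a None sentinel arrives
    while True:
        job = queue.get()
        if job is None:
            break
        do_work(job)

if __name__ == '__main__':
    m = QueueManager(address=('servername', 50000), authkey='')
    m.connect()
    queue = m.get_job()   # retrieve the queue proxy from the manager
    workers = [Process(target=worker, args=(queue,)) for _ in range(7)]
    for w in workers:
        w.start()
    for w in workers:
        w.join()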

Alternatively, you can structure it like a producer/consumer problem:

from multiprocessing.managers import BaseManager
import random
import time

class Producer():
    def __init__(self):
        # connect to the remote manager and fetch the shared queue proxy
        BaseManager.register('queue')
        self.m = BaseManager(address=('hostname', 50000), authkey='jgsjgfdjs')
        self.m.connect()
        self.cm_queue = self.m.queue()
        while 1:
            time.sleep(random.randint(1, 3))
            self.cm_queue.put(<PUT-HERE-JOBS>)

from multiprocessing.managers import BaseManager
import time
import random

class Consumer():
    def __init__(self):
        # connect to the same manager and pull jobs off the shared queue
        BaseManager.register('queue')

        self.m = BaseManager(address=('host', 50000), authkey='jgsjgfdjs')
        self.m.connect()
        self.queue = self.m.queue()
        while 1:
            <EXECUTE(job = self.queue.get())>


from multiprocessing.managers import BaseManager
import Queue

class Manager():

    def __init__(self):
        # serve a shared queue under the name 'queue' so the producer and
        # consumers above can retrieve it remotely
        self.queue = Queue.Queue()

        BaseManager.register('queue', callable=lambda: self.queue)

        self.m = BaseManager(address=('host', 50000), authkey='jgsjgfdjs')
        self.s = self.m.get_server()

        self.s.serve_forever()
DrFalk3n
I have this working (thank you). What I need to know is: in your <EXECUTE(job = self.queue.get())> section, what is the best way to process these jobs? They're all Python files, so would it be best to run them as a module? Or should they be run as a separate Python process with the subprocess module?
WeWatchYourWebsite
Look at the os module and its exec* functions, for example.
DrFalk3n
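If the jobs pulled off the queue are paths to Python files, one option is the subprocess module the asker mentions rather than os.exec*, since the exec* calls replace the worker process itself. A hedged sketch, where execute() is a hypothetical helper standing in for the <EXECUTE(...)> placeholder:

import subprocess
import sys

def execute(job):
    # assume 'job' is the path of a Python script taken from the queue;
    # run it in a separate interpreter process and wait for it to finish
    ret = subprocess.call([sys.executable, job])
    if ret != 0:
        print('job %s exited with code %d' % (job, ret))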