ansaurus

Question

How to approach parallel processing of messages?

Answer 1

A:

The first method also has unpredictable ordering. The processing of message 1 on thread 1 could take very long, making it possible that message 2, 3 and 4 have long been processed

This would tip the balance to method 2

Edit: I see what you mean.

However why in method 2 would you do the handlers sequentially. In method 1 the ordering doesn't matter and you're fine with that.

E.g. Method 3: both handle the messages and the handlers in parallel.

Of course, here also, the ordering is unguaranteed.

Given that there is some result of the handlers, you might just store the results in an ordered list, this way restoring ordering eventually.

Toad 2010-06-13 10:20:37

No, because message 2 will not be processed until all handlers of message 1 (which happen to run in parallel) are done. ie, all threads are busy processing message 1 (or idle, if there are fewer handlers than threads). Or where is the hole in my logic?

Dan 2010-06-13 10:36:35

Well, if ordering doesn't matter, then method 2 or even method 3 are the better ones. Method 3 isn't good because it has the disadvantage of method 2 (unordered messages) as well as the disadvantages of method 1 (bad use of caching, small handlers have large overhead - the only thing thats improved is possible parallelism). As for re-ordering results later, that only works if the message handlers are side-effect free.

Dan 2010-06-13 14:41:52

I don't think order of handlers would ever matter, but order of messages does (or can - but its not in my control, since I'm using this for a plugin system).

Dan 2010-06-13 17:03:52

Answer 2

+1 A:

I Suppose it comes down to wether or not the order is important. If the order is unimportant you can go for method 2. If the order is important you go for method 1. Depending on what your application is supposed to do, you can still go for method 2, but use a sequence number so all the messages are processed in the correct order (unless of cause if it is the processing part you are trying to optimize).

Arkain 2010-06-13 11:33:04

Ordering is definitely desireable, since there are plenty of situations I can think of where ordering would make life much much easier. I definitely want to optimize performance, but I can accept a performance hit in exchange for other benefits (ease of use, asynchronous message handling to prevent a message handler from hogging resources, safety, whatever other benefits may be possible). I will look at other options, such as letting the senders or handlers decide how messages are dealth with, or by coming up with a more elaborate (and intelligent) method of scheduling handlers..

Dan 2010-06-13 14:46:14

Answer 3

+1 A:

If possible I would go for number two with some tweaks. Do you really need every message tp be in order? I find that to be an unusual case. Some messages we just need to handle as soon as possible, and then some messages need be processed before another message but not before every message.

If there are some messages that have to be in order, then mark them someway. You can mark them with some conversation code that lets the processor know that it must be processed in order relative to the other messages in that conversation. Then you can process all conversation-less messages and one message from each conversation concurrently.

Give your design a good look and make sure that only messages that need to be in order are.

Chris 2010-06-13 16:52:25

I was about to comment asking how to handle third-party modules imposing order on messages sent by other modules (which they may not have access to), but I figured it out: pass a conversation object along with the message to the handlers and allow them to add more messages to it. Messages in the same conversation are sequential, all others are processed in parallel like in #2 (or even the handlers are called within TBB tasks.. I guess task stealing means they run sequentially (making better use of cache) unless other threads are idle, in which case they run in parallel). Sound reasonable?

Dan 2010-06-13 17:01:19

Answer 4

+1 A:

I'd say do something even different. Don't send work to the threads. Have the threads pull work when they finish previous work.

Maintain a fixed amount of worker threads (the optimal amount equal to the number of CPU cores in the system) and have each of them pull sequentially the next task to do from the global queue after it finishes with the previous one. Obviously, you would need to keep track of dependencies between messages to defer handling of a message until its dependencies are fully handled.

This could be done with very small synchronization overhead - possibly only with atomic operations, no heavy primitives like mutexes or semaphores.

Also, if you pass a message to each handler by reference, instead of making a copy, having the same message handled simultaneously by different handlers on different CPU cores can actually improve cache performance, as higher levels of cache (usually from L2 upwards) are often shared between CPU cores - so when one handler reads a message into the cache, the other handler on the second core will have this message already in L2. So think carefully - do you really need to copy the messages?

slacker 2010-06-13 19:29:52

My message objects are immutable and shared between handlers, which is why running handlers sequentially (approach 2) is better for caching, since all handlers using the same message will then be running in the same thread and therefore able to share the cache. My goal has always been to share immutable objects. You gave me an idea though with task dependency handling. Might make for a convenient approach.

Dan 2010-06-14 01:39:06

ansaurus

tags:

views:

answers:

How to approach parallel processing of messages?

related questions