views: 207
answers: 3

Hello,

I'm currently designing a multi-client / server application. I'm using plain good old sockets because WCF or similar technology is not what I need. Let me explain: it isn't the classical case of a client simply calling a service; all clients can 'interact' with each other by sending a packet to the server, which will then do some action and possibly re-dispatch an answer message to one or more clients. Although doable with WCF, the application would get pretty complex with hundreds of different messages.

For each connected client, I'm of course using asynchronous methods to send and receive bytes. I've got the messages fully working; everything's fine. Except that for each line of code I write, my head just burns because of multithreading issues. Since there could be around 200 clients connected at the same time, I chose to go the fully multithreaded way: each message received on a socket is immediately processed on the thread pool thread it was received on, not on a single consumer thread.

Since each client can interact with other clients, and indirectly with shared objects on the server, I must protect almost every object that is mutable. I first went with a ReaderWriterLockSlim for each resource that must be protected, but quickly noticed that there are more writes overall than reads in the server application, and switched to the well-known Monitor to simplify the code.

So far, so good. Each resource is protected, and I have helper classes that I must use to get a lock and its protected resource, so I can't use an object without getting a lock. Moreover, each client has its own lock that is entered as soon as a packet is received from its socket. This is done to prevent other clients from making changes to this client's state while it has messages being processed, which is something that will happen frequently.
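
To give an idea of what these helpers look like, here is a simplified sketch (the names and details are invented for illustration; it only shows the shape of the thing). The resource can only be reached through an acquired handle, so it's impossible to touch the object without holding its lock:

using System;
using System.Threading;

public sealed class Locked<T>
{
    private readonly object _gate = new object();
    private readonly T _resource;

    public Locked(T resource) { _resource = resource; }

    // Enter the monitor and hand back a disposable handle exposing the resource.
    public Handle Acquire()
    {
        Monitor.Enter(_gate);
        return new Handle(this);
    }

    public sealed class Handle : IDisposable
    {
        private readonly Locked<T> _owner;
        internal Handle(Locked<T> owner) { _owner = owner; }

        public T Resource { get { return _owner._resource; } }

        public void Dispose() { Monitor.Exit(_owner._gate); }
    }
}

// Usage:
// using (var handle = lockedClients.Acquire())
//     handle.Resource.Add(newClient);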

Now, I don't just need to protect resources from concurrent accesses. I must keep every client in sync with the server for some collections I have. One tricky part that I'm currently struggling with is the following:

  • I have a collection of clients. Each client has its own unique ID.
  • When a client connects, it must receive the IDs of every connected client, and each one of them must be notified of the newcomer's ID.
  • When a client disconnects, every other client must know it so that its ID is no longer valid for them.
  • Every client must always have, at any given time, the same clients collection as the server, so that I can assume that everybody knows everybody. This way, if I'm sending a message to client #1 saying "Client #2 has done something", I know it will always be correctly interpreted: Client 1 will never wonder "but who is Client 2 anyway?".

My first attempt at handling the connection of a new client (let's call it X) was this pseudo-code (remember that newClient is already locked here):

lock (clients) {                        // the collection lock is taken first here...
  foreach (var client in clients) {
    lock (client) {                     // ...then each individual client's lock in turn
      client.Send("newClient with id X has connected");
    }
  }
  clients.Add(newClient);
  newClient.Send("the list of other clients");
}

Now imagine that at the same time, another client has sent a packet that translates into a message that must be broadcast to every connected client. The pseudo-code looks something like this (remember that the current client - let's call it Y - is already locked here):

// Y's own lock is already held when we get here
lock (clients) {                        // ...so this waits if another thread already holds the collection lock
  foreach (var client in clients) {
    lock (client) {
      client.Send("something");
    }
  }
}

An obvious deadlock occurs here: on the first thread, X's lock is held, the clients lock has been entered and the loop over the clients has started; at some point it must acquire Y's lock... which is already held by the second thread, itself waiting for the clients collection lock to be released!

This is not the only case like this in the server application. There are other collections that must be kept in sync with the clients, some properties on a client can be changed by another client, etc. I tried other types of locks, lock-free mechanisms and a bunch of other things. Either there were obvious deadlocks when I used too many locks for safety, or obvious race conditions otherwise. When I finally find a good middle ground between the two, it usually comes with very subtle race conditions, deadlocks and other multithreading issues... my head hurts very quickly, since for every single line of code I write I have to review almost the whole application to ensure everything will behave correctly with any number of threads.

So here's my final question: how would you resolve this specific case, and the general case, and more importantly, am I going the wrong way here? I have few problems with the .NET framework, C#, simple concurrency or algorithms in general. Still, I'm lost here. I know I could use only one thread to process the incoming requests and everything would be fine. However, that won't scale well at all with more clients... But I'm thinking more and more about going this simple way. What do you think?

Thanks in advance to you, StackOverflow people who have taken the time to read this huge question. I really had to explain the whole context if I wanted to get some help.

+4  A: 

If you're having problems with locking, race conditions, etc. due to the multi-threaded nature of your app, it would be hard for anyone to give an instant solution. These kinds of problems can be very intermittent and cannot always be easily reproduced, which makes them hard to diagnose even for someone sitting right in front of all of the code. But I will offer an alternative: consider using some kind of message queue as your publish-subscribe backbone. Such an architecture can help simplify a lot of your boilerplate code. As I said, this might or might not solve your problem instantly, but it hopefully shares a different approach with you.
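
To make the idea concrete, here is a rough, untested sketch of what that backbone could look like in C#, assuming .NET 4's BlockingCollection is available (all names here are invented for illustration). Every receive callback only enqueues work; one dedicated thread drains the queue, so all message handlers run one at a time, in order, and the shared state needs no further locking:

using System;
using System.Collections.Concurrent;
using System.Threading;

public class MessageBackbone
{
    // Thread-safe queue of pending work items posted by the socket callbacks.
    private readonly BlockingCollection<Action> _inbox = new BlockingCollection<Action>();

    public MessageBackbone()
    {
        // A single dedicated consumer thread processes messages in order.
        var worker = new Thread(() =>
        {
            foreach (var handler in _inbox.GetConsumingEnumerable())
                handler();
        });
        worker.IsBackground = true;
        worker.Start();
    }

    // Called from any receive callback / thread pool thread.
    public void Post(Action handler)
    {
        _inbox.Add(handler);
    }
}

// Usage from a receive callback, e.g.:
// backbone.Post(() => BroadcastToAll("newClient with id X has connected"));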

Khnle
+1 I was going to suggest the same approach :)
Cory Grimster
+1 Me too! But just to add a little more detail, the basic mechanism is that you add incoming messages to a thread-safe queue, then you have a dedicated thread that processes the messages in the queue. This way you get single-threaded in-order message processing, which simplifies things. Obviously you can get pretty fancy with this, but getting it working correctly to begin with will stop your head burning. :)
chibacity
Agree - this seems like a classic pub-sub architecture. Reimplementing it all yourself is a challenging task. Fun, but challenging. Have you considered using the System.Messaging namespace for this, or some other pub/sub solution like AMQP?
Rob Goodwin
pub/sub seems to be way more elaborate than what I anticipated; currently googling more about the subject.
Julien Lebosquain
+1  A: 

I really don't know anything about .NET, but I can share my few experiences with asynchronous programming in the C and Linux world.

First of all, take this with gallons and gallons of salt, but: using threads (rather than processes) is often a bad idea. Processes only share the information you want to share (through message passing), while threads share everything. Because it isn't safe for every thread to touch every object, you end up having to explicitly guard whatever is shared with locks and whatnot. Working with processes is often easier because you only have to specify what you do share. I can't remember where I read this, but someone compared multithreaded programming to the style of programming you'd have to follow on a system without memory management (e.g. DOS) or in an operating system kernel. That type of programming is often unnecessary in userspace because the OS and MMU (memory management unit) take care of the isolation for you.

One example of a large, asynchronous program that doesn't use threads is PostgreSQL. In fact, threading is listed under "Features We Do Not Want" on its Todo list (see here). Granted, there may be cases now or in the future where threads could speed up tasks (because they're cheaper to instantiate than processes), but they aren't (and won't be any time soon) the main vehicle of asynchronous programming in PostgreSQL.

An alternative to threads and processes is to simply use one thread and one process, but have an event loop and quick handlers. However, drawbacks to this approach include:

  • Your code has to be chopped up into pieces that don't sleep. Instead of calling a function that simply downloads a URL and returns a result, you have to supply a callback for when the result is ready and also have your main loop respond to events related to downloading a URL (e.g. a single packet arrived).
  • You might not be able to avoid sleeping, or it may be unduly difficult.
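
As a rough illustration of that chopping-up, here is a small C# example (take it with the same salt as the rest; WebClient is used purely as an illustrative stand-in for "download a URL"):

using System;
using System.Net;

class CallbackStyleExample
{
    static void Main()
    {
        var client = new WebClient();

        // Blocking style: this call would sleep right here until the whole
        // page had arrived, stalling everything else on this thread.
        // string page = client.DownloadString("http://example.com/");

        // Chopped-up style: register a callback for "the result is ready"
        // and return immediately; the handler runs later, when the event fires.
        client.DownloadStringCompleted += (sender, e) =>
            Console.WriteLine("Got {0} characters", e.Result.Length);
        client.DownloadStringAsync(new Uri("http://example.com/"));

        Console.WriteLine("Download started; free to do other work meanwhile...");
        Console.ReadLine(); // keep the process alive until the callback has run
    }
}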

I would recommend the single-process, single-thread approach for a relatively simple daemon. However, if the role of that daemon starts getting large and the code gets complicated, it may be time to split it up into separate processes.

Joey Adams
Erlang comes to mind. Scalable lightweight processes that communicate using message passing semantics.
chibacity
While your answer is 100% correct and I upvoted it, in my case almost every object is shared because of the way clients can modify almost everything. The few resources that don't need sharing are either immutable or accessed by a single thread.
Julien Lebosquain
@Julien Lebosquain Consider a database server and its clients. The clients can arguably modify "just about anything" on the database. However, the client and server are well-separated and talk to each other through a protocol. Of course, it would be a lot of needless work to implement a SQL interface between your clients and server. You could use a simple encoding indicating the intention of the client to the server. Perhaps .NET even lets you pass functions bound to parameters (closures) to/from client and server.
Joey Adams
You're right, but the big difference here is that a database server doesn't push information back to the other connected clients when one of them has updated something. It just waits for another request from the clients, which is not applicable here since every client has to be kept up-to-date.
Julien Lebosquain
@Julien Lebosquain Au contraire. PostgreSQL, for one, has `NOTIFY` and `LISTEN` which can be used to wake up clients.
Joey Adams
+2  A: 

I mentioned Erlang in a previous comment and also queued message processing in another. Erlang is designed from the ground up to support highly concurrent, shared-nothing, message passing style systems.

http://en.wikipedia.org/wiki/Erlang_(programming_language)

Although I have never used it in anger, I've read the book (Programming Erlang) and really like the simple beauty of the concurrent message-passing approach it embodies. After doing a fair amount of complex multi-threaded development, I can appreciate the challenges that Erlang seeks to solve, i.e. the complexities of shared resources and synchronization.

There is a C# project that seeks to embody the concepts of Erlang - Retlang:

http://code.google.com/p/retlang/wiki/GettingStarted

I've never used it, but the message-passing approach is definitely a good one and could be a nice fit for what you are trying to achieve.
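
To give a flavour of the shared-nothing mailbox idea in plain C# (a crude approximation only; Retlang's fibers and channels do this properly, and everything below is invented for illustration): each client is modelled as a "process" that owns its own state and its own mailbox, and the only way to interact with it is to send it a message, so no locks are involved:

using System;
using System.Collections.Concurrent;
using System.Threading;

// A crude Erlang-style "process": private state plus a mailbox,
// both owned by a single thread. Other code can only post messages.
public class ClientProcess
{
    private readonly BlockingCollection<string> _mailbox = new BlockingCollection<string>();
    private readonly Thread _thread;

    public ClientProcess(string id)
    {
        _thread = new Thread(() =>
        {
            foreach (var message in _mailbox.GetConsumingEnumerable())
            {
                // React to the message against this process's private state.
                Console.WriteLine("{0} received: {1}", id, message);
            }
        }) { IsBackground = true };
        _thread.Start();
    }

    // The only way other threads interact with this process.
    public void Send(string message)
    {
        _mailbox.Add(message);
    }
}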

chibacity
Thanks, investigating Erlang and Retlang right now.
Julien Lebosquain
Erlang is awesome. I'm currently writing some POCs and it seems to perfectly suit my needs. Many thanks.
Julien Lebosquain