I am currently researching the options that are available (both Open Source and Commercial) for developing a distributed application.

"A distributed system consists of multiple autonomous computers that communicate through a computer network." Wikipedia

The application is focused on distributing highly CPU-intensive operations (as opposed to data-intensive ones), so I'm sure MapReduce solutions don't fit the bill.

Any framework that you can recommend (plus a brief summary of any experience or comparison to other frameworks) would be greatly appreciated.

Thanks.

+5  A: 

MPAPI Framework - http://www.codeplex.com/MPAPI

My Thoughts

  • Seems to be one of the simpler solutions.
  • I see no evidence of failover support.
  • Is open source =] and is Mono compatible, yay!
  • Uses the "RemotingLite" framework, hence the Mono compatibility.
  • Writing single-computer or multiple-computer (cluster) applications is equally simple.
  • The developer's philosophy is "Performance is Paramount", so it's safe to say he has optimised it.
  • Continuously updated framework.
StevenH
+1  A: 

Azure Grid - http://azuregrid.codeplex.com/

StevenH
+1  A: 

Alchemi ".NET Grid Computing Framework"

http://www.gridbus.org/~alchemi/publications.html

http://sourceforge.net/projects/alchemi/files/

Alchemi is a .NET grid computing framework that allows you to painlessly aggregate the computing power of intranet and Internet-connected machines into a virtual supercomputer (computational grid) and to develop applications to run on the grid.

Quick Notes:

  • Stopped being updated in 2007, at version 1.0.6 (.NET 2.0).
  • Well documented, and formally so.
  • Uses XML to define jobs. (It was all the rage in 2007 to use XML for everything.)
  • Relatively complex compared to MPAPI, but don't take this as a big negative.
  • Would take a good bit of time to get up to speed.
  • Is Open Source =).
  • Is extensible and can be used in conjunction with other cluster technologies via a Gridbus broker (http://www.gridbus.org).
  • Seems widely used (see publications http://www.gridbus.org/~alchemi/publications.html) =)
StevenH
A: 
StevenH
+1  A: 

DryadLINQ is a Microsoft Research project to allow LINQ queries to be distributed among multiple computers, rather than just multiple cores as Parallel LINQ does.

Blog post with a brief example.
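
To make the "multiple cores vs. multiple computers" distinction concrete, here is a rough, language-agnostic sketch of the multi-core half of the idea, written in Python rather than .NET. It is not DryadLINQ or Parallel LINQ code; the function names (`count_primes`, `split_range`, `count_primes_parallel`) and the prime-counting workload are made up for illustration. The point is only the shape of the approach: split a CPU-heavy job into independent chunks, hand the chunks to a pool of workers, and combine the partial results. A framework that spans machines does the same thing, but with workers on other computers.

```python
from concurrent.futures import Executor, ProcessPoolExecutor
from math import isqrt
from typing import List, Optional, Tuple


def is_prime(n: int) -> bool:
    """Deliberately naive trial division, so the work is CPU-bound."""
    if n < 2:
        return False
    for d in range(2, isqrt(n) + 1):
        if n % d == 0:
            return False
    return True


def count_primes(bounds: Tuple[int, int]) -> int:
    """Count primes in the half-open range [lo, hi). One chunk of work."""
    lo, hi = bounds
    return sum(1 for n in range(lo, hi) if is_prime(n))


def split_range(lo: int, hi: int, parts: int) -> List[Tuple[int, int]]:
    """Split [lo, hi) into at most `parts` contiguous chunks."""
    step = max(1, -(-(hi - lo) // parts))  # ceiling division
    return [(start, min(start + step, hi)) for start in range(lo, hi, step)]


def count_primes_parallel(
    lo: int, hi: int, parts: int = 8, executor: Optional[Executor] = None
) -> int:
    """Map count_primes over the chunks and sum the partial counts.

    If no executor is given, a process pool (one worker per core by
    default) is created for the call and shut down afterwards.
    """
    chunks = split_range(lo, hi, parts)
    if executor is not None:
        return sum(executor.map(count_primes, chunks))
    with ProcessPoolExecutor() as pool:
        return sum(pool.map(count_primes, chunks))
```

When calling `count_primes_parallel` without an executor from a script, put the call under `if __name__ == "__main__":`, since some platforms start worker processes by re-importing the main module.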

Joel Mueller
A: 

AWS SDK for .NET - http://aws.amazon.com/sdkfornet/

I think this cloud offering and its .NET API may also fit the bill. Amazon offer high-performance machines (see price list), which would allow you to delegate the execution of highly CPU-intensive operations on a large scale.

StevenH
+2  A: 

Appistry CloudIQ Platform http://www.appistry.com

  • Will make hundreds of machines appear as one computing resource.
  • Currently used in production at FedEx to run routing algorithms
  • Allows you to write apps in C, C++, Java and .NET
  • No single point of failure
  • Auto recover/retry of failed tasks
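
For anyone unsure what "auto recover/retry of failed tasks" buys you, this is roughly the logic you would otherwise write by hand around every unit of work. It is a minimal Python sketch, not Appistry's API: `run_with_retry` and its parameters are hypothetical names chosen for the example, and a real platform would also reschedule the task on a different machine rather than just calling it again.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def run_with_retry(
    task: Callable[[], T], max_attempts: int = 3, delay_seconds: float = 0.0
) -> T:
    """Run `task`, retrying on any exception up to `max_attempts` times.

    Returns the task's result on the first success. If every attempt
    fails, raises RuntimeError chained to the last underlying error.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")

    last_error: Optional[Exception] = None
    for attempt in range(1, max_attempts + 1):
        try:
            return task()
        except Exception as error:  # hypothetical policy: retry on anything
            last_error = error
            if attempt < max_attempts:
                time.sleep(delay_seconds)  # pause before the next attempt

    raise RuntimeError(f"task failed after {max_attempts} attempts") from last_error
```

Retrying like this is only safe when the task is idempotent, i.e. running it twice has the same effect as running it once, which is usually true of pure CPU-bound computations.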
Brett McCann
+1  A: 

Hi all,

I was one of the lead developers on Alchemi. We've had to change course and start fresh, and the result is the Utilify Platform. Conceptually similar to Alchemi, the Utilify Platform builds on the lessons learnt from Alchemi and provides a more flexible programming model for building distributed .NET applications on a Windows network.

Please visit www.utilify.com for more information. Currently, there is a free beta version available for download.

Regards, Krishna.

Krishna