distributed-computing

Using Blitz implementation of JavaSpaces

I have great doubts about this forum, but I am willing to be pleasantly surprised ;) Kudos and great karma to those who get me back on track. I am attempting to use the blitz implementation of JavaSpaces (http://www.dancres.org/blitz/blitz_js.html) to implement the ComputeFarm example provided at http://today.java.net/pub/a/today/2005/0...

Developing a distributed system as a grid

Has anyone had experience with developing a distributed system as a grid? By grid, I mean, a distributed system where all nodes are identical and there is no central management, database etc. How can the grid achieve even distribution of: CPU, Memory, Disk, Bandwidth etc.? ...

Best distributed filesystem for commodity linux storage farm

I have a lot of spare intel linux servers laying around (hundreds) and want to use them for a distributed file system in a web hosting and file sharing environment. This isn't for a HPC application, so high performance isn't critical. The main requirement is high availability, if one server goes offline, the data stored on it's hard dr...

How to send messages between Companies

I often see myself in the need of setting up automated transfer of messages/data/files between companies. Besides the monstrous EDIFACT, Odette and a few other "big" electronic data exchange standards every industry has a few dozen or even e few hundred relatively sane relatively light weight, relatively well documented file formats for...

Anyone using JavaSpaces technology?

I am just wondering if there is real practical use of JavaSpaces technology out there and how exactly its implemented. Can someone who has worked with the tech help. ...

What is the best architecture for multiple game servers that need to talk to each other?

The game is a low graphic (SVG) strategic game. Each server represents a Game Domain with its players. All servers should be able to talk to each other, as players can move (in the game) from domain to domain/send "diplomatic messengers" etc.. We have chosen this idea as it enables us to enlarge the world map endlessly, it enables secon...

Persistent, Distributed and Replicated Key-Value Blob Store

Does anyone have any suggestions on a persistent, distributed and replicated key-value blob store? I've seen this list of key value stores. But they all seem to be for storing small blob values. The blobs I am dealing with range from 2K to 300M in size. What I need is something more like to mogilefs, but mogilefs is optimized for W...

Performances evaluation with Message Passing

Hi, I have to build a distributed application, using MPI. One of the decision that I have to take is how to map instances of classes into process (and then into machines), in order to take maximum advantages from a distributed environement. My question is: there is a model that let me choose the better mapping? I mean, some arrangement...

How do I control output files name and content of an Hadoop streaming job?

Is there a way to control the output filenames of an Hadoop Streaming job? Specifically I would like my job's output files content and name to be organized by the ket the reducer outputs - each file would only contain values for one key and its name would be the key. Update: Just found the answer - Using a Java class that derives from ...

Helper library for distributed algorithms programming?

When you code a distributed algorithm, do you use any library to model abstract things like processor, register, message, link, etc.? Is there any library that does that? I'm thinking about e.g. self-stabilizing algorithms, like self-stabilizing minimum spanning-tree algorithms. ...

How to distribute a program on an unreliable cluster?

What I'm looking for is any/all of the following: automatic discovery of worker failure (computer off for instance) detection of all running (linux) PCs on a given IP address range (computer on) ... and auto worker spawning (ping+ssh?) load balancing so that workers do not slow down other processes (nice?) some form of message passing ...

Publishing performance counters within Windows-based async-messaging application?

So I'm involved in building an application that's using an async messaging architecture. One of the requirements is monitoring performance from a central location; requests/sec at each node, requests/sec processed by series of nodes, etc. This is being built with .NET within a Windows-based platform, distributed across several machines...

EagerReadDerivation: balancing the benefits with the costs

I'm studying Domain-Driven Design and Distributed DDD for an upcoming Silverlight application I'm going to be working on. The EagerReadDerivation pattern seems like it would improve the scalability of the application, but at the cost of increased complexity. The application will have potentially have thousands of users uploading large ...

suggestions on a project in C++ / distributed systems / networks

I'd like to work on a 2-3 month long project (full time) that involves coding in C++ and is related to networks (protocol stacks). I was considering writing my own network stack but that doesn't seem as interesting. It would be great to find an idea to implement a tcp/ip-like stack for distributed system/GPUs that is better as far as net...

What is a good hardware setup for programming concurrent and distributed applications?

I don't have the money to build my own uber Blade system but I would like to get into concurrent and distributed programming (think CCR/DSS, Hadoop, Project Voldemort etc.). I currently have a Q6600 with 4GB with some separate hdds but that's about it. While I can write multi-threaded programs I can not properly test distributed filesys...

C++ distributed programming

Is there any library for distributed in-memory cache, distributed tasks, publish/subscribe messaging? I have used Hazelcast in Java, I would like something similar. I know that Memcached is an in-memory cache and even distributed, but it is missing the messaging and remote task. I just need something to coordinate a cluster of server w...

Learning more about distributed computing

I'm interested in learning more about distributed computing and how to do it - mostly in C++ but I'd be interested in C# as well. Can someone please recommend some resources? I know very little to nothing about the topic so where should I start? Thanks. ...

Ehcache performance on a large cluster

I would like to use Ehcache replicated cache, first as the backend to Hibernate second level cache, second as a cache for any data. I know how a distributed cache like memcached is working, and I know it can scale to large clusters, but I cannot find how Ehcache replication behaves on large clusters. Has someone a pointer to some info...

.net distributed grid computing migration, recommendations on libraries, architecture

I have a c# multi-threaded monte carlo simulation, the application is already structured such that it can be partitioned by into Tasks that execute independently, a TaskController executes Tasks, aggregates intermediate results, checks for convergence (early termination criteria) then returns final results, this is currently implemented ...

Javascript Distributed Computing

Why aren't there any Javascript distributed computing frameworks / projects? The idea seems absolutely awesome to me because: The Client is the Browser Iteration can be done with AJAX Webmasters could help projects by linking the respective Javascript Millions or even billions of users would help DC projects without even noticing Ple...