clustering

What algorithms do "the big ones" use to cluster news?

I want to cluster texts for a news website. At the moment I use this algorithm to find the related articles. But I found out that PHP's similar_text() gives very good results, too. What sort of algorithms do "the big ones", Google News, Topix, Techmeme, Wikio, Megite etc., use? Of course, you don't know exactly how the algorithms work....

Monitor MSMQ

I'm trying to monitor a remote private msmq in a Server 2003 cluster with the MSMQ Local Admin API and the mgmtinfo.exe utility. I can list the queues on the clustered resource with the command: mgmtinfo.exe server-name machine but cannot get information about the individual queues. I'm having difficulty crafting a correct format name...

How to distribute a program on an unreliable cluster?

What I'm looking for is any/all of the following: automatic discovery of worker failure (computer off for instance) detection of all running (linux) PCs on a given IP address range (computer on) ... and auto worker spawning (ping+ssh?) load balancing so that workers do not slow down other processes (nice?) some form of message passing ...

Adding a Node to a SQL Server 2008 Cluster Returns "The current SKU is invalid." Error

When I try to add a second node to my SQL Server 2008 cluster the installer throws an error, "The current SKU is invalid." What's up I know the key is correct, I used it on the first node? ...

OpenMQ 4.2 cluster stops delivering messages

Hi. We have a system running an application connecting to a cluster of OpenMQ brokers (non-HA). It's been running for about a week without restart. This week we noticed the cluster stopped delivering messages to the consumers (I believe these were the messages on second broker). After checking it in the morning we saw it did not deliver ...

Looking for approaches to categorize objects based on their properties.

I have a set of ~10K objects, each with approximately 150 distinct properties, about a quarter of which are multivalued and/or related to other properties. I have a set of about 120 categories that I would like to sort these objects into, with each category being defined as a 'template' object. If an instance matches the template exact...

Converting a parallel program to a cluster program. From OpenMP to ?

I want to write a code converter that takes an OpenMP based parallel program and runs it on a cluster. How do I go about this problem? What libraries do I use? How do I set up a small cluster for this? I'm finding it extremely hard to find good material about cluster computing on the internet. EDIT: If it's impossible then how does In...

How to configure EJB lookup in clusterd websphere environment.

Hello, I'm developing a EJB client. The EJB (2.1) server in deployed as Websphere 6.0 cluster. I'm doing the jndi lookup after acquirement of the InitialContent object for specific ip address using the following code: Hashtable env = new Hashtable(); env.put(Context.INITIAL_CONTEXT_FACTIORY,com.ibm.websphere.naming.WsnInitialContextFac...

[Java] Cluster Shared Cache

Hi everyone. I am searching for a java framework that would allow me to share a cache between multiple JVMs. What I would need is something like Hazelcast but without the "distributed" part. I want to be able to add an item in the cache and have it automatically synced to the other "group member" cache. If possible, I'd like the cache ...

Clustering Coordinates on Server Side

What is the most efficient way of clustering coordinates on the server side? Examples? Here is an example of exactly what I need (although I can't use third party vendors) http://www.crunchpanorama.com/ PostgreSQL, Python platform, Google Maps. ...

PHP sessions in a load balancing cluster - how?

OK, so I've got this totally rare an unique scenario of a load balanced PHP website. The bummer is - it didn't used to be load balanced. Now we're starting to get issues... Currently the only issue is with PHP sessions. Naturally nobody thought of this issue at first so the PHP session configuration was left at its defaults. Thus both s...

scalable sites

facebook, skype, myspace etc... all have millions and millions of users, does anyone know what their architecture looks like. Is it distributed on different nodes or do they use massive clusters? ...

How does DB clustering work?

Hello I have a question for the DBA's out there: If I scale from a single web/DB server setup to two web/two DB server setup with a load balancer in front of the web servers to route incoming queries evenly... how do solutions like MySQL Cluster work so that a change made to one DB server is immediately known to the other (otherwise, us...

Linux HA - how to detect a node is dirty wrt a resource?

Some background: on Linux HA, resources and groups have scores, i.e. score=(preference + stickness - failure stickness*fail counter). When a backup node has score greater than that of the current node, the resource will move to the backup node. I need to check, for a given resource/node, if that score is negative. It is easy to get fai...

Tomcat Clustering and HTTPS Issue

Hi I have two instances of Tomcat 6 with content accessible via HTTP and HTTPS for other pages. I have configured the instances this way: 1) Instance one to listen on port 8080(Http) and 8443(Https) 2) Instance two to listen on port 7080(Http) and 7443(Https) I have mod_proxy configured with Apache 2.2 to do clustering. The re...

Synchronization with master database

I have a 2 databases hosted on the different server. What the best thing do, to copy all the contents of master table database to the table of slave database? I am not the owner of the master database but they are willing to give an access. Before the data from master database is outputted via RSS and my PHP script parse it to insert int...

how to get group score in Linux HA?

I can get resource score with something like this: ptest -L -s -VV 2>&1 | grep "_color: resource.node" but for groups this seems to get me "0" even if a resource in a group has failed enough times to get the group move to another node. How can I get score for groups? Thanks! ...

Developing for SQL Server Cluster

Hello If I am developing for an environment that will make use of SQL Server clustering, what (if anything) should I be aware of? From what I've seen, I should handle an exception on a database operation and have two options: Display a 'retry' to the user. By the time the user retries the failover node should have kicked in. Wait a ...

SQL Server 2000 Index - Clustered vs Non Clustered

I have inherited a database where there are clustered indexes and additional duplicate indexes for each of the clustered index. i.e IX_PrimaryKey is a clustered index on the column ID. IX_ID is a non clustered index on the column ID. I want to clean up these duplicate non clustered indexes and I wanted to check to see if anyone could t...

Rails Subdomain Clustering

Hello all, I am about to be writing a Ruby on Rails app which will use sub-domains to authenticate users. We will have two types of accounts: user accounts domain accounts Users will thus be able to belong to multiple domain accounts using the same credentials. I hope to have the ability for a domain account administrator to be able...