Hello,

I am interested in the Apache Hadoop project, but I would like to know whether any other tested (please mind the 'tested') projects or frameworks are out there.

I would appreciate any information or links to projects similar to Apache Hadoop, and any comments on Apache Hadoop from anyone who has used it.

Regards,

+1  A: 

As mentioned in an answer to this question:

http://stackoverflow.com/questions/2168558/is-there-anything-like-hadoop-in-c

MongoDB might be something you could look at. It's a scalable database that allows MapReduce algorithms to be run against it.

Binary Nerd
A: 

Maybe. But none of them will have anywhere near the testing and real-world experience that Hadoop has. Companies like Facebook and Yahoo are paying to scale Hadoop, and I know of no similar open-source projects that are really worth looking at.

Steve
A: 

One possible way is to use org.apache.hadoop.hdfs.MiniDFSCluster and org.apache.hadoop.mapred.MiniMRCluster, which are used to test Hadoop itself.

They launch a small cluster locally. To test your program, point hdfs-site.xml and the related configuration files at the local cluster and add them to your classpath. The local cluster behaves just like any other cluster, only smaller. You can use hadoop/src/test/*-site.xml as templates.

For more examples, take a look at hadoop/src/test/.

Victor
+1  A: 

There are indeed open-source projects that use and build on Hadoop. See Apache Mahout for data mining: http://lucene.apache.org/mahout/

And are you aware of the other MR implementations available? http://en.wikipedia.org/wiki/MapReduce#Implementations
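In case the model those implementations share is unfamiliar, here is a minimal sketch of a MapReduce-style word count in plain Python. It runs in a single process and does not use Hadoop or any other framework; the function names (map_phase, shuffle, reduce_phase, word_count) are made up for illustration and are not part of any project's API.

```python
from collections import defaultdict


def map_phase(document):
    """Emit a (word, 1) pair for every word in one input document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group emitted values by key, as a framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    # Map: each document is handled independently, which is what a real
    # cluster would spread across many machines.
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    # Shuffle the pairs by key, then reduce each key on its own.
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["hadoop runs mapreduce", "mapreduce scales"]))
```

A real implementation adds what this sketch leaves out: distributing the map and reduce tasks across machines, moving the shuffled data between them, and recovering from node failures.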

Peter Wippermann