views:

982

answers:

6

What is the closest thing like Hadoop, but in C++?

In particular, I want to do distributed computing using MapReduce.

Thanks!

A: 

BerkeleyDB is a key/value based database ( but it does not use the MapReduce algorithm).

Berkeley DB replication groups consist of some number of independently configured database environments. There is a single master database environment and one or more client database environments

: http://www.oracle.com/technology/documentation/berkeley-db/db/programmer_reference/rep.html

Pierre
How is this relevant to a question for a MapReduce-implementation in C++?
Leonidas
if what anon wants, is just an efficient key/value datastore handling replication, then he should have a look at BDB.
Pierre
+6  A: 

The original MapReduce implementation by Google is in C++, but unfortunately it is not made available to the public. That leaves you with the following options:

  1. Use Hadoop streaming (allows you to create and run Map/Reduce jobs with any executable/script)
  2. Try Boost.MapReduce (not yet part of the Boost Library and is still under development and review)

I recommend giving the first one a try. It works uses stdin/stdout as interface, and works like a charm.

Jørn Schou-Rode
+5  A: 

MongoDB http://www.mongodb.org/display/DOCS/Home:

MongoDB (from "humongous") is a scalable, high-performance, open source, schema-free, document-oriented database. Written in C++, MongoDB features: Replication and fail-over support... MapReduce for complex aggregation...

Pierre
+1  A: 

List of Map-Reduce implementations can be found at http://en.wikipedia.org/wiki/MapReduce#Implementations.

Some of these frameworks are implemented in C++. But if you only interested in writing map-reduce applications using C++, then Hadoop also has pipes. Pipes is library which allows C++ source code to be used for Mapper and Reducer code.

Cydork
A: 

Cloudstore migt be worth a look and its written in C++.

CloudStore (KFS, previously Kosmosfs) is Kosmix's C++ implementation of Google File System. It parallels the Hadoop project, which is implemented in Java.

http://en.wikipedia.org/wiki/CloudStore

Sundar
Official page: http://kosmosfs.sourceforge.net/about.html
Sundar
A: 

Sector/Sphere - distributed file system with integrated map-reduce framework.

Alexey