views:

163

answers:

1

I know there are three different, popular types of non-sql databases.

  • Key/Value: Redis, Tokyo Cabinet, Memcached
  • ColumnFamily: Cassandra, HBase
  • Document: MongoDB, CouchDB

I have read long blogs about it without understanding so much.

I know relational databases and get the hang around document-based databases like MongoDB/CouchDB.

Could someone tell me what the major differences are between these and the 2 former on the list?

+5  A: 

The main differences are the data model and the querying capabilities.

Key-value stores

The first type is very simple and probably doesn't need any further explanation.

Data model: more than key-value stores

Although there is some debat on the correct name for databases such as Cassandra, I'd like to call them column-family stores. Although key-value pairs are an essential part of Cassandra, it's not limited to just that. It allows you to nest key-value pairs, so a key could refer to multiple sub-key-value pairs.

You cannot nest key-value pairs indefinitely though. Your limited to three levels (column families) or four levels of nesting (super-column families). In case the term column family doesn't ring a bell, see the WTF is a SuperColumn article, it's a good explanation of Cassandra's data model.

Document databases, such as CouchDB and MongoDB store entire documents in the form of JSON objects. You can think of these objects as nested key-value pairs. Unlike Cassandra, you can nest key-value pairs as much as you want. JSON also supports arrays and understands different data types, such as strings, numbers and boolean values.

Querying

I believe column-family stores can only be queried by key, or by writing map-reduce functions. You cannot query the values like you would in an SQL database. If your application needs more complex queries, your application will have to create and maintain indexes in order to access the desired data.

Document databases support queries by key and map-reduce functions as well, but also allow you to do basic queries by value, such as "Give me all users with more than 10 posts". Document databases are more flexible in this way.

Niels van der Rest
So the key-value stores like redit doesn't allow you to store nested key:values? And from your description, then storing a whole database (from RDBMS) into Cassandra doesn't sound very clever cause it doesn't allow flexible query and has limited nesting depth, am I right?
never_had_a_name
@ajsie: Correct, key-value stores don't support nested key-value pairs. Most of them do support specialized values though, such as a lists. Cassandra is very different from an RDBMS, as both are designed to solve very different problems. RDBMS systems are aimed at relational data that need complex querying, whereas Cassandra is aimed at processing enormous amounts of mostly non-relational data. Of course it's *possible* to move an RDBMS database to Cassandra, but not very clever indeed. Each of them has its own use.
Niels van der Rest