ansaurus

Question

Translating a MySQL data/query-set into the equivalent Cassandra representation

Answer 1

+2 A:

you have two options:

(1) is sort of traditional: have one CF (columnfamily) with your foo objects, one row per foo, one column per field. then create two index CFs, where the row key in one is the string values, and the row key in the other is lookup_id. Columns in the index rows are foo ids. So you do a GET on the index CF, then a MULTIGET on the ids returned.

Note that if you can make id the same as lookup_id then you have one less index to maintain.

High-level clients like Digg's lazyboy (http://github.com/digg/lazyboy) will automate maintaining the index CFs for you. Cassandra itself does not do this automatically (yet).

(2) is like (1), but you duplicate the entire foo objects into subcolumns of the index rows (that is, the index top-level columns are supercolumns). If you're not actually querying by the foo id itself, you don't need to store it in its own CF at all.

jbellis 2010-03-10 23:01:15

Can you provide a link on how to create an index column family on Cassandra? Or is it just the simple ColumnFamily but with ids instead of actual objects ?

Ritesh M Nayak 2010-05-11 10:25:30

ansaurus

tags:

views:

answers:

Translating a MySQL data/query-set into the equivalent Cassandra representation

related questions