I'm toying with the idea of writing another framework to make it easier to develop "bread'n'butter" applications (e.g. create a class with N fields and get an editor for it for free, plus the DB persistence).

All data models can be converted into the Entity-Attribute-Value form:

TYPE VARCHAR(32)
ID LONG INT
NAME VARCHAR(32)
VALUE VARCHAR(64000)

with maybe a second table for really large fields, so the VALUE column would hold a reference to the entry in that BLOB table. If I were in the mood, I could create one table per value type (so an int would go into an INTEGER column, avoiding all the conversion problems), and I could use a table to define the valid TYPEs, etc.
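
To make that concrete, here is a minimal sketch of how one object would land in such a table (the table name EAV and the Person example are invented here; the columns are the ones listed above):

CREATE TABLE EAV (
    TYPE  VARCHAR(32),
    ID    BIGINT,          -- the LONG INT from the sketch above
    NAME  VARCHAR(32),
    VALUE VARCHAR(64000)
);

-- one Person instance with three fields becomes three rows
INSERT INTO EAV (TYPE, ID, NAME, VALUE) VALUES ('Person', 1, 'firstName', 'Ada');
INSERT INTO EAV (TYPE, ID, NAME, VALUE) VALUES ('Person', 1, 'lastName', 'Lovelace');
INSERT INTO EAV (TYPE, ID, NAME, VALUE) VALUES ('Person', 1, 'comment', 'only some instances carry this field');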

This would effectively free me from having to worry about the DB design since there isn't one. The database could adjust to any change in my model by using simple updates. I could even have instances of the same class with additional fields.

The drawback is that for each object I either need to read N rows or build complex queries that contain N subqueries or self-joins.
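
As an illustration of those queries, reading back just two fields of one object from the hypothetical EAV table above already needs a self-join per additional field (or an equivalent subquery):

-- every further field costs another join or subquery
SELECT f.VALUE AS first_name,
       l.VALUE AS last_name
  FROM EAV f
  JOIN EAV l
    ON l.TYPE = f.TYPE AND l.ID = f.ID AND l.NAME = 'lastName'
 WHERE f.TYPE = 'Person'
   AND f.ID = 1
   AND f.NAME = 'firstName';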

Does anyone have experience with this? Has anyone ever implemented a larger system this way? What other options are out there to persist data besides plain SQL? I'd especially like to hear about agile systems which adapt easily to changes in the model or which allow me to "patch" the model (usually an instance will have a name, but for some I'd also like to add a comment). Or has anyone encountered something post-SQL? The next big thing?

A: 

Take a look at XML databases (like eXist). You can easily change your "data model" by modifying the XML schema. And you can use powerful query languages like XPath and XQuery.

Kees de Kooter
+1  A: 

I haven't used it but what you're trying to do kind of sounds like CouchDB. You may want to look there before reinventing the wheel...

Jason Punyon
A: 

I have never based an entire application on this principle, but in almost all applications I do use some form of key-value pair collection to handle the edge cases where a specific entity requires additional properties that the other entities don't need.

I basically serialize the dictionary and store it like that in the database alongside my entity data. That's what I use for post-production patching when I have to deal with something too obscure to warrant a change to the entire model.

With the key-value pair data, I do store the type as well, so I can automatically render appropriate HTML controls. I have just basic types: text, multi-line, RTF, checkbox, number and date.
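
One way to picture this (purely illustrative - the CUSTOMER table, the EXTRA_PROPS column and the name|type|value serialization format are all invented here):

ALTER TABLE CUSTOMER ADD EXTRA_PROPS VARCHAR(64000);

-- each serialized entry carries name, type and value,
-- so the UI can pick the matching HTML control when rendering
UPDATE CUSTOMER
   SET EXTRA_PROPS = 'comment|rtf|Handle with care;birthday|date|1980-12-10'
 WHERE ID = 42;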

muerte
+1  A: 

This approach is used by Amazon SimpleDB. You define domains, and each domain has rows with a bunch of key/value pairs in them. This kind of data is known as 'semi-structured'.

This approach has some strengths. Like your idea, you do not need to define a database schema. You can introduce new tables ad hoc, add new columns on a per-row basis, and even have columns that hold more than one value (instead of creating a has_many relationship with an extra table). If your schema changes, you can introduce the changes gradually rather than forcing a migration.
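
Translated back into the EAV table sketched in the question (names assumed from there), a multi-valued attribute is simply repeated rows, whereas the conventional relational design would demand an extra table:

-- EAV style: one row per value, no schema change needed
INSERT INTO EAV (TYPE, ID, NAME, VALUE) VALUES ('Person', 1, 'email', 'ada@example.org');
INSERT INTO EAV (TYPE, ID, NAME, VALUE) VALUES ('Person', 1, 'email', 'countess@example.org');

-- classic relational style: a dedicated has_many table
CREATE TABLE PERSON_EMAIL (
    PERSON_ID BIGINT,
    EMAIL     VARCHAR(255)
);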

On the other hand, you're throwing away decades of development on the relational model. You will hemorrhage speed because your indexing will either be too general or non-existent. Aggregate operations (groups, joins) will be extremely slow. Query optimisation will be difficult, etc.
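
As a rough illustration of why aggregates hurt, an 'average salary per department' over the generic table from the question (assuming Employee rows with 'department' and 'salary' fields, all stored as text) would look roughly like this:

SELECT d.VALUE AS department,
       AVG(CAST(s.VALUE AS DECIMAL(12,2))) AS avg_salary  -- values must be cast back from VARCHAR first
  FROM EAV d
  JOIN EAV s
    ON s.TYPE = d.TYPE AND s.ID = d.ID AND s.NAME = 'salary'
 WHERE d.TYPE = 'Employee'
   AND d.NAME = 'department'
 GROUP BY d.VALUE;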

Both Amazon SimpleDB and Apache CouchDB deal with this issue by making their databases highly distributed. While this ensures reliability and redundancy, it has its own set of problems, such as conflict resolution and out-of-date data.

From your question you seem dead set on 'agile' methods, so I would recommend one of those two DB engines (depending on whether you'd rather pay Amazon - albeit not much - or build your own setup). They both allow a completely dynamic database schema. Just beware of the pitfalls.

rjh