ansaurus

Question

Optimal DB structure for additional fields entity

Answer 1

A:

The only workaround (while retaining your strucure) is to have separate tables:

create table IntProps(...);
create table StringProps(...);
create table CurrencyProps(...);

But I do not think that this is a good idea...

Frank Krueger 2008-12-08 22:44:13

Answer 2

A:

One common approach is having the key-value table contain multiple columns, one for each data type, i.e. StringValue, DecimalValue, etc.

Just know you're trading queryability and performance for a database schema you don't need to change. You could also consider ORM mapping or an object database.

Cristian Libardo 2008-12-08 22:45:07

Answer 3

+2 A:

How about this instead... each sub-type gets its own DB table. And the base/super table just has a varchar column that holds the name of the sub-type DB table. Then you can have something like this...

Entity
------
ID
Name
Type
SubTypeName   (value of this column will be 'Dog')


Dog
---
VetName
VetNumber
etc

If you don't want your (sub-)table names to be varchar values in the base table, you can also just have a SubType table whose primary key will be in the base table.

Kon 2008-12-08 22:45:09

Answer 4

A:

You could have a per type key/value table. The available table would need to encode the availability of a specific key/type pair to point to the correctly typed key/value table.

This seems like a highly inefficient architecture in for a row based relational databases however.

Perhaps you should take a look at a column oriented relational database?

dgtized 2008-12-08 22:47:53

Answer 5

+8 A:

The design you're experimenting with is a variation of Entity-Attribute-Value, and it comes with a whole lot of problems and inefficiencies. It's not a good solution for what you're doing, except as a last resort.

What could be a better solution is what fallen888 describes: create a "subtype" table for each of your subtypes. This is okay if you have a finite number of subtypes, which sounds like what you have. Then your subtype-specific attributes can have data types, and also a NOT NULL constraint if appropriate, which is impossible if you use the EAV design.

One remaining weakness of the subtype-table design is that you can't enforce that a row exists in the subtype table just because the main row in the superclass table says it should. But that's a milder weakness than those introduced by the EAV design.

edit: Regarding your additional information about comments-to-any-entity, yes this is a pretty common pattern. Beware of a broken solution called "polymorphic association" which is a technique many people use in this situation.

Bill Karwin 2008-12-08 23:45:57

Answer 6

A:

Thanks for the answers. I'll explain a little bit more specifically what i need. There's a need to program a blog+forum website, and I've been looking at the WordPress DB structure.

There's a strong need for the ability to place comments to any kind of 'object', like a blog entry, or a video file attachment to it. The above DB structure being very easy to scale and to fulfill all our needs was the reason of its choice.

But that's not late to change it, cause this is in stage of early engineering. Also our model smells now like a completely tree-hierarchy based DB. For now I'll accept Bill Karwin's and fallen888 answers, but maybe I'm going in a totally wrong direction?

Igor Katson 2008-12-09 11:07:44

Answer 7

A:

about the user being able to add a new field to the table:

I admire all these people making comments.

I used to be interested in this kind of thing a few years ago, but have written little code recently (apart from a little bit of PHP and MYSQL).

I think it's fine if you want to keep going - you may end up with something new.

Sorry to pour any cold water on the scheme - I admire your efforts. My personal belief is that if you go far enough in this direction, you will end up with a system that interprets more of natural language than SQL does. (Around 1970, SQL was actually spelt Sequel, and it actually stood for "structured english query language", but after they standardized it in the 1970's - I think someone said that Oracle was the first commercial implementation, 19079, the "English" got dropped off, because I guess they decided that it was only a tiny subset of English.

I have run out of steam in this area, because I haven't got a job. Without an easy job that pays the bills, where I can experiment with these ideas, it's a bit hard to concentrate on this area.

Best wishes to all.

2009-01-31 14:34:51

Answer 8

A:

sorry, I wrote 19079 above, I meant the year 1979. Oracle got their first contract writing a database for the CIA.

2009-01-31 14:35:55

ansaurus

tags:

views:

answers:

Optimal DB structure for additional fields entity

related questions