ansaurus

Question

Object-oriented-like structures in relational databases

Answer 1

+3 A:

The approach you suggest seems justified to me. You can add an actortype column to your actor-base table to differentiate between the different types of actors. The PK of each specific actor table would be an FK to the actorbase table, to avoid 'hairy' queries and to emulate the inheritance-like 'is-a' relationship.
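A minimal sketch of that layout, assuming SQLite through Python's sqlite3 module. The table and column names (actor_base, worker, employer, company) and the add_worker helper are hypothetical, chosen only to match the discussion; the contact table is left out for brevity.

```python
import sqlite3

# Hypothetical schema: one base table plus one table per actor type.
SCHEMA = """
CREATE TABLE actor_base (
    actorid   INTEGER PRIMARY KEY,
    actortype TEXT NOT NULL
              CHECK (actortype IN ('worker', 'employer', 'contact'))
);
-- The child's primary key is also a foreign key into actor_base ("is-a").
CREATE TABLE worker (
    actorid INTEGER PRIMARY KEY REFERENCES actor_base(actorid),
    name    TEXT NOT NULL
);
CREATE TABLE employer (
    actorid INTEGER PRIMARY KEY REFERENCES actor_base(actorid),
    company TEXT NOT NULL
);
"""


def make_db():
    conn = sqlite3.connect(":memory:")
    # SQLite does not enforce foreign keys unless asked to.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(SCHEMA)
    return conn


def add_worker(conn, name):
    # Insert the base row first, then reuse its generated id in the child table.
    cur = conn.execute("INSERT INTO actor_base (actortype) VALUES ('worker')")
    conn.execute(
        "INSERT INTO worker (actorid, name) VALUES (?, ?)",
        (cur.lastrowid, name),
    )
    return cur.lastrowid
```

Note that nothing in this sketch stops the same actorid from appearing in both worker and employer; keeping the subtypes disjoint is a convention here, or needs an extra constraint.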

Manu 2009-03-01 22:43:39

Works somewhat, except that the reverse association query becomes hairy: "show me the NAMES of all workers that i've sent communications to in the last month"

Alex 2009-03-01 22:48:51

select names from communications c inner join actor_base ab on c.actorid=ab.actorid where ab.actortype='worker'

Manu 2009-03-01 22:53:23

and c.date between dateadd(m,-1,getdate()) and getdate()

Manu 2009-03-01 22:54:45

What is hairy about that???

Manu 2009-03-01 22:55:20

Imprecise definition, I apologize. Let's say that the Name property does NOT exist in all three actor types (only in the Worker type). You can't SELECT names from actor_base. It has to be from worker. Doable, but with a nested query... That's what I call "hairy" here.

Alex 2009-03-01 23:01:56

select name from workers w inner join communications c on c.actorid=w.actorid where c.date between...

Manu 2009-03-01 23:07:43

The point being that the PK of the workers table is an FK to the actorbase table
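For what it's worth, here is a small runnable sketch of that query, assuming SQLite via Python's sqlite3 module instead of the SQL Server style getdate()/dateadd() calls used above. The sample rows, the commid column, and storing dates as ISO-8601 text are all assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE actor_base (actorid INTEGER PRIMARY KEY, actortype TEXT NOT NULL);
CREATE TABLE workers (
    actorid INTEGER PRIMARY KEY REFERENCES actor_base(actorid),
    name    TEXT NOT NULL
);
CREATE TABLE communications (
    commid  INTEGER PRIMARY KEY,
    actorid INTEGER NOT NULL REFERENCES actor_base(actorid),
    date    TEXT NOT NULL  -- ISO-8601 date stored as text
);
INSERT INTO actor_base VALUES (1, 'worker'), (2, 'employer'), (3, 'worker');
INSERT INTO workers VALUES (1, 'Ann'), (3, 'Bob');
INSERT INTO communications (actorid, date) VALUES
    (1, date('now', '-3 days')),    -- recent, to a worker
    (2, date('now', '-3 days')),    -- recent, but to an employer
    (3, date('now', '-90 days'));   -- to a worker, but too old
""")

# Joining straight from workers to communications works because both
# actorid columns draw their values from actor_base; no nested query needed.
RECENT_WORKER_NAMES = """
SELECT DISTINCT w.name
FROM workers w
INNER JOIN communications c ON c.actorid = w.actorid
WHERE c.date BETWEEN date('now', '-1 month') AND date('now')
ORDER BY w.name
"""

names = [row[0] for row in conn.execute(RECENT_WORKER_NAMES)]
print(names)
```

Only the worker with a recent communication comes back; the employer row never matches because its actorid does not exist in workers.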

Manu 2009-03-01 23:08:47

Very, very interesting. So you partition the ID space in the actor_base... And a "worker" and an "employer" can't have the same primary key... Simplifies a lot of things! I like it! Thank you! +1!

Alex 2009-03-01 23:11:02

Answer 2

+7 A:

It's called ORM or Object Relational Mapping. There are dozens of products that purport to help you map OO structures to relational tables. Ruby on Rails, for example, offers Active Record to help bridge the divide. For PHP you have Propel and Doctrine and Porte and many others.

Scott Evernden 2009-03-01 22:44:53

I'm very familiar with ORM; I'm using the QCodo framework, which offers nice object-to-table mapping ability, but it's not good for this particular task. Is Propel or Doctrine better in this specific task?

Alex 2009-03-01 22:47:20

Answer 3

+2 A:

The best answer I've ever seen for this has been: http://en.wikipedia.org/wiki/The_Third_Manifesto

Unfortunately it's not something that fits in the space of a single answer here on Stack Overflow. I will attempt to abbreviate it, but I warn you that such an abbreviation will not be an accurate reflection of The Third Manifesto. Please redirect all criticisms of this solution to actually reading the damn thing, instead of assuming that you understand it fully from reading the abbreviation. Okay, here goes.

Define three new column types named worker, employer, and contact. Store objects of each of these types in columns of their respective types. Follow the standard rules of normalization for the rest of your data model.

My feeling is that current popular database technology doesn't actually support the "correct" way to do these things (specifically, many database systems don't allow the definition of new types), so whatever you do, you'll always be forced into a compromise. But after reading The Third Manifesto, at least you'll know what you're compromising on.

ORM is overwhelmingly the most popular solution to the problem at the moment, but I do not believe it is the correct solution.

Breton 2009-03-01 22:59:30

Answer 4

+7 A:

.. It's about "how do I map OOP structures to database tables in a painless way."

You don't.

Object orientation and relational algebra are two fundamentally different paradigms. You can't transition between them without a subjective interpretation. This is called an impedance mismatch, and has been dubbed the Vietnam of Computer Science.

troelskn 2009-03-01 23:10:26

Point taken, but you are being extreme. If you have a dynamically typed language (such as JavaScript or Python), it's easy to partially load an object and then complete the load lazily. In my experience, the problem is worse in theory than in practice.

Ken Fox 2009-03-02 00:16:02

I am perhaps exaggerating a bit, but the question to me sounded like "What's the silver bullet?" And there isn't one. Lazy loading is nice enough, but the real problem is with relations between entities. For simple use cases ORMs work fine, but complex object graphs simply don't map well.

troelskn 2009-03-02 08:45:31

Answer 5

A:

To me it just looks like your data model is missing a level. I would set it up more like this:

People Table - (Just information about the actual people)

Roles Table - (The types of roles people can have i.e. Worker, Employer, Contact - and information specific to that role)

PeopleRoles Table - (people_id, role_id, maybe start / modify dates etc.)

Entities Table - (Define the different types of Entities)

RoleEntities Table - (role_id, entity_id, etc.)

Then changing a Person from one Role to another (or allowing them to have multiple roles) is a simple update.
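A rough sketch of the first three tables, assuming SQLite via Python's sqlite3 module. The column names (full_name, role_name, start_date) and the roles_of helper are hypothetical, and the Entities and RoleEntities tables are omitted because the answer does not spell out their columns.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE people (
    people_id INTEGER PRIMARY KEY,
    full_name TEXT NOT NULL
);
CREATE TABLE roles (
    role_id   INTEGER PRIMARY KEY,
    role_name TEXT NOT NULL UNIQUE
);
-- Junction table: one row per (person, role) pair.
CREATE TABLE people_roles (
    people_id  INTEGER NOT NULL REFERENCES people(people_id),
    role_id    INTEGER NOT NULL REFERENCES roles(role_id),
    start_date TEXT,
    PRIMARY KEY (people_id, role_id)
);
INSERT INTO people VALUES (1, 'Ann');
INSERT INTO roles VALUES (1, 'Worker'), (2, 'Employer'), (3, 'Contact');
INSERT INTO people_roles VALUES (1, 1, '2009-03-01');
""")


def roles_of(conn, people_id):
    """Return the role names held by one person, sorted alphabetically."""
    rows = conn.execute(
        """SELECT r.role_name
           FROM people_roles pr
           JOIN roles r ON r.role_id = pr.role_id
           WHERE pr.people_id = ?
           ORDER BY r.role_name""",
        (people_id,),
    )
    return [row[0] for row in rows]


before = roles_of(conn, 1)
# Changing a role is a single UPDATE on the junction table.
conn.execute(
    "UPDATE people_roles SET role_id = 2 WHERE people_id = 1 AND role_id = 1"
)
after_update = roles_of(conn, 1)
# Holding several roles at once is one more row.
conn.execute("INSERT INTO people_roles VALUES (1, 3, '2009-03-02')")
after_insert = roles_of(conn, 1)
```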

Ron

Ron Savage 2009-03-01 23:16:04

Answer 6

+3 A:

What you are looking for is Disjoint-subtypes ... ORM is a hack.

mike g 2009-03-01 23:17:11

Thanks a bunch for finding that other thread. Very helpful.

Alex 2009-03-01 23:25:20

Answer 7

A:

Many RDBMSs offer a table-inheritance feature, which links parent tables to child tables in much the same way as class inheritance. The implementation varies a bit from vendor to vendor, but it can take some of the pain out of implementing similar concepts.

Also, most RDBMSs have some combination of triggers, stored views and stored procedures that can separate behavior from implementation. In many cases these features, such as PostgreSQL's rules (a generalization of views), offer very sophisticated encapsulation and are quite easy to use.
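To illustrate only the view-plus-trigger part of this, here is a small sketch using SQLite via Python's sqlite3 module. It is not PostgreSQL's rule system and not vendor table inheritance; the names actor_base, worker, worker_full and worker_full_insert are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE actor_base (actorid INTEGER PRIMARY KEY, actortype TEXT NOT NULL);
CREATE TABLE worker (
    actorid INTEGER PRIMARY KEY REFERENCES actor_base(actorid),
    name    TEXT NOT NULL
);

-- The view hides the parent/child join from callers.
CREATE VIEW worker_full AS
    SELECT ab.actorid, ab.actortype, w.name
    FROM actor_base ab
    JOIN worker w ON w.actorid = ab.actorid;

-- The trigger lets callers insert through the view; it creates the
-- base row first and reuses its generated id for the child row.
CREATE TRIGGER worker_full_insert INSTEAD OF INSERT ON worker_full
BEGIN
    INSERT INTO actor_base (actortype) VALUES ('worker');
    INSERT INTO worker (actorid, name)
        VALUES (last_insert_rowid(), NEW.name);
END;
""")

conn.execute("INSERT INTO worker_full (name) VALUES ('Ann')")
rows = conn.execute(
    "SELECT actorid, actortype, name FROM worker_full"
).fetchall()
print(rows)
```

Callers only ever touch worker_full, so the two-table layout underneath could change without affecting them.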

TokenMacGuy 2009-03-01 23:42:26

Answer 8

A:

A couple of people have noted the object-relational impedance mismatch. The best solution is to simply forgo the RDBMS in favor of the OODBMS, which has recently regained popularity.

That said, there aren't any object databases with APIs in pure PHP, as far as I know. A quick search produced this result but it hasn't been updated in years. On the other hand, I've heard of plenty of object databases for other languages, including db4o and ZODB, as well as Hibernate (which is strictly an ORM layer over a relational database rather than an object database).

Nikhil Chelliah 2009-03-01 23:42:44

I thought OODBMSs had died on their backsides while everyone continued using RDBMSs (or ORDBMSs, as Oracle likes to call itself now).

Tony Andrews 2009-03-02 12:12:54

Answer 9

+3 A:

Here's a solution I came up with about 10 years ago. The system that uses this design is still running, so it worked well enough to survive longer than most of my code. ;) Today I might use one of the ORM packages that Scott mentions, but there are really no huge problems with just using SQL directly.

Model all of your inheritance relations as joins between tables. Each table in your system will hold the attributes of a specific class.
Use a synthetic object id (oid) as your primary key for all objects. A sequence generator or autoincrement column is necessary to generate oid values.
All inherited classes must use the same oid type as their parent. Define the oid as a foreign key with cascaded delete. The parent table gets the autoincrement oid column and the children get plain oid columns.
Queries on final classes are made on the corresponding table. You can either join all the parent class tables into the query or just lazy load the attributes you need. If your inheritance hierarchy is deep and you have many classes, an ORM package can really simplify your code. My system had fewer than 50 classes with a maximum inheritance depth of 3.
Queries across child classes (i.e. queries on a parent class) can either lazy load the child attributes on a per-instance basis, or you can repeat the query for each child class joined with base classes. Lazy loading child attributes based on a parent class query requires that you know the type of the object. You may have enough information in the parent classes already, but if not you'll need to add type information. Again, this is where an ORM package can help.

Virtual classes without member attributes can be skipped in the table structure, but you won't be able to query based on those classes.

Here's what "show me all communications with just actors of type worker" looks like.

select * from comm c, worker w where c.actor=w.oid;

If you have sub-classes of communication, and you want to immediately load all the child class attributes (perhaps your system does not allow partial construction), the easiest solution is to eager join on all the possible classes.

select * from comm c, worker w, missive m where c.actor=w.oid and c.oid=m.oid;
select * from comm c, worker w, shoutout s where c.actor=w.oid and c.oid=s.oid;
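The design above can be sketched end to end as follows, assuming SQLite via Python's sqlite3 module. The actor parent table, the name, body and volume columns, and the new_worker/new_missive helpers are hypothetical; only comm, worker, missive, shoutout and oid come from the queries above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # needed for cascaded deletes in SQLite
conn.executescript("""
-- Parent tables own the autoincrement oid; children get plain oid
-- columns that are foreign keys with cascaded delete.
CREATE TABLE actor (oid INTEGER PRIMARY KEY AUTOINCREMENT);
CREATE TABLE worker (
    oid  INTEGER PRIMARY KEY REFERENCES actor(oid) ON DELETE CASCADE,
    name TEXT
);
CREATE TABLE comm (
    oid   INTEGER PRIMARY KEY AUTOINCREMENT,
    actor INTEGER NOT NULL REFERENCES actor(oid) ON DELETE CASCADE
);
CREATE TABLE missive (
    oid  INTEGER PRIMARY KEY REFERENCES comm(oid) ON DELETE CASCADE,
    body TEXT
);
CREATE TABLE shoutout (
    oid    INTEGER PRIMARY KEY REFERENCES comm(oid) ON DELETE CASCADE,
    volume INTEGER
);
""")


def new_worker(name):
    # Generate the oid in the parent table, then reuse it in the child.
    oid = conn.execute("INSERT INTO actor DEFAULT VALUES").lastrowid
    conn.execute("INSERT INTO worker (oid, name) VALUES (?, ?)", (oid, name))
    return oid


def new_missive(actor_oid, body):
    oid = conn.execute(
        "INSERT INTO comm (actor) VALUES (?)", (actor_oid,)
    ).lastrowid
    conn.execute("INSERT INTO missive (oid, body) VALUES (?, ?)", (oid, body))
    return oid


ann = new_worker("Ann")
new_missive(ann, "hello")

# Eager join across the worker and missive child tables.
rows = conn.execute("""
    SELECT w.name, m.body
    FROM comm c
    JOIN worker w ON c.actor = w.oid
    JOIN missive m ON c.oid = m.oid
""").fetchall()

# Deleting the parent row cascades through worker, comm and missive.
conn.execute("DELETE FROM actor WHERE oid = ?", (ann,))
remaining = conn.execute("SELECT COUNT(*) FROM missive").fetchone()[0]
```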

One last thing. Make sure you have a good database and correct indexes. Performance can be a serious problem if your database can't optimize these joins.

Ken Fox 2009-03-02 00:01:57

Thank you for an extensive reply! I think I'm getting the hang of it. Many of the responses suggest a similar approach of "partitioning" the primary keys using this "primary key of the child = foreign key into parent" approach. It makes a lot of sense.

Alex 2009-03-02 00:07:07

http://martinfowler.com/eaaCatalog/concreteTableInheritance.html

troelskn 2009-03-02 08:49:08

Answer 10

+1 A:

It's probably worth your time getting familiar with Object Role Modeling as discussed in this question. The biggest problem I see is that there is no existing accepted methodology for having a conceptual design discussion about relational data. The best you can do is logical modeling (ERMs usually). Object Role Modeling provides the basis for that discussion. I hope you'll see recognizable artifacts from a similar OOP design discussion you might have.

le dorfier 2009-03-02 00:24:05
