ansaurus

Question

Broken referential integrity: What would Edgar Codd say?

Answer 1

+2 A:

If there is no customer named Mary in the Customers table, then there is no referential integrity between the tables. Specifically, a foreign key refers to a non-existent primary key.

Does this break the relational model? No. The relational model itself defines this condition (a lack of referential integrity), and it indicates a problem with the underlying data.
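As a rough illustration of the "foreign key refers to a non-existent primary key" case, here is a minimal Python sketch using the standard `sqlite3` module. The `Orders` table, its columns, and the `find_orphans` helper are assumptions made up for the example; only the `Customers` table and the name Mary come from the discussion. No foreign key constraint is declared, so the database accepts the bad row and the query merely detects it afterwards.

```python
import sqlite3


def find_orphans(conn):
    """Return Orders rows whose customer_name has no match in Customers."""
    return conn.execute(
        """
        SELECT o.order_id, o.customer_name
        FROM Orders AS o
        LEFT JOIN Customers AS c ON c.name = o.customer_name
        WHERE o.customer_name IS NOT NULL  -- NULLs are not dangling references
          AND c.name IS NULL               -- no matching primary key was found
        ORDER BY o.order_id
        """
    ).fetchall()


conn = sqlite3.connect(":memory:")
# Hypothetical schema: no FOREIGN KEY clause, so nothing stops the bad row.
conn.execute("CREATE TABLE Customers (name TEXT PRIMARY KEY)")
conn.execute(
    "CREATE TABLE Orders (order_id INTEGER PRIMARY KEY, customer_name TEXT)"
)
conn.execute("INSERT INTO Customers VALUES ('John')")
conn.executemany(
    "INSERT INTO Orders VALUES (?, ?)", [(1, "John"), (2, "Mary")]
)

orphans = find_orphans(conn)
print(orphans)  # [(2, 'Mary')]
```

The tables are still perfectly valid relations; the query only shows that the data in them is inconsistent.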

Michael Todd 2009-03-16 16:23:29

I searched Google for "relational model" "lack of referential integrity", but it doesn't return anything. I would really like to read about this. How do you know it's defined in the relational model? Do you have a reference?

lubos hasko 2009-03-16 16:31:31

I'm currently taking a class at UNLV specifically for database development (from the MIS-side, not the CS-side), so I just rattled the answer off. I don't have my books with me right now but I'll try to find something for you.

Michael Todd 2009-03-16 22:10:07

Hmmm...in my quick search I found nothing "high-level" enough to explain my position (just specifics about DB design or one-liners). You may want to try accessing Codd's documents on www.acm.org (requires a membership, though).

Michael Todd 2009-03-16 22:15:44

Gotta love the internet. http://www.seas.upenn.edu/~zives/03f/cis550/codd.pdf is the text of Codd's "A Relational Model of Data for Large Shared Data Banks." On the last page it talks about inconsistencies, how they occur, and ways to resolve them.

Michael Todd 2009-03-16 22:22:45

Answer 2

+1 A:

I read the following as clearly stating that referential integrity is included in the relational model:

Two integrity rules apply to every relational database:

1 Entity integrity:
No mark of either type is permitted in any attribute which is a component of the primary key of a base relation

2 Referential integrity:
Let D be a domain from which one or more single-attribute primary keys draw their values. Let K be a foreign key which draws its values from domain D. Every unmarked value which occurs in K must also exist in the database as a value in the primary key of some base relation.

"Missing information (applicable and inapplicable) in relational databases," E. F. Codd, ACM SIGMOD Record, vol. 15, no. 4, pp. 53-78, 1986.

By "mark of either type" he is referring to an unknown value, for which we use NULL today. This paper suggested two different types of unknown values, one for "applicable but missing," and one for "inapplicable."

By "unmarked" he means not NULL.
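To see how the two quoted rules map onto a present-day SQL engine, here is a small Python sketch using the standard `sqlite3` module. It is only an approximation: SQL has a single NULL rather than the two kinds of mark the paper proposes, and the `Customers`/`Orders` schema and the `try_insert` helper are hypothetical names chosen for the example. Note that SQLite needs `PRAGMA foreign_keys = ON` to enforce foreign keys, and an explicit `NOT NULL` on a non-integer primary key to reject NULLs.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default

# Entity integrity: the primary key may not hold a mark (NULL).
conn.execute("CREATE TABLE Customers (name TEXT PRIMARY KEY NOT NULL)")
# Referential integrity: every non-NULL customer_name must exist in Customers.
conn.execute(
    """
    CREATE TABLE Orders (
        order_id INTEGER PRIMARY KEY,
        customer_name TEXT REFERENCES Customers(name)
    )
    """
)
conn.execute("INSERT INTO Customers VALUES ('John')")


def try_insert(sql, params):
    """Run one INSERT; return True if accepted, False if an integrity rule rejected it."""
    try:
        conn.execute(sql, params)
        return True
    except sqlite3.IntegrityError:
        return False


results = {
    "null primary key": try_insert("INSERT INTO Customers VALUES (?)", (None,)),
    "matching foreign key": try_insert("INSERT INTO Orders VALUES (1, ?)", ("John",)),
    "null foreign key": try_insert("INSERT INTO Orders VALUES (2, ?)", (None,)),
    "dangling foreign key": try_insert("INSERT INTO Orders VALUES (3, ?)", ("Mary",)),
}
print(results)
```

Under these assumptions the NULL primary key and the dangling reference to Mary are rejected, while the NULL foreign key is accepted, which matches the "every unmarked value" wording of the rule.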

Bill Karwin 2009-03-16 17:33:29
