ansaurus

Question

MySQL ON DUPLICATE KEY UPDATE with nullable column in unique key

Answer 1

+4 A:

I think something along the lines of (2) is really the best bet — or, at least, it would be if you were starting from scratch. In SQL, NULL means unknown. If you want some other meaning, you really ought to use a special value for that, and 0 is certainly an OK choice.

You should do this across the entire database, not just this one table. Then you shouldn't wind up with weird special cases. In fact, you should be able to get rid of a lot of your current ones (example: currently, if you want the summary row where there is no filter, you have the special case "filter is null" as opposed to the normal case "filter = ?".)

You should also go ahead and create a "not present" entry in the referred-to table as well, to keep the FK constraint valid (and avoid special cases).

PS: Tables w/o a primary key are not relational tables and should really be avoided.

edit 1

Hmmm, in that case, do you actually need the on duplicate key update? If you're doing a INSERT ... SELECT, then you probably do. But if your app is supplying the data, just do it by hand — do the update (mapping zip = null to zip is null), check how many rows were changed (MySQL returns this), if 0 do an insert.

derobert 2009-08-19 07:07:42

Yes, the summary table is quite explicitly not a relational table. It is simply a convenient container for holding reporting results.My statement that "These NULLs are intended to mean 'not present, and all such cases are equivalent'", is perhaps misleading. In the relational tables containing the normalized data, the filter_id and other nullable relationships I mention as being part of the unique key in the summary table truly have the meaning of "unknown", and are not part of any primary or unique keys. See edit, above.

ryandenki 2009-08-19 07:50:05

Exactly right. We use INSERT...SELECT, using the ON DUPLICATE KEY clause there to update entries throughout the day.Actually, the first implementation two years ago was as you suggest--first selecting the data, performing some extra manipulation, then issuing individual INSERTS, with WHERE clauses taking into account the IS NULL case.That approach has the advantage that the locks inserting individual rows are shorter than for the INSERT...SELECT method. But these locks are only on the master using row replication, and we could replace all the app-side code with a single SQL statement.

ryandenki 2009-08-20 02:41:18

Answer 2

A:

Change the DEFAULT NULL column to DEFAULT 0, which allows the UNIQUE KEY to be matched consistently. This has the negative side effect of overly complicating the development of queries against the summary table. It forces us to use a lot of "CASE filter_id = 0 THEN NULL ELSE filter_id END", and makes for awkward joining since all of the other tables have actual NULLs for the filter_id.

Create a view which returns "CASE filter_id = 0 THEN NULL ELSE filter_id END", and using this view instead of the table directly. The summary table contains a few hundred thousand rows, and I've been told view performance is quite poor.

View performance in MySQL 5.x will be fine, as the view does nothing but replace a zero with a null. Unless you use aggregates/sorts in a view, most any query against the view will be re-written by the query optimizer to just hit the underlying table.

And of course, since it's an FK, you'll have to create an entry in the referred-to table with an id of zero.

tpdi 2009-08-19 07:15:30

ansaurus

tags:

views:

answers:

MySQL ON DUPLICATE KEY UPDATE with nullable column in unique key

edit 1

related questions