ansaurus

Question

Answer 1

A:

There's no way to do this in SQL: you can have six rows (every unique tuple), five rows (every first use of each column value) or one row (every first use of each value in each column which appears in both columns).

The reason you're having such a difficult time explaining what you want is that it's based on a human judgement call. You won't be able to do this in SQL until you're able to describe it qualitatively in English, and what you want isn't qualitative, it's procedural.

There are a bunch of ways to approximate it, such as grouping by the lesser column then sorting by match count inverse, but they're all exploitable.

Until you can give an unambiguous, logic driven criterion for selection, this will not succeed. Saying "minimal" doesn't count until you define minimal, and the minimal you appear to want requires procedural aggregate behvaior, which you cannot get in MySQL.

2009-07-23 22:30:14

Probably the closest you'll get which doesn't incorrectly exclude rows is select distinct * from (select * from foo group by one) as l union all (select * from foo group by two) as r;

2009-07-23 22:32:40

Thanks for the reply, admit don't understand the difference between qualitative and procedural - will attempt to read up on those terms...

barryhunter 2009-07-24 10:23:06

Answer 2

A:

Well as it turns out I found an answer ;)

CREATE TEMPORARY TABLE table2 ENGINE HEAP SELECT * FROM table;

ALTER IGNORE TABLE table2 ADD UNIQUE (one), ADD UNIQUE (two);

SELECT * FROM table2;

The IGNORE in the alter table is important, as it simply discards any duplicate rows based on the unique indexe**s**.

(not sure why didn't think of this before - as used it to good effect in solving "order before group by" style queries!)

barryhunter 2009-07-24 10:19:50

or course in the real query have a WHERE and ORDER BY on the initial select, which makes it useful. Experimenting with different order by's, RAND() works well.

barryhunter 2009-07-24 10:21:45

ansaurus

tags:

views:

answers:

Multi column distinct in mysql

related questions