ansaurus

Question

MySQL Select Statement DISTINCT for Multiple Columns

Answer 1

+4 A:

Try the following. It might not be the most efficient query, but it will work:

SELECT uniqueID, stringID, subject
FROM data_table
WHERE uniqueID IN
 (
  SELECT MAX(uniqueID) 
  FROM data_table
  GROUP BY stringID
 )
ORDER BY uniqueID DESC

Andrew Moore 2009-07-25 06:34:10

This query helped the most. Also, I replaced the 'uniqueID' with lexu's suggestion above, using the timestamp. Thanks very much for your help.

2009-07-25 07:04:41

Answer 2

A:

Edit: Based on new info provided by the OP in a comment, this would be preferable to relying on uniqueID:

select t.uniqueID
       , t.stringID
       , t.subject
       , t.your_timestamp_col
from   data_table t
       left outer join data_table t2
       on t.stringID = t2.stringID
    and
       t2.your_timestamp_col > t.your_timestamp_col
where  t2.uniqueID is null

If, as lexu mentions in a comment, you are certain that the highest uniqueID value always corresponds with the newest subject, you could do this:

select t.uniqueID
       , t.stringID
       , t.subject
from   data_table t
       left outer join data_table t2
       on t.stringID = t2.stringID
    and
       t2.uniqueID > t.uniqueID
where  t2.uniqueID is null

Which basically means: return to me only those records from data_table where there exists no higher uniqueID value.

Adam Bernier 2009-07-25 06:49:45

It will actually perform worse. The subquery does not use any of the superqueries columns, and therefore, is computed only once. A `max` is much quicker than trying to compare each id one by one. Moreover, the join will then have to apply the `where` clause. The subquery, however, will create a hash table which serves as a lookup to each of the ID's. Ergo, only one comparison, and we don't have to check the column after all the comparisons are done.

Eric 2009-07-25 06:53:34

ansaurus

tags:

views:

answers:

MySQL Select Statement DISTINCT for Multiple Columns

related questions