ansaurus

Question

SQL Find Possible Duplicates

Answer 1

A:

A check constraint perhaps.

Something along the lines of select count(*) where date1 = '1900/01/01' and date2 = @date2 and groupid = @groupid.

Just need to see if you can do this in a table-level constraint ....

LRE 2009-08-25 05:24:38

With some example code I think this is the answer.

Cellfish 2009-08-25 05:30:37

Answer 2

A:

In addition to having a PRIMARY KEY field defined on the table, you can also add other UNIQUE constraints to perform the same sort of thing you're asking for. They'll validate that a particular column or set of columns have a unique value in the table.

Check out the entry in the MySQL manual for an example:

http://dev.mysql.com/doc/refman/5.1/en/create-table.html

Brent Nash 2009-08-25 05:27:32

Answer 3

+1 A:

You can identify duplicates on (date2, GroupID) using

Select date2,GroupID
from t
group by (date2,GroupID)
having count(*) >1

Use this to identify records in main table that are duplicates:

Select *
from t
where date1='1900/01/01'
and (date2,groupID) = (Select date2,GroupID
                       from t
                       group by (date2,GroupID)
                       having count(*) >1)

NOTE: Since Date1, Date2, GroupID forms a unique key, check if your design is right in allowing Date1 to be NULL. You could have a genuine case where Date 1 is different for two rows while (date2,GroupID) is the same

bkm 2009-08-25 05:28:16

Unfortunately I have to allow for the fact that there may be no information available for Date1

Karl 2009-08-25 05:53:44

Answer 4

A:

select * from table a
join (
select Date2, GroupID, Count(*)
from table
group by Date2, GroupID
having count(*) > 1
) b on (a.Date2 = b.Date2 and a.GroupID = b.GroupID)
where a.Date1 = '1900/01/01'

wgpubs 2009-08-25 05:32:28

Answer 5

+1 A:

If I understand correctly, you are looking for a group of IDs for which GroupID and Date2 are the same, there's one occurance of Date1 that's different from 1900/01/01, and all the rest of the Date1s are 1900/01/01.

If I got it right, here's the query for you:

SELECT T.ID 
FROM Table T1
WHERE 

(T1.GroupID, T1.Date2) IN
    (SELECT T2.GroupID, T2.Date2
    WHERE T2.Date1 = '1900/01/01' OR
        T2.Date IS NULL
    GROUP BY T2.GroupID, T2.Date2)

AND 

1 >= 
(
    SELECT COUNT(*) 
    FROM TABLE T3
    WHERE NOT (T3.Date1 = '1900/01/01') 
    AND NOT (T3.Date1 IS NULL)
    AND T3.GroupID = T1.GroupID
    AND T3.Date2 = T1.Date2
)

Hope that helps.

Roee Adler 2009-08-25 05:33:01

Answer 6

+2 A:

bkm kind of has it, but the inner select can perform poorly on some databases. This is more straightforward:

select t1.* from 
t as t1 left join t as t2 
on (t1.date2=t2.date2 and t1.groupid=t2.groupid)
where t1.id != t2.id and (t1.date1='1900/01/01' or t2.date2='1900/01/01')

SquareCog 2009-08-25 05:33:14

works perfectly, just needs to be select DISTINCT t1.* .....

Karl 2009-08-25 06:27:30

Answer 7

A:

This is the most straightforward way I can think to do it:

SELECT DISTINCT t1.*
FROM t t1 JOIN t t2 USING (date2, groupid)
WHERE t1.date1 = '1900/01/01';

No need to use GROUP BY, which performs poorly on some brands of database.

Bill Karwin 2009-08-25 05:42:25

ansaurus

tags:

views:

answers:

SQL Find Possible Duplicates

related questions