ansaurus

Question

Finding duplicate records in SQL based on combination of fields

Answer 1

+1 A:

-- List all Duplicates
select m1.lastname, m1.firstname, m1.email, m1.zipcode
from tblMain m1
inner join tblMain m2
on isnull(m1.lastname, '') = isnull(m2.lastname, '')
and isnull(m1.firstname, '') = isnull(m2.firstname, '')
and isnull(m1.email, '') = isnull(m2.email, '')
and isnull(m1.zipcode, '') = isnull(m2.zipcode, '')
and m1.ID <> m2.ID
order by 1, 2, 3, 4
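
As a possible alternative to the self-join (a sketch, not part of the original answer), grouping on the same four columns lists each duplicated combination once, along with how many copies exist. It assumes the same tblMain table; the copies alias is made up for illustration. Note that GROUP BY puts NULLs in one group but, unlike the isnull(..., '') comparison, does not treat NULL and the empty string as the same value.

```sql
-- One row per duplicated combination, with the number of copies found.
-- "copies" is an illustrative alias, not a column in tblMain.
select lastname, firstname, email, zipcode, count(*) as copies
from tblMain
group by lastname, firstname, email, zipcode
having count(*) > 1
order by lastname, firstname, email, zipcode
```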

To delete the later duplicates (keeping the row with the lowest ID in each group), use something like:

delete from tblMain
where ID in 
(
    select m1.ID
    from tblMain m1
    inner join tblMain m2
    on isnull(m1.lastname, '') = isnull(m2.lastname, '')
    and isnull(m1.firstname, '') = isnull(m2.firstname, '')
    and isnull(m1.email, '') = isnull(m2.email, '')
    and isnull(m1.zipcode, '') = isnull(m2.zipcode, '')
    and m1.ID > m2.ID
)

Gordon Bell 2009-03-16 20:49:06

Answer 2

A:

I don't understand how you can be sure SSIS is the answer to your problem. Why can't you simply create unique keys in your "final" table to ensure you are not adding duplicates? Perhaps you should explain your problem better...
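
A minimal sketch of such a unique key, assuming the tblMain table and column names used in the other answer (the constraint name is invented for illustration). Adding the constraint fails if the table already contains duplicate rows, so existing duplicates would have to be cleaned up first.

```sql
-- Assumption: table and columns are borrowed from Answer 1;
-- UQ_tblMain_Person is a hypothetical constraint name.
alter table tblMain
add constraint UQ_tblMain_Person
    unique (lastname, firstname, email, zipcode)
```

Once the constraint exists, an insert that repeats an existing combination is rejected with an error rather than silently stored.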

Sam 2009-03-16 20:54:04

Answer 3

+2 A:

It seems to me you are trying to outwit your users by second-guessing them. Unfortunately, this almost never works, as you may indeed have two clients with the same name but different zipcodes, or other cases like that.

Your best bet is to "suggest" that the customer they are about to save already exists (and show them the duplicate) but allow them to save anyway. So the process needs to probably look like this:

1. User enters info and presses Save
2. System detects a potential duplicate using criteria, and prompts user
3. User will either cancel or confirm, upon which you take the appropriate action

If there is no potential duplicate then steps 2-3 can be safely skipped.
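
The duplicate check the system performs before prompting could be sketched roughly as follows. This is only an illustration: the table and columns are assumed from the other answer, and the variable names, types, and sample values are hypothetical stand-ins for whatever the user entered. Here the criteria are name and email only, so a matching client with a different zipcode is shown as a candidate and the user decides.

```sql
-- Hypothetical inputs representing the record the user is about to save.
declare @lastname  varchar(50)  = 'Smith'
declare @firstname varchar(50)  = 'John'
declare @email     varchar(100) = 'john.smith@example.com'

-- Any rows returned are potential duplicates to show in the prompt.
-- No rows means the save can proceed without prompting.
select ID, lastname, firstname, email, zipcode
from tblMain
where isnull(lastname, '')  = isnull(@lastname, '')
  and isnull(firstname, '') = isnull(@firstname, '')
  and isnull(email, '')     = isnull(@email, '')
```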

hova 2009-03-16 21:06:58

Brings to mind: "Programming today is a race between software engineers striving to build bigger and better idiot-proof programs, and the Universe trying to produce bigger and better idiots. So far, the Universe is winning." -- Rick Cook, The Wizardry Compiled

marc_s 2009-03-17 06:30:49

That is what I've done in one of our web applications and it works very well.

WakeUpScreaming 2009-03-17 20:16:15
