ansaurus

Question

Answer 1

A:

You could try something like the following:

delete T1
from MyTable T1, MyTable T2
where T1.dupField = T2.dupField
and T1.uniqueField > T2.uniqueField

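(This assumes that you have a unique field that can be compared, such as an integer ID. Within each group of duplicates, the row with the lowest value is kept.)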

Personally, though, I'd say you are better off fixing whatever allows duplicate entries into the database in the first place, rather than cleaning them up afterwards.
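As a rough sketch of that prevention step in SQL Server, a unique constraint on the duplicated column makes the database reject new duplicates. The constraint name UQ_MyTable_dupField is made up for illustration, and the statement only succeeds once the existing duplicates are gone.

```sql
-- Assumes the existing duplicates in MyTable have already been deleted;
-- otherwise SQL Server refuses to create the constraint.
-- UQ_MyTable_dupField is a hypothetical name chosen for this example.
ALTER TABLE MyTable
ADD CONSTRAINT UQ_MyTable_dupField UNIQUE (dupField);
```

After this, an INSERT that repeats an existing dupField value fails with a constraint violation instead of quietly adding a duplicate row.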

Ben Cawley 2010-07-23 11:02:17

I do not have a unique field (ID) in my table. How can I perform the operation then?

2010-07-24 04:20:40

Answer 2

+5 A:

Assuming that your Employee table also has a unique column (ID in the example below), the following should work:

delete from Employee 
where id NOT in
(
select Min(ID)
from Employee 
group by EmployeeName 
)

This leaves the row with the lowest ID for each EmployeeName in the table.

nonnb 2010-07-23 11:07:58

Also, in Oracle, you could use "rowid" if there is no other unique id column.

Brandon Horsley 2010-07-23 11:13:03
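For reference, the Oracle variant might look like the sketch below. It reuses the Employee table and EmployeeName column from the answer above and relies on Oracle's rowid pseudocolumn. It is not valid in SQL Server, which has no rowid.

```sql
-- Oracle only: rowid identifies each stored row, so no ID column is needed.
-- Keeps one row per EmployeeName (the one with the smallest rowid).
DELETE FROM Employee
WHERE rowid NOT IN (
  SELECT MIN(rowid)
  FROM Employee
  GROUP BY EmployeeName
);
```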

+1 Even if there were not an ID column, one could be added as an identity field.

Kyle B. 2010-07-23 15:31:15
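A minimal sketch of that idea for SQL Server, assuming the table is Employee and duplicates are defined by EmployeeName as in the answer above. The column name TempId is hypothetical, and the GO lines are batch separators for SSMS or sqlcmd, needed so that the new column exists before the DELETE is compiled.

```sql
-- Add a temporary identity column; SQL Server numbers the existing rows.
ALTER TABLE Employee ADD TempId INT IDENTITY(1,1) NOT NULL;
GO

-- Keep the row with the lowest TempId per EmployeeName, delete the rest.
DELETE FROM Employee
WHERE TempId NOT IN (
  SELECT MIN(TempId)
  FROM Employee
  GROUP BY EmployeeName
);
GO

-- Drop the helper column again if it is not wanted permanently.
ALTER TABLE Employee DROP COLUMN TempId;
```

Which of the duplicate rows receives the lowest TempId is not guaranteed, so this only suits rows that are true duplicates of each other.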

Answer 3

+1 A:

You can do this with window functions. It will order the dupes by empId, and delete all but the first one.

delete x from (
  select *, rn=row_number() over (partition by EmployeeName order by empId)
  from Employee 
) x
where rn > 1;

Run it as a select to see what would be deleted:

select *
from (
  select *, rn=row_number() over (partition by EmployeeName order by empId)
  from Employee 
) x
where rn > 1;

John Gibb 2010-07-23 15:22:00

Answer 4

A:

WITH CTE AS (
  SELECT EmployeeName,
         ROW_NUMBER() OVER (PARTITION BY EmployeeName ORDER BY EmployeeName) AS R
  FROM Employee
)
DELETE FROM CTE WHERE R > 1;

The magic of common table expressions.

SubPortal 2010-07-25 11:30:41


delete duplicate records in SQL Server
