ansaurus

Question

Answer 1

A:

This article should help you out.

James 2009-06-25 11:50:36

You should really add a summary of the article to your answer. Then, if the article is moved (or removed), your answer would still be useful.

tvanfosson 2009-06-25 11:54:05

thats what i dont want.......i dont want to create a temporary table with distinct data but i want to alter my existing table by delete duplicated records.

developer 2009-06-25 11:54:59

Answer 2

+10 A:

Yes, assuming you have a unique ID field, you can delete all records that are the same except for the ID, but don't have "the minimum ID" for their group of values.

Example query:

DELETE FROM Table
WHERE ID NOT IN
(
SELECT MIN(ID)
FROM Table
GROUP BY Field1, Field2, Field3, ...
)

Notes:

I freely chose "Table" and "ID" as representative names
The list of fields ("Field1, Field2, ...") should include all fields except for the ID
This may be a slow query depending on the number of fields and rows, however I expect it would be okay compared to alternatives

EDIT: In case you don't have a unique index, my recommendation is to simply add an auto-incremental unique index. Mainly because it's good design, but also because it will allow you to run the query above.

Roee Adler 2009-06-25 11:52:08

This is cool as long as ID is numeric

Svetlozar Angelov 2009-06-25 13:56:44

IDs are usually numeric so it should not be a problem, however actually it will work as long as "MIN" is defined on ID it will work. If it's defined on strings, and the field is unique, it will work great.

Roee Adler 2009-06-25 15:03:04

I like your solution.. just wanted to clarify... it will be a problem if the table doesn't have a unique index too, it's good to have multiple options for a problem ..

Svetlozar Angelov 2009-06-25 15:08:17

@Svetilo: You just gave me an idea for how to deal with no unique index...

Roee Adler 2009-06-25 15:43:56

Answer 3

+2 A:

ALTER IGNORE TABLE 'table' ADD UNIQUE INDEX(your cols);

Duplicates get NULL, then you can delete them

Svetlozar Angelov 2009-06-25 11:55:23

Answer 4

A:

delete from table_x a where rowid < any (select rowid from table_x b where a.someField = b.someField and a.someOtherField = b.someOtherField) where (a.someField, a.someOtherField) in (select c.someField, c.someOtherField from table_x c group by c.someField, c.someOtherField having count(*) > 1)

In above query the combination of someField and someOtherField must identify the duplicates distinctively.

Priyank 2009-06-25 12:24:53

ansaurus

tags:

views:

answers:

Deleting duplicate rows from a table

related questions