ansaurus

Question

UPDATE query that fixes orphaned records

Answer 1

+1 A:

The most common technique to identify duplicates in a table is to group by the fields that represent duplicate records:

ID  FIRST_NAME  LAST_NAME
1   Brian   Smith
3   George  Smith
25  Brian   Smith

In this case we want to remove one of the Brian Smith records or, in your case, update the ID field so that both have the same value, either 25 or 1 (which one you keep is arbitrary).

SELECT   MIN(id) AS id, first_name, last_name
    FROM example
GROUP BY first_name, last_name

Using min on ID will return:

ID  FIRST_NAME  LAST_NAME
1   Brian   Smith
3   George  Smith

If you use max instead, you would get:

ID  FIRST_NAME  LAST_NAME
25  Brian   Smith
3   George  Smith
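To try the grouping query on the three sample rows, here is a minimal sketch using Python's built-in sqlite3 module. SQLite is an assumption made only so the example runs anywhere, and the helper name surviving_ids is hypothetical. The query selects the name columns alongside the aggregate so that the result has the same shape as the tables shown.

```python
import sqlite3

def surviving_ids(rows, keep="min"):
    """Return one (id, first_name, last_name) row per name, keeping the min or max id."""
    agg = {"min": "MIN", "max": "MAX"}[keep]  # whitelist the aggregate name
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE example (id INTEGER, first_name TEXT, last_name TEXT)")
    conn.executemany("INSERT INTO example VALUES (?, ?, ?)", rows)
    result = conn.execute(
        f"SELECT {agg}(id) AS id, first_name, last_name "
        "FROM example GROUP BY first_name, last_name ORDER BY 1"
    ).fetchall()
    conn.close()
    return result

rows = [(1, "Brian", "Smith"), (3, "George", "Smith"), (25, "Brian", "Smith")]
min_rows = surviving_ids(rows, "min")
max_rows = surviving_ids(rows, "max")
print(min_rows)
print(max_rows)
```

The rows come back ordered by ID, so with max the George row is listed before the Brian row.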

I usually use this technique to delete the duplicates, not update them:

DELETE FROM example
      WHERE ID NOT IN (SELECT   MAX (ID)
                           FROM example
                       GROUP BY first_name, last_name)
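The delete can be checked the same way. This is a sketch that assumes SQLite (through Python's sqlite3 module) rather than whatever database you are on, and delete_duplicates is a hypothetical helper name. Some engines restrict a subquery that reads the table being deleted from, so treat this as an illustration of the technique, not a portable statement.

```python
import sqlite3

def delete_duplicates(rows):
    """Keep only the highest id per (first_name, last_name) and return what is left."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE example (id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT)"
    )
    conn.executemany("INSERT INTO example VALUES (?, ?, ?)", rows)
    # Same statement as above: delete every row whose id is not the group's MAX(id).
    conn.execute(
        "DELETE FROM example WHERE id NOT IN "
        "(SELECT MAX(id) FROM example GROUP BY first_name, last_name)"
    )
    remaining = conn.execute(
        "SELECT id, first_name, last_name FROM example ORDER BY id"
    ).fetchall()
    conn.close()
    return remaining

remaining = delete_duplicates(
    [(1, "Brian", "Smith"), (3, "George", "Smith"), (25, "Brian", "Smith")]
)
print(remaining)
```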

Brian 2010-05-14 16:55:36

Thanks, Brian. That's a cool method for deleting duplicates. However, although I'd be fine with deleting duplicates from my sample Student table, it is mandatory that I keep (update) the existing records in the TestScore table. Referring back to the sample TestScore table, you'll notice that there are records for John (ID=1) and John (ID=5). The problem is that John ID=1 and John ID=5 are the same person, so I want to change every TestScore record with ID=1 to ID=5. I do not want to lose the history of John's (or the other students') test scores.

Jed 2010-05-14 17:15:16

Answer 2

+1 A:

One way to do this is through a series of queries and a temporary table.

First, I would create the following Make Table query, which builds a mapping from each bad (disabled) StudentId to the correct (active) StudentId for the same name.

Select S1.StudentId As NewStudentId, S2.StudentId As OldStudentId 
Into zzStudentMap
From Student As S1
    Inner Join Student As S2
        On S2.Name = S1.Name
Where S1.Disabled = False
    And S2.StudentId <> S1.StudentId
    And S2.Disabled = True
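As a rough illustration of what the mapping table ends up holding, here is a sketch using Python's sqlite3 module. Several things are assumptions: SQLite stands in for the real database, so CREATE TABLE ... AS SELECT replaces the Make Table form and Disabled is stored as 0/1; the Student rows other than John (ID=1 and ID=5, taken from the comment above) are hypothetical; and build_student_map is a hypothetical helper name.

```python
import sqlite3

def build_student_map(conn):
    """Create zzStudentMap pairing each disabled StudentId with the active one of the same name."""
    conn.execute("""
        CREATE TABLE zzStudentMap AS
        SELECT S1.StudentId AS NewStudentId, S2.StudentId AS OldStudentId
        FROM Student AS S1
        INNER JOIN Student AS S2 ON S2.Name = S1.Name
        WHERE S1.Disabled = 0
          AND S2.StudentId <> S1.StudentId
          AND S2.Disabled = 1
    """)
    return conn.execute(
        "SELECT OldStudentId, NewStudentId FROM zzStudentMap ORDER BY OldStudentId"
    ).fetchall()

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Student (StudentId INTEGER PRIMARY KEY, Name TEXT, Disabled INTEGER)")
# Hypothetical sample: John was entered twice, and the older record is disabled.
conn.executemany(
    "INSERT INTO Student VALUES (?, ?, ?)",
    [(1, "John", 1), (2, "Mary", 0), (5, "John", 0)],
)
student_map = build_student_map(conn)
print(student_map)
```

Because the join matches on Name alone, two different students who share a name would be mapped onto each other, so it is worth reviewing the mapping table before using it.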

Next, you would use that temporary table to update the TestScore table with the correct StudentID.

Update TestScore
    Inner Join zzStudentMap
        On zzStudentMap.OldStudentId = TestScore.StudentId
Set TestScore.StudentId = zzStudentMap.NewStudentId
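To see the effect of the update on some data, here is a sketch using Python's sqlite3 module. It rests on several assumptions: SQLite stands in for the real database, and because the Update ... Inner Join form above is not SQLite syntax, the sketch swaps in a correlated subquery that does the same remapping; the TestScore columns and rows are hypothetical; and remap_scores is a hypothetical helper name.

```python
import sqlite3

def remap_scores(conn):
    """Point TestScore rows at the correct StudentId; return how many rows changed."""
    cur = conn.execute("""
        UPDATE TestScore
        SET StudentId = (SELECT NewStudentId FROM zzStudentMap
                         WHERE OldStudentId = TestScore.StudentId)
        WHERE StudentId IN (SELECT OldStudentId FROM zzStudentMap)
    """)
    return cur.rowcount

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE zzStudentMap (NewStudentId INTEGER, OldStudentId INTEGER)")
conn.execute("INSERT INTO zzStudentMap VALUES (5, 1)")  # John: old ID 1 -> new ID 5
conn.execute("CREATE TABLE TestScore (TestScoreId INTEGER PRIMARY KEY, StudentId INTEGER, Score INTEGER)")
# Hypothetical scores: one under John's old ID, one under his new ID, one for another student.
conn.executemany("INSERT INTO TestScore VALUES (?, ?, ?)", [(1, 1, 80), (2, 5, 90), (3, 2, 70)])

changed = remap_scores(conn)
scores = conn.execute(
    "SELECT TestScoreId, StudentId, Score FROM TestScore ORDER BY TestScoreId"
).fetchall()
print(changed, scores)
```

No score rows are deleted; only the StudentId on the rows that used the old ID changes, so the test history is preserved.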

Thomas 2010-05-14 17:03:58

I didn't think to use a temp table. Thanks, Thomas.

Jed 2010-05-14 18:24:57
