ansaurus

Question

need to find differences between 2 identically structured sql tables

Answer 1

+2 A:

You could do this using two left joins and regular join for the updates. This will show you, with respect to TableA, which rows are added, deleted, and updated. Note these could be contatenated into a single result set.

select b.*, 'added'
from tableb b
   left outer join tablea a on b.IssueID = a.IssueID
where a.IssueID is null

select a.*, 'deleted'
from tablea a
    left outer join tableb b on a.IssueID = b.IssueID
where b.IssueID is null

select a.*, 'updated'
from tablea a
    join tableb b on a.IssueID = b.IssueID
where a.Status <> b.Status or a.Who <> b.Who

Note for the latter, if you need to handle null values, I think you'll need to adjust the where clause.

If the tables are large and this is an ongoing operation, you should consider adding indexes on the join columns.

tvanfosson 2010-05-05 12:21:25

thanks for the quick reply, I missed a point in my question.Number of rows updated (I had repeated the delete part).can you help on this part?

balalakshmi 2010-05-05 12:23:45

this will not find updates

KM 2010-05-05 12:45:12

@KM -- the question was modified after I posted my answer. To find updates you need a third statement.

tvanfosson 2010-05-05 12:47:07

I do it all in one statement, see my answer http://stackoverflow.com/questions/2773050/need-to-find-differences-between-2-identically-structured-sql-tables/2773099#2773099

KM 2010-05-05 13:03:17

Answer 2

A:

unless you want to copy the entire contents of each table into C#, do it in the database.

This will find all missing TableA or TableB rows, as well as any changes:

;WITH AllPKs AS
(
SELECT ID FROM TableA
UNION ID FROM TableB
)
SELECT
    z.ID,a.*, b.*
    FROM AllPKs                z
        LEFT OUTER JOIN TableA a ON z.ID=a.ID
        LEFT OUTER JOIN Tableb b ON z.ID=b.ID
    WHERE A.ID IS NULL OR B.ID IS NULL OR a.Col1!=b.Col1 OR a.Col2!=b.Col2 OR...

KM 2010-05-05 12:24:18

OP doesn't say which database they are using, but if they can't use a CTE, then just make it a derived table: `SELECT z.ID,a.*, b.* FROM (SELECT ID FROM TableA UNION ID FROM TableB) z LEFT OUTER JOIN`...

KM 2010-05-05 12:32:49

Answer 3

+1 A:

I've heard good things about Redgate SQL Data Compare

Mark Maslar 2010-05-05 12:24:44

You'd need to use the 'Data Compare' product for this. 'Compare' works against schemas.

Even Mien 2010-05-05 12:34:56

Answer 4

A:

In order to find updates, you will need to compare the value of each column to the other table.

select 'updated',
   a.IssueID as IssueID_A, b.IssueID as IssueID_B
   a.Status as Status_A, b.Status as Status_B,
   a.Who as Who_A, b.Who as Who_B
from tablea a
inner join tableb b 
   on a.IssueID = b.IssueID 
where a.Status <> b.Status or a.Who <> b.Who

Even Mien 2010-05-05 12:33:11

this will not find inserts/deletes

KM 2010-05-05 12:45:56

Answer 5

A:

In general, I would recommend a product like Redgate SQL Data Compare but as a one off, you could use a script like this:

-- Create some tables and data for testing purposes
USE [tempdb]
SET NOCOUNT ON
GO

DROP TABLE [Issues1]
DROP TABLE [Issues2]
GO

CREATE TABLE [Issues1] ([IssueID] int, [Status] varchar(max), [Who] varchar(max))
CREATE TABLE [Issues2] ([IssueID] int, [Status] varchar(max), [Who] varchar(max))
GO

INSERT [Issues1] VALUES (1, 'aaa', 'bbb')
INSERT [Issues1] VALUES (2, 'ccc', 'ddd')
INSERT [Issues1] VALUES (3, 'eee', 'fff')
GO

INSERT [Issues2] VALUES (1, 'aaa', 'bbb')
INSERT [Issues2] VALUES (3, 'ggg', 'hhh')
INSERT [Issues2] VALUES (4, 'iii', 'iii')
GO

-- **** START OF ANSWER PROPER ****

-- Create some temporary variables to store the change details
DECLARE @Inserts TABLE ([IssueID] int, [Status] varchar(max), [Who] varchar(max))
DECLARE @Updates TABLE ([IssueID] int, [OldStatus] varchar(max), [NewStatus] varchar(max), [OldWho] varchar(max), [NewWho] varchar(max))
DECLARE @Deletes TABLE ([IssueID] int, [Status] varchar(max), [Who] varchar(max))

-- Find all rows that exist in Issues2 but do not exist in Issues1
-- (matching on ID)
INSERT @Inserts
SELECT *
FROM [Issues2]
WHERE [IssueID] NOT IN
(
    SELECT
        [IssueID]
    FROM [Issues1]
)

-- Find all rows existing in both Issues1 and Issues2 (matching on ID)
-- and where either Status or Who has changed
INSERT @Updates
SELECT
    [Issues1].[IssueID],
    [Issues1].[Status],
    [Issues2].[Status],
    [Issues1].[Who],
    [Issues2].[Who]
FROM [Issues1]
    INNER JOIN [Issues2] ON [Issues2].[IssueID] = [Issues1].[IssueID]
        AND
        (
            [Issues2].[Status] != [Issues1].[Status]
                OR [Issues2].[Who] != [Issues1].[Who]
        )

-- Find all rows that exist in Issues1 but do not exist in Issues2
-- (matching on ID)
INSERT @Deletes
SELECT *
FROM [Issues1]
WHERE [IssueID] NOT IN
(
    SELECT
        [IssueID]
    FROM [Issues2]
)

-- Output the results
SELECT
    (SELECT COUNT(*) FROM @Inserts) AS [Number Inserted],
    (SELECT COUNT(*) FROM @Updates) AS [Number Updated],
    (SELECT COUNT(*) FROM @Deletes) AS [Number Deleted]

SELECT 'INSERTED', * FROM @Inserts
SELECT 'UPDATED', * FROM @Updates
SELECT 'DELETED', * FROM @Deletes
GO

Daniel Renshaw 2010-05-05 12:48:47

ansaurus

tags:

views:

answers:

need to find differences between 2 identically structured sql tables

related questions