ansaurus

Question

Finding duplicate rows but skip the last result?

Answer 1

A:

SELECT ...
       ROW_NUMBER() OVER (PARTITION BY email ORDER BY emailid DESC) AS RN
FROM ...

...is a great starting point for such a problem. Never underestimate the power of ROW_NUMBER()!

Will A 2010-08-12 06:51:12

I dont see how this might help in my case, can u point me some more :-)?

grady 2010-08-12 07:55:04

Answer 2

A:

In the end I need 2 of the 3 rows to have the duplicate flag (duplicate column) set, the third should not have it set.

Thanks

grady 2010-08-12 07:56:53

Answer 3

A:

Using Sql Server 2005+ you could try something like (full example)

DECLARE @Table TABLE(
        ID INT IDENTITY(1,1),
        Email VARCHAR(20)
)

INSERT INTO @Table (Email) SELECT 'a'
INSERT INTO @Table (Email) SELECT 'b'
INSERT INTO @Table (Email) SELECT 'c'
INSERT INTO @Table (Email) SELECT 'a'
INSERT INTO @Table (Email) SELECT 'b'
INSERT INTO @Table (Email) SELECT 'a'

; WITH Duplicates AS (
        SELECT  Email,
                COUNT(ID) TotalDuplicates
        FROM    @Table
        GROUP BY    Email
        HAVING  COUNT(ID) > 1
)
, Counts AS (
        SELECT  t.ID,
                ROW_NUMBER() OVER(PARTITION BY t.Email ORDER BY t.ID) EmailID,
                d.TotalDuplicates
        FROM    @Table t INNER JOIN
                Duplicates d    ON  t.Email = d.Email
)
SELECT  ID,
        CASE
            WHEN EmailID = TotalDuplicates
                THEN 0
            ELSE TotalDuplicates - 1
        END Dups
FROM    Counts

astander 2010-08-12 08:29:43

Answer 4

A:

I managed to have a temporary table now, which looks like this:

mailid | rowcount | AmountOfDups
643921 | 1 | 3
643921 | 2 | 3
643921 | 3 | 3

Now, how could I decide that only the first 2 should be updated (by mailid) in the other table? The other table has mailid as well.

grady 2010-08-12 11:54:32

ansaurus

tags:

views:

answers:

Finding duplicate rows but skip the last result?

related questions