ansaurus

Question

How to get unique set of rows from SQL where uniqueness is defined by 2 columns?

Answer 1

+4 A:

It depends on what you want to do with non-unique rows. If you want to not have them in the result set you could use group by and having:

select Name, Parent, Max(Category) 
from Table
group by Name, Parent
having count(*) = 1

You need the Max(Category) because you aren't grouping by that column, even though there will only be one row per Name and Parent.

If, though, you want to include non-unique rows in the result, similar to:

select distinct Name, Parent, Category from Table

except that two rows with the same Name and Parent but different Category only return a single row. In that case you need to decide what to show for Category, since more than one row will be condensed down to one. You could still use Max(Category) or Min(Category) and group by, but leave off the having.

select Name, Parent, Max(Category) 
from Table
group by Name, Parent

Adam Ruth 2010-02-08 23:39:23

That's only going to return one row, though, the "max" category for a given Name/Parent.

Michael Todd 2010-02-08 23:40:08

Which is what he wants, all of the unique rows (based on name and parent) and the category each of those rows belongs to.

Adam Ruth 2010-02-08 23:41:59

Might there not be more than one category included for a given Name/Parent? If so, then more than one category should be returned. (Guess we would need to find that out from the OP.)

Michael Todd 2010-02-08 23:44:23

The way the question is worded, it seems he wants all of the unique rows. But I could be misreading it.

Adam Ruth 2010-02-08 23:46:49

I do not think you should have "having count(*) = 1" clause, which will filter out ALL rows with duplicate Name + Parent, rather than leaving one of them behind.

hongliang 2010-02-08 23:53:09

My reading of the question is that's what he wants, but I could be wrong, so I clarified it.

Adam Ruth 2010-02-08 23:54:36

+1 Nice answer. I think it only works if you select exactly one non-grouped column. If the OP is looking for both ID and Category, this won't guarantee both come from the same row

Andomar 2010-02-09 00:03:01

True. It depends on what he wants the other columns for, values from two different rows may be okay.

Adam Ruth 2010-02-09 00:15:09

Thanks for all the answers! Yes, I need unique rows, as defined by name and parent (and actually by category). Even if the other values are not unique, it does not matter; just need the first one.

Donaldinio 2010-02-09 04:07:24

Answer 2

+3 A:

This query finds all rows where no other row has the same name and parent. If two rows have parent set to NULL, these rows are not considered to have the same parent.

SELECT T1.*
FROM Table1 T1
LEFT JOIN Table1 T2
ON T1.ID != T2.ID AND T1.Name = T2.Name AND T1.Parent = T2.Parent
WHERE T2.ID IS NULL

Mark Byers 2010-02-08 23:45:36

+1 This would work also

Andomar 2010-02-09 00:08:50

I'm surprised you accept this answer as it doesn't satisfy the extra specification you added 'If the rows have the same parent and name, return either.' I think Andomar's first suggestion does though.

Mark Byers 2010-02-10 15:17:17

Answer 3

+2 A:

You could use the row_number function to partition by Name and Parent, like:

select *
from (
    select
        row_number() over (partition by Name, Parent 
                           order by Name, Parent) as rn
    ,   *
    from YourTable
) sub
where rn = 1 -- Only first row for a name/parent combination

If you're looking to select only rows that are unique, in the sense that no other rows with the same name and parent exist, try:

select *
from YourTable a
where (
    select count(*)
    from YourTable b
    where a.Name = b.Name
    and a.Parent = b.Parent
) = 1

Andomar 2010-02-08 23:58:11

+1 for the row_number suggestion. Your `order by` is rather useless - it would make more sense to order by one of the columns not in the partition, id for example. I've asked a clarifying question about this as a comment to the question but haven't received an answer yet, so it's not clear which row he wants in this situation.

Mark Byers 2010-02-09 00:26:43

Answer 4

+1 A:

select x,y,z
from tablename t1
where not exists (select 1 from tablename t2 where t2.name = t1.name and t1.parent = t2.parent and t2.id <> t1.id)

may run slow depending on table size

BioBuckyBall 2010-02-08 23:59:43

+1 Or try `select *` :)

Andomar 2010-02-09 00:10:45

ansaurus

tags:

views:

answers:

How to get unique set of rows from SQL where uniqueness is defined by 2 columns?

related questions