ansaurus

Question

[SQL] How can I check for a certain value in all aggregated rows?

Answer 1

+3 A:

I'd switch from MAX to SUM (with 1 rather than Y) so you are saying "Count the number of groups this person is in where the group name is Developers".

Then the pattern is similar to "count the number of sales where the purchase value was more than $30".

You can, if desired, then add another expression to say "If the count is greater than zero then 'yes' this person is a developer". Very explicit and probably unnecessary though.
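
A minimal sketch of the SUM approach described above, in Oracle syntax. The `memberships` inline view and its sample rows are hypothetical stand-ins for the question's joined user/group rows, and the column names `dev_count` and `is_dev` are only illustrative:

```sql
-- Hypothetical sample data: one row per (user, group) membership.
with memberships as (
    select 1 as user_id, 'alice' as user_name, 'Developers' as group_name from dual union all
    select 1, 'alice', 'Content Management' from dual union all
    select 2, 'bob', 'Sales' from dual
)
select
    m.user_id,
    m.user_name,
    count(*) as group_count,
    -- Count the memberships whose group name is 'Developers'.
    sum(case when m.group_name = 'Developers' then 1 else 0 end) as dev_count,
    -- Optional, more explicit flag derived from that count.
    case
        when sum(case when m.group_name = 'Developers' then 1 else 0 end) > 0
        then 'Y' else 'N'
    end as is_dev
from memberships m
group by m.user_id, m.user_name
order by m.user_id
```

With these sample rows, alice gets a `dev_count` of 1 and an `is_dev` of 'Y', while bob gets 0 and 'N'. The result does not depend on how the flag strings happen to sort.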

Gary 2010-09-16 23:05:22

Upvoted because `SUM()` seems like less of a workaround. But, I am holding out on accepting any answers for now. Thanks!

Max 2010-09-16 23:18:30

Answer 2

+2 A:

SELECT  user.user_id,
        user.user_name,
        COUNT(*) group_count,
        COUNT(DISTINCT DECODE(group_name, 'Developers', 'Y', NULL)) AS is_developer,
        COUNT(DISTINCT DECODE(group_name, 'Content Management', 'Y', NULL)) AS is_content_manager
FROM    the_query

As for the ANY, it's a predicate similar to IN, not a function:

SELECT  *
FROM    dual
WHERE   'baz' = ANY('foo', 'bar', 'baz')

Quassnoi 2010-09-16 23:46:06

Answer 3

A:

I prefer Gary's answer, but if you want to stick with a boolean return you could make the ordering more explicit by returning 'N' instead of null.

select
    user.user_id,
    user.user_name,
    count(*) as group_count,
    max( case group.group_name when 'Developers' then 'Y' else 'N' end )
        as is_dev,
    max( case group.group_name when 'Content Management' then 'Y' else 'N' end )
        as is_cm
from
    user
        inner join xref on user.user_id = xref.user_id
        inner join group on group.group_id = xref.group_id
group by user.user_id, user.user_name

(+1 for nicely written question)

Nick Pierpoint 2010-09-17 11:59:13

Thanks for the +1 and answer. Unfortunately, the 'N' falls victim to the same problem I mentioned in my question, which is that you're relying on the fact that 'Y' just happens to sort higher than 'N'. For example, if instead of 'Y' and 'N' you were using 'Always' and 'Never', then any user who belongs to at least one other group would show 'Never', because `max` will always pick it over 'Always'. It's an unintuitive "gotcha" which, I believe, is part of what makes this a true anti-pattern.
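
A small sketch of that gotcha in Oracle syntax, using two hypothetical inline rows in place of a user's real group memberships:

```sql
-- max() on strings compares them by sort order, so 'Never' beats 'Always'
-- even though one of the rows is a match.
select max(flag) as result
from (
    select 'Always' as flag from dual union all  -- the matching group
    select 'Never'  as flag from dual            -- some other group
)
-- result: Never
```

Swap the literals for 'Y' and 'N' and the same query returns 'Y', which is the only reason the flag version gives the intended answer.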

Max 2010-09-17 16:11:56

As I said, I prefer the counting, but I think you're pretty safe using Y and N as a flag.

Nick Pierpoint 2010-09-17 22:49:23
