ansaurus

Question

Answer 1

A:

Assuming:

there is no relevant information missing
ItemIDs are PKs, and therefore unique
You don't want GroupIDs where you have repeated group/item combinations

This should work:

select GroupID
from GroupItems
inner join ItemMaster
    on GroupItems.ItemID = ItemMaster.ItemID
inner join GroupMaster
    on GroupItems.GroupID = GroupMaster.GroupID
group by GroupID
having count(*) = (select count(*) from ItemList)

If there was a guarantee in GroupItems of unique group/item combinations, the join would be unneccessary.

Harper Shelby 2009-10-29 14:39:44

This solution returns both A and B

Irwin M. Fletcher 2009-10-29 14:43:53

That one doesn't work. On the sample data, it returns A and B. Only B should be returned. In T-Sql, the COUNT() function requires a parameter (typically *).I don't think I've missed anything relevant. If two groups contain the exact same items as ItemList, they should both be returned.

jsr 2009-10-29 14:47:10

I see that now - however, your example appears (to me) to have an inconsistency that causes this. You have an ItemID in GroupItems that doesn't exist in ItemList. Is this expected? It seems to violate the concepts - though you haven't listed them explicitly, if I see an ItemList and a GroupItems table, I would expect the GroupItems table to have a foreign key relationship to ItemList (and to GroupList, for that matter).

Harper Shelby 2009-10-29 15:07:28

Sorry for the confusion. Assume that the ItemID column in the GroupItems table and the ItemID in the ItemList table both have a foreign key to a third table "ItemMaster". There is no foreign key relationship between GroupItems and ItemList tables. It is common for GroupItems.ItemID to have values that are not contained in the ItemList table.

jsr 2009-10-29 15:14:15

OK - 2 joins (to ItemMaster, and a guessed-at GroupMaster) to ensure that we've got only one entry per Group/Item combo, then match the counts. That ought to work.

Harper Shelby 2009-10-29 16:14:47

Answer 2

A:

Assuming :

ItemID value can be only > 0

SELECT t.GroupID
FROM (
  SELECT GroupItems.GroupID
        ,count(1) as groupItemsCount 
        ,min(IsNull(ItemList.ItemID, -1)) as minVal
  FROM GroupItems
      LEFT JOIN ItemList
              ON (GroupItems.ItemID = ItemList.ItemID)
  GROUP BY GroupID
) t
WHERE t.groupItemsCount = (SELECT COUNT(1) FROM ItemList)
  AND (t.minVal > 0)

DzheiZee 2009-10-29 15:48:59

ItemID will always be > 0. I hadn't considered using a LEFT JOIN. I'll try it out and see how it performs compared to my original function.

jsr 2009-10-29 16:14:58

Answer 3

A:

Have you considered creating an indexed view to aggregate the counts on GroupItems?

CREATE VIEW GroupCounts (groupId, GroupCount) with SCHEMABINDING
AS
SELECT groupId, COUNT_BIG(1) /* I use 1 instead of asterisk by convention */
FROM GroupItems
GROUP BY groupId

CREATE CLUSTERED INDEX IX_GroupCounts on GroupCounts(groupId)

With this, you can use a similar query to the one you have, but it should have much better performance.

SELECT GS.groupId FROM GroupItems AS GI
INNER JOIN ItemList AS IL ON IL.ItemID = GI.ItemID
INNER JOIN GroupCounts AS GS ON GS.GroupID = GI.GroupID  
GROUP BY GS.GroupID 
HAVING COUNT(1) = groupCount;

Jeff Meatball Yang 2009-10-29 16:32:46

Answer 4

A:

Try:

   SELECT gi.groupid
     FROM GROUPITEMS gi
LEFT JOIN ITEMLIST il ON il.itemid = gi.itemid
     JOIN (SELECT COUNT(*) 'num_items' FROM ITEMLIST) c
 GROUP BY gi.groupid
   HAVING COUNT(*) = c.num_items

OMG Ponies 2009-10-29 17:46:37

ansaurus

tags:

views:

answers:

T-SQL: finding a group by its members

related questions