ansaurus

Question

SQL query on two tables - return rows in one table that don't have entries in the other

Answer 1

+2 A:

I'm not sure the syntax is valid in Access, but something like this should work:

select CategoryName from Categories
where ID_Category not in (
    select SuperCategoryID 
    from SuperCategories 
)

Blorgbeard 2009-04-11 05:24:39

This would work in Access AFAIK.

Cyril Gupta 2009-04-11 05:30:26

A subquery is too inefficient here. Go directly with an outer join for better efficiency.

Mike Pone 2009-04-11 05:34:20

Thank you. This works!

John at CashCommons 2009-04-11 06:30:53

There is no reason to apply "distinct" inside the subquery.

Bill Karwin 2009-04-11 07:19:44

For something like a list of categories, with around 100 entries at most (I assume), the two table scans required will account for 99.99% of the execution cost, and this is easier to read, IMO. Bill, I agree about distinct, though. Will remove.

Blorgbeard 2009-04-11 10:01:11
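
One caveat with the NOT IN form is worth a quick sketch. In engines that follow standard three-valued logic, a single NULL in the subquery result makes NOT IN match no rows at all. The example below uses Python's sqlite3 module with made-up sample rows, and the CategoryID column is an assumption, since the thread never shows the real schema. Access may behave differently, so treat this as an illustration rather than a statement about every engine.

```python
import sqlite3

# Hypothetical schema and rows; the thread does not show the real tables.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Categories (ID_Category INTEGER PRIMARY KEY, CategoryName TEXT);
    CREATE TABLE SuperCategories (
        ID_Super INTEGER PRIMARY KEY,
        CategoryID INTEGER,        -- assumed column: the child category
        SuperCategoryID INTEGER    -- the category acting as the parent
    );
    INSERT INTO Categories VALUES (1, 'Box'), (2, 'Red Box'), (3, 'Can');
    INSERT INTO SuperCategories VALUES (1, 2, 1);   -- Box is the parent of Red Box
""")

NOT_IN_QUERY = """
    SELECT CategoryName FROM Categories
    WHERE ID_Category NOT IN (SELECT SuperCategoryID FROM SuperCategories)
    ORDER BY ID_Category
"""

# With no NULLs in SuperCategoryID the query behaves as intended.
rows_before = [r[0] for r in conn.execute(NOT_IN_QUERY)]

# Add a row whose SuperCategoryID is NULL and run the same query again.
conn.execute("INSERT INTO SuperCategories VALUES (2, 3, NULL)")
rows_after = [r[0] for r in conn.execute(NOT_IN_QUERY)]

print(rows_before)  # ['Red Box', 'Can']
print(rows_after)   # []
```

If SuperCategoryID can be NULL, filtering the subquery with WHERE SuperCategoryID IS NOT NULL, or using NOT EXISTS, avoids the empty result.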

Answer 2

+3 A:

Include only those categories that are not super categories. A simple outer join does it:

select CategoryName from Categories LEFT OUTER JOIN
SuperCategories ON Categories.ID_Category = SuperCategories.SuperCategoryID
WHERE SuperCategories.SuperCategoryID is null

Mike Pone 2009-04-11 05:33:39

Thanks! This works. Now I just have to figure out why!

John at CashCommons 2009-04-11 06:32:12

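A left outer join takes every row from the left side of the join (that is, everything from Categories) and, from the SuperCategories table, only those records that match. Where nothing matches, the SuperCategories columns come back as NULL, which is what the WHERE clause tests for.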

Mike Pone 2009-04-11 14:30:50

Answer 3

+6 A:

SELECT
     CAT.ID_Category,
     CAT.CategoryName
FROM
     Categories CAT
WHERE
     NOT EXISTS
     (
          SELECT
               *
          FROM
               SuperCategories SC
          WHERE
               SC.SuperCategoryID = CAT.ID_Category
     )

Or

SELECT
     CAT.ID_Category,
     CAT.CategoryName
FROM
     Categories CAT
LEFT OUTER JOIN SuperCategories SC ON
     SC.SuperCategoryID = CAT.ID_Category
WHERE
     SC.ID_Super IS NULL

I'll also make the suggestion that your naming standards could probably use some work. They seem all over the place and difficult to work with.

Tom H. 2009-04-11 05:34:41

Thanks! This is my first venture into SQL so I have a lot to learn. I meant ID_ for primary key and BlahBlahID to indicate that it was a key in another table. Always open to suggestions...

John at CashCommons 2009-04-11 06:12:27

Thank you. This works as well!

John at CashCommons 2009-04-11 06:33:28
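
As a rough check that the two queries in this answer select the same rows, here is a small sketch using Python's sqlite3 module. The sample rows and the CategoryID column are assumptions made for illustration; only the table names and the ID_Category, CategoryName, ID_Super and SuperCategoryID columns come from the thread.

```python
import sqlite3

# Hypothetical sample data: Box is the parent of both Red Box and Blue Box.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Categories (ID_Category INTEGER PRIMARY KEY, CategoryName TEXT);
    CREATE TABLE SuperCategories (
        ID_Super INTEGER PRIMARY KEY,
        CategoryID INTEGER,        -- assumed column: the child category
        SuperCategoryID INTEGER    -- the category acting as the parent
    );
    INSERT INTO Categories VALUES
        (1, 'Box'), (2, 'Red Box'), (3, 'Blue Box'), (4, 'Can');
    INSERT INTO SuperCategories VALUES (1, 2, 1), (2, 3, 1);
""")

# First form: correlated NOT EXISTS.
not_exists = conn.execute("""
    SELECT CAT.ID_Category, CAT.CategoryName
    FROM Categories CAT
    WHERE NOT EXISTS (
        SELECT * FROM SuperCategories SC
        WHERE SC.SuperCategoryID = CAT.ID_Category
    )
    ORDER BY CAT.ID_Category
""").fetchall()

# Second form: LEFT OUTER JOIN, keeping only the unmatched rows.
left_join = conn.execute("""
    SELECT CAT.ID_Category, CAT.CategoryName
    FROM Categories CAT
    LEFT OUTER JOIN SuperCategories SC
        ON SC.SuperCategoryID = CAT.ID_Category
    WHERE SC.ID_Super IS NULL
    ORDER BY CAT.ID_Category
""").fetchall()

print(not_exists)  # [(2, 'Red Box'), (3, 'Blue Box'), (4, 'Can')]
print(left_join)   # same rows
```

Box matches two SuperCategories rows in the join, but both of those rows are filtered out by the IS NULL test, so the join form does not produce duplicates here.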

Answer 4

+3 A:

Hi John W,

Mike Pone's answer works because he joins the "Categories" table with the "SuperCategories" table using a "LEFT OUTER JOIN". This takes all entries from "Categories" and adds columns from "SuperCategories" to those where the link exists. Where it does not exist (i.e. where there is no entry in "SuperCategories"), you'll get NULLs for the SuperCategories columns, and that's exactly what Mike's query then checks for.

If you wrote the query like so:

SELECT c.CategoryName, s.ID_Super 
FROM Categories c 
LEFT OUTER JOIN SuperCategories s ON c.ID_Category = s.SuperCategoryID

you would get something like this:

CategoryName      ID_Super
Box               1
Box               2
Red Box           NULL
Blue Box          3
Blue Plastic Box  NULL
Can               4
Tin Can           NULL

So this basically gives you your answer: all the rows where ID_Super from the LEFT OUTER JOIN is NULL are those that don't have any entries in the SuperCategories table. All clear? :-)

Marc

marc_s 2009-04-11 07:15:51

Yes, that clears it up. Thank you!

John at CashCommons 2009-05-06 21:27:15
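
The output in this answer can be reproduced with a small sketch in Python's sqlite3 module. The rows below are invented so that the join result matches the table shown above, and the CategoryID column is an assumption; the real data and schema are not given in the thread.

```python
import sqlite3

# Hypothetical rows chosen to reproduce the example output in the answer.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Categories (ID_Category INTEGER PRIMARY KEY, CategoryName TEXT);
    CREATE TABLE SuperCategories (
        ID_Super INTEGER PRIMARY KEY,
        CategoryID INTEGER,        -- assumed column: the child category
        SuperCategoryID INTEGER    -- the category acting as the parent
    );
    INSERT INTO Categories VALUES
        (1, 'Box'), (2, 'Red Box'), (3, 'Blue Box'),
        (4, 'Blue Plastic Box'), (5, 'Can'), (6, 'Tin Can');
    INSERT INTO SuperCategories VALUES
        (1, 2, 1),   -- Box is the parent of Red Box
        (2, 3, 1),   -- Box is the parent of Blue Box
        (3, 4, 3),   -- Blue Box is the parent of Blue Plastic Box
        (4, 6, 5);   -- Can is the parent of Tin Can
""")

# The unfiltered join: unmatched categories carry NULL (None) for ID_Super.
joined = conn.execute("""
    SELECT c.CategoryName, s.ID_Super
    FROM Categories c
    LEFT OUTER JOIN SuperCategories s ON c.ID_Category = s.SuperCategoryID
    ORDER BY c.ID_Category, s.ID_Super
""").fetchall()

# Adding the IS NULL filter keeps only categories with no SuperCategories entry.
unmatched = [r[0] for r in conn.execute("""
    SELECT c.CategoryName
    FROM Categories c
    LEFT OUTER JOIN SuperCategories s ON c.ID_Category = s.SuperCategoryID
    WHERE s.ID_Super IS NULL
    ORDER BY c.ID_Category
""")]

for name, id_super in joined:
    print(name, id_super)
print(unmatched)  # ['Red Box', 'Blue Plastic Box', 'Tin Can']
```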

Answer 5

A:

I always take the outer join approach, as marc_s suggests. There is a lot of power in OUTER JOINs. Oftentimes I'll have to do a FULL OUTER JOIN to check data on both sides of the query.

You should also look at the ISNULL function. If I am doing a query where data can be in either table A or table B, I use ISNULL to return a value from whichever column is not NULL.

Here's an example:

SELECT
     -- take the local value, falling back to the remote one when it is NULL
     ISNULL(a.[date_time], b.[date_time]) AS [Time Stamp],
     ISNULL(a.[ip], b.[ip]) AS [Device Address],
     ISNULL(a.[total_messages], 0) AS [Local Messages],
     ISNULL(b.[total_messages], 0) AS [Remote Messages]
FROM [Local_FW_Logs] a
FULL OUTER JOIN [Remote_FW_Logs] b
     ON b.[ip] = a.[ip]

OhioDude 2009-05-07 11:43:08
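
To see what the FULL OUTER JOIN with ISNULL returns, here is a sketch in Python's sqlite3 module. Two substitutions are made and are not part of the original answer: COALESCE stands in for the two-argument ISNULL, and the full outer join is emulated with a LEFT OUTER JOIN plus UNION ALL, because older SQLite builds do not support FULL OUTER JOIN. The log rows are invented for illustration.

```python
import sqlite3

# Hypothetical log rows; only the table and column names come from the answer.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Local_FW_Logs  (date_time TEXT, ip TEXT, total_messages INTEGER);
    CREATE TABLE Remote_FW_Logs (date_time TEXT, ip TEXT, total_messages INTEGER);
    INSERT INTO Local_FW_Logs VALUES
        ('2009-05-07 10:00', '10.0.0.1', 5),
        ('2009-05-07 10:00', '10.0.0.2', 7);
    INSERT INTO Remote_FW_Logs VALUES
        ('2009-05-07 10:05', '10.0.0.2', 3),
        ('2009-05-07 10:05', '10.0.0.3', 9);
""")

rows = conn.execute("""
    -- Part 1: every local row, with remote columns where the ip matches.
    SELECT COALESCE(a.date_time, b.date_time) AS time_stamp,
           COALESCE(a.ip, b.ip)               AS device_address,
           COALESCE(a.total_messages, 0)      AS local_messages,
           COALESCE(b.total_messages, 0)      AS remote_messages
    FROM Local_FW_Logs a
    LEFT OUTER JOIN Remote_FW_Logs b ON b.ip = a.ip
    UNION ALL
    -- Part 2: remote rows that have no local counterpart.
    SELECT b.date_time, b.ip, 0, COALESCE(b.total_messages, 0)
    FROM Remote_FW_Logs b
    LEFT OUTER JOIN Local_FW_Logs a ON a.ip = b.ip
    WHERE a.ip IS NULL
    ORDER BY device_address
""").fetchall()

for row in rows:
    print(row)
# ('2009-05-07 10:00', '10.0.0.1', 5, 0)
# ('2009-05-07 10:00', '10.0.0.2', 7, 3)
# ('2009-05-07 10:05', '10.0.0.3', 0, 9)
```

The device seen only locally gets 0 remote messages, the device seen only remotely gets 0 local messages, and the shared device shows both counts on one row.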
