ansaurus

Question

How to return as little rows as possible from related table

Answer 1

+2 A:

You want to do something like this

SELECT *
FROM OrderDetails
WHERE
    ShopID IN ( @listOfShopIds )
    AND
    OrderID IN (
        SELECT OrderID
        FROM Orders
        WHERE
            OrderDate BETWEEN @dateFrom AND @dateTo
    )

David Hedlund 2010-07-26 07:13:57

`WHERE OrderID IN ( SELECT ... )` can easily be converted to a join, which would be much more efficient.

Winston Smith 2010-07-26 07:26:42

@Winston Smith: it could easily be written as an inner join, yes. in fact, that is how the sql server will merge the results of a select like this one. it will use the same statistics, come up with the same execution plan, and perform exactly as well (since it's doing the exact same thing) in the two solutions.

David Hedlund 2010-07-26 07:39:10

Thanks a lot.Actually, I did a join before as I stated in the question, but my problem was that I added this column to the returned table which caused the problems with the updates. Your answer made me realize my mistake. Thanks.

Kharlos Dominguez 2010-07-26 12:30:41

Answer 2

A:

@David it's probably better to write code which expresses your intent and reflects the more performant algorithm, rather than relying on implementation details of the engine to perform the optimization.

Winston Smith 2010-07-26 07:58:12

@Winston Smith: uh, didn't notice this one until now. however you write your sql query, you're relying on the engine to execute it in a performant manner, utilizing indexes and statistics at hand. whether you phrase it as a join or a subquery, you're dealing with two separate recordsets that will be magically merged by the engine based on a given criteria. the means by which the two sets are merged in these cases is called *nested loops*. nested loops is what is going on both in the join and the subquery. its cost will in most scenarios be negligible in comparison with filtering out the ...

David Hedlund 2010-07-26 11:14:46

two subsets that are subject to the merge in the first place, but that is peripheral to the point - the key point being that whatever its cost is, it is the *same* in both ways of writing the code, because the two ways *are* the same (in this context). so there is no one way that is more performant, and there is no one of the two ways of writing the query that is closer than the other to saying "Hey SQL server, do an index seek on this table and another one on this table, using index such and such, and then merge the two sets using nested loops into a single result". we simply *have* to ...

David Hedlund 2010-07-26 11:15:05

rely on the engine to figure that out. at the end it's all a question of preference. and i honestly do think that "where order id is the id of one of the orders within this span" expresses the intended result *at least* as well as any join would.

David Hedlund 2010-07-26 11:15:22

ansaurus

tags:

views:

answers:

How to return as little rows as possible from related table

related questions