ansaurus

Question

Suitable indexes for sorting in ranking functions

Answer 1

+1 A:

The ideal index for this query would have key columns uidNode, dtCreated, with all remaining columns in the table as included columns, so that the index is covering (you are returning r.*). If the query will generally return only a relatively small number of rows (which seems likely given the WHERE r.ix = 1 filter), making the index covering might not be worthwhile: the cost of the key lookups it avoids might not outweigh the negative effect of a large index on CUD (insert, update, delete) statements.
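As a rough sketch of the two options weighed above: the table name dbo.MyTable and the extra columns col1 and col2 are placeholders, since neither the real table name nor its remaining columns appear in this thread.

```sql
-- Hypothetical names: dbo.MyTable, col1 and col2 are placeholders.

-- Option 1: covering index. The key columns support the per-uidNode ordering
-- by dtCreated; INCLUDE stores the remaining columns at the leaf level,
-- so the query can be answered from the index alone.
CREATE NONCLUSTERED INDEX IX_MyTable_uidNode_dtCreated_covering
    ON dbo.MyTable (uidNode, dtCreated)
    INCLUDE (col1, col2);

-- Option 2: narrow index. Smaller and cheaper to maintain on writes,
-- but each returned row needs a lookup into the base table
-- to fetch the columns that are not in the index.
CREATE NONCLUSTERED INDEX IX_MyTable_uidNode_dtCreated
    ON dbo.MyTable (uidNode, dtCreated);
```

Only one of the two would normally be created; comparing the actual execution plans with each in place is the safest way to decide.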

Martin Smith 2010-10-12 11:39:07

Thanks! So if I don't actually do `r.*` (which was to reduce noise as well) you're saying that I should include all rows returned here in the index, right? Would it make sense to create the index as clustered (and not include all columns then)? Inserts will be quite rare (maybe 1 insert in the same time 10000 queries are done), deletions and updates will never take place. And does it matter if I make the index on `dtCreated` ascending or descending?

Lucero 2010-10-12 11:46:35

*you're saying that I should include all rows returned here in the index, right?* **All *columns*, yes.** *Would it make sense to create the index as clustered (and not include all columns then)?* **If you don't have a more obvious candidate for a clustered index then possibly. The ideal clustered index key is narrow, unique, stable (not updated often), and ever increasing.** *And does it matter if I make the index on dtCreated ascending or descending?* **Not for the query you have shown. It might make a difference if you have an `ORDER BY` on the results of the `SELECT` though.**
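A minimal sketch of the clustered and descending variants discussed here, again assuming a placeholder table name dbo.MyTable. The UNIQUE keyword assumes that the combination (uidNode, dtCreated) never repeats, which would need to hold for the real data.

```sql
-- Hypothetical table name: dbo.MyTable.

-- Clustered variant: the table rows themselves are stored in key order,
-- so every column is available without an INCLUDE list.
-- A table can have only one clustered index, so this fails if one already exists.
CREATE UNIQUE CLUSTERED INDEX CIX_MyTable_uidNode_dtCreated
    ON dbo.MyTable (uidNode, dtCreated);

-- Non-clustered variant with a descending date key. Sort direction per column
-- mainly matters for an ORDER BY that mixes directions,
-- such as ORDER BY uidNode ASC, dtCreated DESC.
CREATE NONCLUSTERED INDEX IX_MyTable_uidNode_dtCreated_desc
    ON dbo.MyTable (uidNode ASC, dtCreated DESC);
```

An index keyed (uidNode ASC, dtCreated ASC) can be scanned backwards to serve an all-descending order, which is why direction is often irrelevant when every key column is sorted the same way.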

Martin Smith 2010-10-12 12:02:30

Yeah, I meant columns, not rows, of course! If I were to cluster on this, the key would be more or less narrow (guid+datetime), unique and stable, but not ever increasing. I'll think about it, thank you very much for the input.

Lucero 2010-10-12 12:10:12

Answer 2

A:

The window/rank functions on SQL Server 2005 are sometimes not optimized that well (based on other answers here). They are apparently better in SQL Server 2008.

An alternative is something like the query below. I'd have a non-clustered index on (uidNode, dtCreated) that INCLUDEs any other columns required by the SELECT, subject to what Martin Smith said about lookups.

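-- MyTable is a placeholder: the real table name is not given in this answer
WITH MaxPerUid AS
(
    -- Latest dtCreated before @dt for each uidNode
    SELECT
       MAX(r.dtCreated) AS MAXdtCreated, r.uidNode
    FROM
       MyTable r
    WHERE
       r.dtCreated < @dt
    GROUP BY
       r.uidNode
)
SELECT
    ...
FROM
   MaxPerUid M
   JOIN
   -- Join back to the base table to fetch the full row for each maximum
   MyTable R ON M.uidNode = R.uidNode AND M.MAXdtCreated = R.dtCreated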
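
For comparison, here is a sketch of the ranking-function form that the query above is meant to replace. Its exact shape is an assumption reconstructed from the r.* and r.ix = 1 fragments quoted in the first answer, and dbo.MyTable is a placeholder table name.

```sql
-- Assumed shape of the ranking-function query; dbo.MyTable is a placeholder.
DECLARE @dt datetime;
SET @dt = GETDATE();  -- example cutoff value

SELECT r.*
FROM (
    -- Number the rows of each uidNode from newest to oldest
    SELECT t.*,
           ROW_NUMBER() OVER (PARTITION BY t.uidNode
                              ORDER BY t.dtCreated DESC) AS ix
    FROM dbo.MyTable AS t
    WHERE t.dtCreated < @dt
) AS r
WHERE r.ix = 1;  -- keep only the newest row per uidNode
```

The two forms differ when several rows of one uidNode share the same latest dtCreated: ROW_NUMBER keeps exactly one of them, while the MAX-and-join form returns all of the tied rows.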

gbn 2010-10-30 13:21:03
