ansaurus

Question

Is this execution plan a motivation for rethinking my primary keys?

Answer 1

+3 A:

Could you please run the following queries and post their output:

SELECT  COUNT(*), COUNT(DISTINCT fieldname)
FROM    foo

SELECT  COUNT(*), COUNT(DISTINCT cId), COUNT(DISTINCT fId)
FROM    bar

This will help to estimate which indexes best suit your needs.
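One way to read those counts is as a ratio of distinct values to total rows. The sketch below reuses the same tables and columns; the result column aliases are made up for illustration. A ratio of 1.0 means every row holds a different non-NULL value (so a DISTINCT on that column would be redundant), while a ratio near 0 means the column alone is a poor candidate for a leading index column.

```sql
-- Fraction of distinct values per column: 1.0 means the column is unique.
-- NULLIF guards against division by zero on an empty table.
SELECT  COUNT(DISTINCT fieldname) * 1.0 / NULLIF(COUNT(*), 0) AS fieldname_selectivity
FROM    foo;

SELECT  COUNT(DISTINCT cId) * 1.0 / NULLIF(COUNT(*), 0) AS cId_selectivity,
        COUNT(DISTINCT fId) * 1.0 / NULLIF(COUNT(*), 0) AS fId_selectivity
FROM    bar;
```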

Meanwhile make sure you have the following indexes:

foo (FIELDNAME)
bar (cId, fId)

and rewrite your query:

SELECT  DISTINCT(fieldname)
FROM    foo f
WHERE   EXISTS (
        SELECT  1
        FROM    bar b
        WHERE   b.fId = f.id
                AND b.cId = @id
        )

This query should use an index on f.FIELDNAME to build the DISTINCT list and the index on bar to filter out the non-existent values.
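The two indexes named above could be created along these lines. The index names are hypothetical, and if the primary key of bar already starts with (cId, fId), the second index would be redundant.

```sql
-- Index names are placeholders; table and column names follow the answer.
CREATE INDEX IX_foo_fieldname ON foo (fieldname);

-- Skip this one if bar's primary key is already ordered as (cId, fId).
CREATE INDEX IX_bar_cId_fId ON bar (cId, fId);
```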

Quassnoi 2009-05-12 12:21:05

The big question is whether the query needs optimising or the app needs to be a bit less aggressive about calling the query.

Sam Saffron 2009-05-12 12:31:33

It never hurts to do both :)

Quassnoi 2009-05-12 12:42:29

How do you mean less aggressive? What I'm doing is building a Lucene index for fast searching. This query needs to be repeated for every @id I have, and I don't see any way to re-use previous results.

borisCallens 2009-05-12 12:46:09

@boris, bingo: get all the ids up front, insert them into a temp table or something, and select the whole caboodle as a set. Try to do things in sets, not one id at a time.

Sam Saffron 2009-05-12 12:48:48
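A minimal sketch of that set-based idea, assuming the ids are known before processing starts. The temp table #ids, the INT type, and the source table customers are all placeholders that do not come from the thread.

```sql
-- Hypothetical temp table holding every id to be processed.
CREATE TABLE #ids (id INT PRIMARY KEY);

-- Assumption: the ids come from some source table; "customers" is a placeholder.
INSERT INTO #ids (id)
SELECT  id
FROM    customers;

-- One pass over all ids instead of one query per @id.
SELECT  DISTINCT i.id, f.fieldname
FROM    #ids i
JOIN    bar b ON b.cId = i.id
JOIN    foo f ON f.id = b.fId;
```

The result carries the id alongside each fieldname, so the application can group rows per id instead of issuing one query per id.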

Thanks for the advice, I will run the queries as soon as the current processing is done. In the meantime, you don't think the clustered indexes are the issue here?

borisCallens 2009-05-12 12:50:20

I can't because each id depends on the result of the previous query.

borisCallens 2009-05-12 12:52:32

I find that when an INNER JOIN is re-written using EXISTS then it is often the case that the DISTINCT is no longer required.

onedaywhen 2009-05-12 13:00:08

No, it's required for this particular query. Otherwise you'll get duplicate fieldnames if they exist in foo. The plan will certainly change, if that's what you mean.

Quassnoi 2009-05-12 13:08:37

I don't see details of the design posted here but if fieldname is unique in foo then the DISTINCT could be removed ;-)

onedaywhen 2009-05-12 13:21:56
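For illustration, this is what the EXISTS query would look like without DISTINCT. It only returns the same result if fieldname is unique in foo, which the thread has not confirmed.

```sql
-- Assumes a unique constraint or index on foo.fieldname (not confirmed in the thread).
-- @id is the parameter from the original query.
SELECT  f.fieldname
FROM    foo f
WHERE   EXISTS (
        SELECT  1
        FROM    bar b
        WHERE   b.fId = f.id
                AND b.cId = @id
        );
```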

BTW is your re-written query missing some table correlation names e.g. foo AS f ... bar AS b... ?

onedaywhen 2009-05-12 13:23:13

@onedaywhen: That's why I asked for COUNT()'s, to see if it's unique or not. As for correlations, you're right, I missed them.

Quassnoi 2009-05-12 13:30:01

Answer 2

+1 A:

This kind of query looks familiar. I'm guessing here, but it's probably populating a combo box on a web/WinForms UI that is being hit pretty hard.

Perhaps you should be caching the results on the application side so you don't end up executing it so often. Worst-case scenario, you could cache this on SQL Server's side, but it's a massive kludge.

Sam Saffron 2009-05-12 12:29:51

Yes, I can see what you mean, but I don't feel that is the case in my current situation. Please see the OP for an update.

borisCallens 2009-05-12 12:48:48

Answer 3

A:

In most databases, a multi-column index isn't used for lookups unless the query filters on the first column of that index. You say that the customerId is part of every primary key, but you don't use it for the join in your query. To properly answer your question, we really need to see the CREATE TABLE output for foo and bar, or at least the output of SHOW INDEX FROM.

That said, your query may be faster if you change it like so:

select distinct(f.FIELDNAME) as fieldName
from foo f
inner join bar b
   on f.id = b.fId
   and f.cId = b.cId -- Using this part of the key will speed it up
where b.cId = @id;

My comment assumes that your primary key is ordered as "cId, fId". Effectively, that means the query can go straight to the matching cId in the index instead of checking every row.
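To illustrate the leading-column point, here is a rough sketch assuming an index or primary key on bar ordered as (cId, fId). The parameter @fieldId is hypothetical, and the actual plan depends on the optimizer and the data.

```sql
-- Assumes an index (or primary key) on bar ordered as (cId, fId).

-- Filters on the leading column cId: the index can be used for a seek.
SELECT  fId
FROM    bar
WHERE   cId = @id;

-- Filters only on the second column fId: the optimizer cannot seek on
-- (cId, fId) and typically falls back to scanning the index or table.
SELECT  cId
FROM    bar
WHERE   fId = @fieldId;   -- @fieldId is a hypothetical parameter
```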

Autocracy 2009-05-12 13:43:31

The CustomerId isn't used yet, but will be in the future. But I feel that it should have been part of the regular fields and an artificial key should have been used.

borisCallens 2009-05-12 14:23:52
