views: 543
answers: 3

Right, this is an odd one.

We have a web service that returns data to a Silverlight client. The queries are generated against a SQL Server 2008 database using Entity Framework. Most of these queries are date-range based: pull results between this date and that, for example. Also, a view is used to make things a little easier.

We have noticed that when issuing a specific set of queries that start on or after a certain date, execution will be very slow. This date is the 5th of November, in any year. If our start date is one day earlier, execution is instant. So, 5th to 6th: slow. 4th to 6th: quick.

Here is the SQL that EF generates:

THIS QUERY WILL BE VERY SLOW (30 seconds)

    SELECT 
    1 AS [C1], 
    [GroupBy1].[K1] AS [Name], 
    [GroupBy1].[A1] AS [C2]
    FROM ( SELECT 
    [Extent1].[Name] AS [K1], 
    SUM([Extent1].[Value]) AS [A1]
    FROM (SELECT 
      [view_answers].[Value] AS [Value], 
      [view_answers].[Comment] AS [Comment], 
      [view_answers].[NewStockist] AS [NewStockist], 
      [view_answers].[NewDistPoint] AS [NewDistPoint], 
      [view_answers].[VoucherUsed] AS [VoucherUsed], 
      [view_answers].[CashTotal] AS [CashTotal], 
      [view_answers].[AnswerType] AS [AnswerType], 
      [view_answers].[StartTime] AS [StartTime], 
      [view_answers].[ActualEndTime] AS [ActualEndTime], 
      [view_answers].[Complete] AS [Complete], 
      [view_answers].[UserID] AS [UserID], 
      [view_answers].[UserName] AS [UserName], 
      [view_answers].[QuestionType] AS [QuestionType], 
      [view_answers].[ProductSKU] AS [ProductSKU], 
      [view_answers].[BrandID] AS [BrandID], 
      [view_answers].[TeamID] AS [TeamID], 
      [view_answers].[Name] AS [Name], 
      [view_answers].[Stage] AS [Stage], 
      [view_answers].[Question] AS [Question]
      FROM [dbo].[view_answers] AS [view_answers]) AS [Extent1]
    WHERE 
       ([Extent1].[UserID] = '16E3692F-806E-40A0-BB99-ABBBCC13060D') 
       AND (N'Distribution Points' = [Extent1].[QuestionType]) 
       AND ([Extent1].[StartTime] >= '11/05/2009 00:00:00') 
       AND ([Extent1].[StartTime] <= '11/08/2009 00:00:00') 
       AND (1 = [Extent1].[Complete]) 
       AND (2 = [Extent1].[BrandID]) 
       AND (N'Distribution Points' = [Extent1].[QuestionType])
    GROUP BY 
       [Extent1].[Name])  
    AS [GroupBy1]

THIS ONE WILL BE MUCH QUICKER

    SELECT 
    1 AS [C1], 
    [GroupBy1].[K1] AS [Name], 
    [GroupBy1].[A1] AS [C2]
    FROM ( SELECT 
    [Extent1].[Name] AS [K1], 
    SUM([Extent1].[Value]) AS [A1]
    FROM (SELECT 
      [view_answers].[Value] AS [Value], 
      [view_answers].[Comment] AS [Comment], 
      [view_answers].[NewStockist] AS [NewStockist], 
      [view_answers].[NewDistPoint] AS [NewDistPoint], 
      [view_answers].[VoucherUsed] AS [VoucherUsed], 
      [view_answers].[CashTotal] AS [CashTotal], 
      [view_answers].[AnswerType] AS [AnswerType], 
      [view_answers].[StartTime] AS [StartTime], 
      [view_answers].[ActualEndTime] AS [ActualEndTime], 
      [view_answers].[Complete] AS [Complete], 
      [view_answers].[UserID] AS [UserID], 
      [view_answers].[UserName] AS [UserName], 
      [view_answers].[QuestionType] AS [QuestionType], 
      [view_answers].[ProductSKU] AS [ProductSKU], 
      [view_answers].[BrandID] AS [BrandID], 
      [view_answers].[TeamID] AS [TeamID], 
      [view_answers].[Name] AS [Name], 
      [view_answers].[Stage] AS [Stage], 
      [view_answers].[Question] AS [Question]
      FROM [dbo].[view_answers] AS [view_answers]) AS [Extent1]
    WHERE 
       ([Extent1].[UserID] = '16E3692F-806E-40A0-BB99-ABBBCC13060D') 
       AND (N'Distribution Points' = [Extent1].[QuestionType]) 
       AND ([Extent1].[StartTime] >= '11/04/2009 00:00:00') 
       AND ([Extent1].[StartTime] <= '11/08/2009 00:00:00') 
       AND (1 = [Extent1].[Complete]) 
       AND (2 = [Extent1].[BrandID]) 
       AND (N'Distribution Points' = [Extent1].[QuestionType])
    GROUP BY 
       [Extent1].[Name])  
    AS [GroupBy1]

If we set the start date to the 5th Nov last year, execution will be slow; with the 4th Nov last year, it's fast again. Looking at the data in the database, there is nothing unusual around the 5th. Also, it appears that queries where the start date is after the 5th will run slowly.
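
For reference, the date literals in the generated SQL above are in an ambiguous month/day form, whose interpretation depends on the server's language and DATEFORMAT settings (this is a general observation, not something confirmed as the cause here). An ISO 8601 literal is read the same way under any settings, so rewriting the boundary dates is one quick thing to rule out:

    -- Ambiguous: parsed as 5 Nov or 11 May depending on DATEFORMAT
    WHERE [StartTime] >= '11/05/2009 00:00:00'

    -- Unambiguous ISO 8601 form, interpreted identically under any settings
    WHERE [StartTime] >= '2009-11-05T00:00:00'

If the server happened to interpret the boundary as a different date, the size of the range (and hence the plan the optimizer picks) could change.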

I'm stumped!

(the database is hosted remotely so I don't have direct access to it)

Thanks in advance

UPDATE

Thanks for the replies, guys. Firstly, I should probably make it clear that my knowledge of SQL Server is skin deep. I build databases of varying degrees of quality and then use something like LINQ to SQL or EF to work with them. So I feel a little out of my depth here.

Regarding the joins: the view I'm querying does include data from about 6-7 tables. I shall try and grab some stats next time at work and add some more info here. I don't really know much about execution plans, or whether they are something I can view through SQL Server Management Studio.

TIA

UPDATE Stats from slow query (3 row(s) affected)

Table 'tblProducts'. Scan count 0, logical reads 22, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblQuestionTypes'. Scan count 0, logical reads 1496, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblQuestions'. Scan count 0, logical reads 1496, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblSessions'. Scan count 0, logical reads 28551, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblAnswers'. Scan count 1, logical reads 1976256, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblCalls'. Scan count 1, logical reads 439, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblUsers'. Scan count 0, logical reads 2, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.

Stats from fast query

Table 'Worktable'. Scan count 0, logical reads 0, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblAnswers'. Scan count 1, logical reads 7008, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblProducts'. Scan count 0, logical reads 22, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblQuestions'. Scan count 1, logical reads 4, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblQuestionTypes'. Scan count 1, logical reads 2, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblCalls'. Scan count 1, logical reads 439, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblSessions'. Scan count 1, logical reads 47, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'tblUsers'. Scan count 0, logical reads 2, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.

UPDATE The query execution plan (a new thing to me) is suggesting that I add a new index on a column in one of the tables: QuestionID in the answers table. The suggested SQL is as follows:

USE [brandfourcoke]
GO
CREATE NONCLUSTERED INDEX [<Name of Missing Index, sysname,>]
ON [dbo].[tblAnswers] ([QuestionID])
INCLUDE ([CallID],[Value])
GO

The estimated improvement in query cost is 93%. Should I go ahead and do this? The database is in a live environment and uses SQL Server 2008's change tracking and the Sync Framework. As such, I'm always concerned that database changes will impact the tracked data and break things for my clients. Will adding an index reset the change tracking data? Thanks for all your help guys. I feel like a complete beginner here.
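
A sketch of what the suggested index might look like with the placeholder filled in (IX_tblAnswers_QuestionID is an invented name, not the one SSMS proposed, and the ONLINE option is only available in Enterprise edition):

    USE [brandfourcoke]
    GO
    CREATE NONCLUSTERED INDEX [IX_tblAnswers_QuestionID]  -- hypothetical name
    ON [dbo].[tblAnswers] ([QuestionID])
    INCLUDE ([CallID], [Value])
    WITH (ONLINE = ON)  -- Enterprise edition only; omit this clause otherwise
    GO

As I understand it, change tracking records DML changes to rows, and creating a nonclustered index doesn't modify row data, so it shouldn't reset the tracked changes; still, trying this on a copy of the database first would be prudent.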

+4  A: 

Without knowing anything more, my first guess would be that you are either hitting the tipping point of index usage for a particular operation, or that you have outdated or wrong statistical distribution information for the given column(s). It could be a few other things as well, such as suboptimal indexing (perhaps the wrong index, perhaps keys ordered incorrectly, etc.). Given that you are using SQL Server 2008, are you sure you haven't created a filtered index rather than a traditional full index on the particular column(s)? However, in order to determine that, we'd need to see a lot more information (i.e. schema, indexes, query plans, data distribution, statistics, etc.).

It might help if you could post the query plan used for each of the queries listed above; that would at least help us determine whether you are getting drastically different plans.
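
If outdated statistics do turn out to be the cause, refreshing them is a relatively low-risk first step. A sketch, using table names taken from the STATISTICS IO output in the question:

    UPDATE STATISTICS [dbo].[tblAnswers] WITH FULLSCAN;
    UPDATE STATISTICS [dbo].[tblSessions] WITH FULLSCAN;

FULLSCAN reads the whole table to rebuild the histogram, so it is slower than a sampled update but gives the optimizer the most accurate distribution information.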

chadhoc
ok, looking at the execution plan, it is telling me that I don't have an index! I guess this is a bad thing. My question is, should I add one? I am using SQL change tracking and am worried that adding an index may reset the change tracking, which would also be a bad thing. I've also added the IO info in the original post. Thanks
Sergio
SQL Server Management Studio is suggesting the following: USE [brandfourcoke] GO CREATE NONCLUSTERED INDEX [<Name of Missing Index, sysname,>] ON [dbo].[tblAnswers] ([QuestionID]) INCLUDE ([CallID],[Value]) GO
Sergio
Well, that's not an easy thing to answer (i.e. should you add the index). Naturally, if you want this particular query to run faster and adding the index does in fact significantly improve execution, then you should probably add it. However, it's really not a cut-and-dried answer, so it's something you should try out in your environment to determine whether it makes sense (indexing is a very vast, very involved topic). It's also hard to give any guidance on what to index without seeing the views you're talking about and/or the existing structures.
chadhoc
Certainly seems like a good place to start, however. You could also consider running the query through the Database Engine Tuning Advisor to see what it gives you as well (http://msdn.microsoft.com/en-us/library/ms166575.aspx)
chadhoc
Just to update: creating the index, along with another on a separate table, improved query time hugely. The query now runs in a couple of seconds! Still not sure of the fundamental reason for the massive swing in performance, though. Ah well.
Sergio
+4  A: 

Run the slow and fast query with SET STATISTICS IO ON and see if there is a significant difference in the number of logical reads/physical reads between the two.

Most likely there is a strong skew in the data. For instance, the plan for the fast one does a nested loop driven by a result of 10 rows (resulting in 10 nested lookups), while the slow one suddenly sees 10,000 rows where the previous one saw 10, resulting in 10,000 lookups. Although your query has no joins, the engine may use various access indexes and join those indexes with the clustered index. The actual execution plan will always reveal exactly what's going on.
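
The actual execution plan can also be captured from T-SQL rather than the SSMS GUI; a sketch:

    SET STATISTICS XML ON;
    GO
    -- run the slow (or fast) query here
    GO
    SET STATISTICS XML OFF;

This returns the actual plan as an XML document alongside the results; clicking it in SSMS opens the graphical plan, which can then be saved as a .sqlplan file. (The "Include Actual Execution Plan" toolbar button achieves the same thing interactively.)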

Remus Rusanu
Seems that's what's going on. I'm not entirely sure why it would suddenly need to pull in so much more data, but it is. :(
Sergio
Yep, 1976256 reads vs. 7008: looks like a table scan on tblAnswers vs. a range scan. Before you add the index, can you post the actual plans? Upload the .sqlplan files and post a link to them.
Remus Rusanu
Yep, sure. Before I stick 'em on the net: the plans don't include any sensitive info, do they? :) Not that I don't trust you; it's the rest of the net I worry about. I could email them?
Sergio
It's good to be vigilant, and you shouldn't trust me either :). The plans contain details about the names of the tables/indexes in the query, which you already posted with the statistics IO info. They do not contain any login/user/password credentials, so I would say it's a moderate risk to post them publicly. But as always, better safe than sorry. As for emailing them, I'd rather not turn a public forum into private consulting...
Remus Rusanu
A: 

That query is one reason why using generated SQL is often a terrible idea. As far as I can tell, this is the equivalent query if you had written it yourself:

    SELECT 
        1 AS [C1],
        [view_answers].[Name] AS [K1],
        SUM([view_answers].[Value]) AS [C2]
    FROM [view_answers]
    WHERE [view_answers].[UserID] = '16E3692F-806E-40A0-BB99-ABBBCC13060D'
        AND N'Distribution Points' = [view_answers].[QuestionType]
        AND [view_answers].[StartTime] >= '11/04/2009 00:00:00'
        AND [view_answers].[StartTime] <= '11/08/2009 00:00:00'
        AND 1 = [view_answers].[Complete]
        AND 2 = [view_answers].[BrandID]
    GROUP BY [view_answers].[Name]

Try this with both dates and see if you get the same results and the same delay when using Nov 5. What fields are indexed?
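
To answer the "what fields are indexed" question, a query against the catalog views along these lines would list the indexes on the main table (tblAnswers is taken from the IO stats in the question; substitute the other tables as needed):

    SELECT i.name AS index_name,
           i.type_desc,
           c.name AS column_name,
           ic.is_included_column
    FROM sys.indexes AS i
    JOIN sys.index_columns AS ic
        ON ic.object_id = i.object_id AND ic.index_id = i.index_id
    JOIN sys.columns AS c
        ON c.object_id = ic.object_id AND c.column_id = ic.column_id
    WHERE i.object_id = OBJECT_ID('dbo.tblAnswers')
    ORDER BY i.name, ic.key_ordinal;

Alternatively, sp_helpindex 'dbo.tblAnswers' gives a quick summary, though it doesn't show included columns.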

HLGEM
Whilst your query is much neater, it still runs slowly. Check out the updates to the question for some more info.
Sergio