views: 134 · answers: 5

This question is a follow-up to This Question.

The solution, clearing the execution plan cache, seemed to work at the time, but I've been running into the same problem over and over again, and clearing the cache no longer helps. There must be a deeper problem here.

I've discovered that if I remove the .Distinct() from the query, it returns rows (with duplicates) in about 2 seconds; with the .Distinct() it takes upwards of 4 minutes to complete. There are a lot of rows in the tables, and some of the WHERE-clause fields do not have indexes, but the number of records returned is fairly small (a few dozen at most).

The confusing part is that if I take the SQL generated by the LINQ query (via LINQPad) and execute it directly in SQL Server Management Studio, including the DISTINCT, it completes in about 3 seconds.

What is the difference between the LINQ query and the executed SQL?

I have a short-term workaround: return the set without .Distinct() as a List, then call .Distinct() on the list. That takes about 2 seconds. However, I don't like doing SQL Server's work on the web server.

I want to understand WHY Distinct is two orders of magnitude slower from LINQ but not from SQL.

UPDATE:

When executing the code via LINQ, SQL Profiler shows this statement, which is essentially the identical query:

sp_executesql N'SELECT DISTINCT [t5].[AccountGroupID], [t5].[AccountGroup] AS [AccountGroup1]
FROM [dbo].[TransmittalDetail] AS [t0]
INNER JOIN [dbo].[TransmittalHeader] AS [t1] ON [t1].[TransmittalHeaderID] = [t0].[TransmittalHeaderID]
INNER JOIN [dbo].[LineItem] AS [t2] ON [t2].[LineItemID] = [t0].[LineItemID]
LEFT OUTER JOIN [dbo].[AccountType] AS [t3] ON [t3].[AccountTypeID] = [t2].[AccountTypeID]
LEFT OUTER JOIN [dbo].[AccountCategory] AS [t4] ON [t4].[AccountCategoryID] = [t3].[AccountCategoryID]
LEFT OUTER JOIN [dbo].[AccountGroup] AS [t5] ON [t5].[AccountGroupID] = [t4].[AccountGroupID]
LEFT OUTER JOIN [dbo].[AccountSummary] AS [t6] ON [t6].[AccountSummaryID] = [t5].[AccountSummaryID]
WHERE ([t1].[TransmittalEntityID] = @p0) AND ([t1].[DateRangeBeginTimeID] = @p1) AND
      ([t1].[ScenarioID] = @p2) AND ([t6].[AccountSummaryID] = @p3)',
N'@p0 int,@p1 int,@p2 int,@p3 int',@p0=196,@p1=20100101,@p2=2,@p3=0

UPDATE:

The only difference between the queries is that LINQ executes it with sp_executesql and SSMS does not; otherwise the query is identical.

UPDATE:

I have tried various transaction isolation levels to no avail. I've also set ARITHABORT to try to force a recompile when it executes, with no difference.
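For anyone wanting to reproduce the comparison: SSMS runs with SET ARITHABORT ON by default while ADO.NET / LINQ to SQL connections run with it OFF, and each combination of SET options gets its own plan-cache entry, so a quick (hedged) check is to flip the setting in SSMS before running the captured statement:

    -- Mimic the application's connection-level SET options in SSMS before running
    -- the captured sp_executesql batch; with a different ARITHABORT setting the
    -- same statement text compiles into a separate plan-cache entry.
    SET ARITHABORT OFF;
    -- ...now paste and execute the sp_executesql statement from the Profiler trace...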

+1  A: 

The SQL that LINQPad gives you may not be exactly what is being sent to the DB.

Here's what I would suggest:

  1. Run SQL Profiler against the DB while you execute the query, and find the statement that corresponds to your query (or pull it from the plan cache with the DMV query sketched after this list).
  2. Paste the whole statement into SSMS and enable the "Show Actual Execution Plan" option.
  3. Post the resulting plan here for people to dissect.
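If catching the statement in Profiler is awkward, something along these lines (a sketch, SQL Server 2005 or later) pulls the statement text and its cached plan straight from the plan cache; note this gives you the cached plan rather than the actual one:

    -- Find cached statements that touch TransmittalDetail, along with their cached plans.
    SELECT st.text, qs.execution_count, qp.query_plan
    FROM sys.dm_exec_query_stats AS qs
    CROSS APPLY sys.dm_exec_sql_text(qs.sql_handle) AS st
    CROSS APPLY sys.dm_exec_query_plan(qs.plan_handle) AS qp
    WHERE st.text LIKE '%TransmittalDetail%';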

Key things to look for:

  • Table Scans, which usually imply that an index is missing
  • Wide arrows in the graphical plan, indicating lots of intermediate rows being processed

If you're using SQL Server 2008, viewing the plan will often tell you whether any indexes are missing that should be added to speed up the query.

Also, are you executing against a DB which is under load from other users?

RyanHennig
When I look at the profiler, the SQL looks identical, other than that it's using sp_executesql. And no, the database is not currently under heavy load, though it can be at times. I'm waiting for the query to complete to get the actual execution plan.
Mystere Man
OK, so the execution plans are wildly different, even though the query is identical. The query executed from SSMS uses a lot of parallelism, and 86% of the work is in a table scan of TransmittalDetail, while the much slower LINQ-executed plan spends 27% of the time in a Table Spool (Lazy Spool), only 29% in the same table scan, and 41% in a Nested Loop. In both cases, the same missing index was reported.
Mystere Man
+1  A: 

At first glance there are a lot of joins, and without the schema in front of me I can only see one way to reduce the number right away: it doesn't look like you need AccountSummary.

[t6].[AccountSummaryID] = @p3

could be

[t5].[AccountSummaryID] = @p3

The returned values come from the [t5] table. [t6] is only used to filter on that one parameter, which looks like the foreign key from [t5] to [t6], so it is already present in [t5]. Therefore, you can remove the join to [t6] altogether. Or am I missing something?
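Applied to the captured query, the simplified version would look something like this (untested; it just drops the last join and filters on [t5] instead):

    SELECT DISTINCT [t5].[AccountGroupID], [t5].[AccountGroup] AS [AccountGroup1]
    FROM [dbo].[TransmittalDetail] AS [t0]
    INNER JOIN [dbo].[TransmittalHeader] AS [t1] ON [t1].[TransmittalHeaderID] = [t0].[TransmittalHeaderID]
    INNER JOIN [dbo].[LineItem] AS [t2] ON [t2].[LineItemID] = [t0].[LineItemID]
    LEFT OUTER JOIN [dbo].[AccountType] AS [t3] ON [t3].[AccountTypeID] = [t2].[AccountTypeID]
    LEFT OUTER JOIN [dbo].[AccountCategory] AS [t4] ON [t4].[AccountCategoryID] = [t3].[AccountCategoryID]
    LEFT OUTER JOIN [dbo].[AccountGroup] AS [t5] ON [t5].[AccountGroupID] = [t4].[AccountGroupID]
    -- the LEFT OUTER JOIN to [dbo].[AccountSummary] ([t6]) is no longer needed
    WHERE ([t1].[TransmittalEntityID] = @p0) AND ([t1].[DateRangeBeginTimeID] = @p1)
      AND ([t1].[ScenarioID] = @p2) AND ([t5].[AccountSummaryID] = @p3)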

Jim Leonardo
Interesting that you mention that. After applying your optimization, the query returns in 25 seconds instead of 4 minutes. Still an order of magnitude slower than it should be, but that's good.
Mystere Man
This leads me to believe that the number of joins is affecting the query performance, which may seem like a big "duh," but similar queries with as many joins are not taking this long.
Mystere Man
A: 

Check that you have the same Transaction Isolation level between your SSMS session and your application. That's the biggest culprit I've seen for large performance discrepancies between identical queries.

Also, different connection properties are in use when you work through SSMS than when you execute the query from your application or from LINQPad. Compare the connection properties of your SSMS session with those of your application's connection and you should see the differences. All other things being equal, that could explain it: you are executing the query through two different applications, which can have two different configurations and may even use two different database drivers. If the queries themselves are identical, those are the only differences I can see.

On a side note, if you are hand-crafting the SQL, you might try moving the conditions from the WHERE clause into the appropriate JOIN clauses. This changes how SQL Server executes the query and can produce a more efficient execution plan. I've seen cases where moving the filters from the WHERE clause into the JOINs caused SQL Server to filter the table earlier in the execution plan and significantly changed the execution time.
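For illustration, against the query captured above that would look something like this (a sketch; the [t1] filters can move safely because it is an INNER JOIN, and whether it helps depends on the plan SQL Server chooses):

    -- The [t1] filters move from the WHERE clause into the join condition.
    SELECT DISTINCT [t5].[AccountGroupID], [t5].[AccountGroup] AS [AccountGroup1]
    FROM [dbo].[TransmittalDetail] AS [t0]
    INNER JOIN [dbo].[TransmittalHeader] AS [t1]
        ON  [t1].[TransmittalHeaderID]  = [t0].[TransmittalHeaderID]
        AND [t1].[TransmittalEntityID]  = @p0
        AND [t1].[DateRangeBeginTimeID] = @p1
        AND [t1].[ScenarioID]           = @p2
    INNER JOIN [dbo].[LineItem] AS [t2] ON [t2].[LineItemID] = [t0].[LineItemID]
    LEFT OUTER JOIN [dbo].[AccountType] AS [t3] ON [t3].[AccountTypeID] = [t2].[AccountTypeID]
    LEFT OUTER JOIN [dbo].[AccountCategory] AS [t4] ON [t4].[AccountCategoryID] = [t3].[AccountCategoryID]
    LEFT OUTER JOIN [dbo].[AccountGroup] AS [t5] ON [t5].[AccountGroupID] = [t4].[AccountGroupID]
    LEFT OUTER JOIN [dbo].[AccountSummary] AS [t6] ON [t6].[AccountSummaryID] = [t5].[AccountSummaryID]
    WHERE [t6].[AccountSummaryID] = @p3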

EchoCoder
Isolation level doesn't seem to make any difference. I've enclosed the query in a TransactionScope with the same default isolation level as SSMS (ReadCommitted) and there is no difference. Also note that when I execute the sp_executesql query in SSMS it also takes a long time, but the non-sp_executesql query does not. I guess the difference is that one is parameterized and the other is not.
Mystere Man
+1  A: 

Are you sure you want to use LEFT OUTER JOIN here? This query looks like it should probably be using INNER JOINs, especially because you are taking columns that are potentially NULL and then doing a DISTINCT on them.

RyanHennig
In response to EchoCoder's comment about isolation level: I have written a blog post about a convenient way to execute LINQ queries without any table locks: http://ryancoder.blogspot.com/2010/06/execute-linq-queries-without-locking.html If that gives you a big speedup, it indicates that table locking from other concurrent queries is the problem. But this does not explain the difference in performance between LINQ and SSMS.
RyanHennig
The query is generated by LINQ with LEFT OUTER JOINs, so I don't know of any way to change that. Using your extension does not produce any noticeable change either. There won't be NULLs in those tables anyway.
Mystere Man
OK then, the real problem here is that LINQ is generating LEFT OUTER JOINs instead of INNER JOINs. Check your DBML: you are probably missing associations between the columns used in your join clauses. Also make sure that any columns that are primary keys are marked as such in the DBML. Then regenerate your LINQ classes and try again. This will probably give you a huge speedup.
RyanHennig
+1  A: 

The bad plan is most likely the result of parameter sniffing: http://blogs.msdn.com/b/queryoptteam/archive/2006/03/31/565991.aspx

Unfortunately there is not really any good universal way (that I know of) to avoid that with L2S. context.ExecuteCommand("sp_recompile ...") would be an ugly but possible workaround if the query is not executed very frequently.

Changing the query around slightly to force a recompile might be another one.

Moving parts (or all) of the query into a view*, function*, or stored procedure* DB-side would be yet another workaround.
 * = where you can use local params (func/proc) or optimizer hints (all three) to force a 'good' plan
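A rough sketch of the stored-procedure variant (the procedure name is made up, and it borrows the simplified [t5] filter from the earlier answer): copying the parameters into local variables stops the optimizer from sniffing the caller's specific values, and OPTION (RECOMPILE) is the hint-based alternative.

    CREATE PROCEDURE dbo.GetDistinctAccountGroups   -- hypothetical name
        @EntityID int, @BeginTimeID int, @ScenarioID int, @SummaryID int
    AS
    BEGIN
        -- Copy the parameters into local variables so the SELECT below is
        -- compiled without sniffing the caller's specific values.
        DECLARE @e int, @b int, @s int, @a int;
        SELECT @e = @EntityID, @b = @BeginTimeID, @s = @ScenarioID, @a = @SummaryID;

        SELECT DISTINCT [t5].[AccountGroupID], [t5].[AccountGroup] AS [AccountGroup1]
        FROM [dbo].[TransmittalDetail] AS [t0]
        INNER JOIN [dbo].[TransmittalHeader] AS [t1] ON [t1].[TransmittalHeaderID] = [t0].[TransmittalHeaderID]
        INNER JOIN [dbo].[LineItem] AS [t2] ON [t2].[LineItemID] = [t0].[LineItemID]
        LEFT OUTER JOIN [dbo].[AccountType] AS [t3] ON [t3].[AccountTypeID] = [t2].[AccountTypeID]
        LEFT OUTER JOIN [dbo].[AccountCategory] AS [t4] ON [t4].[AccountCategoryID] = [t3].[AccountCategoryID]
        LEFT OUTER JOIN [dbo].[AccountGroup] AS [t5] ON [t5].[AccountGroupID] = [t4].[AccountGroupID]
        WHERE [t1].[TransmittalEntityID] = @e AND [t1].[DateRangeBeginTimeID] = @b
          AND [t1].[ScenarioID] = @s AND [t5].[AccountSummaryID] = @a
        -- OPTION (RECOMPILE)   -- hint-based alternative: compile a fresh plan per execution
    END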

By the way, have you tried updating statistics for the tables involved? SQL Server's auto-update statistics doesn't always do the job, so unless you already have a scheduled job for it, it might be worth scripting and scheduling UPDATE STATISTICS, tweaking the sample size up and down as needed.
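For example, a scheduled job along these lines (table names taken from the captured query; the sampling choices are just placeholders to tune):

    -- Refresh statistics on the tables the query touches.  FULLSCAN is the most
    -- thorough (and most expensive) option; SAMPLE n PERCENT is the tunable one.
    UPDATE STATISTICS dbo.TransmittalDetail WITH FULLSCAN;
    UPDATE STATISTICS dbo.TransmittalHeader WITH FULLSCAN;
    UPDATE STATISTICS dbo.AccountGroup WITH SAMPLE 50 PERCENT;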

There may be ways to solve the issue by adding* (or dropping*) the right indexes on the tables involved, but without knowing the underlying DB schema, table sizes, data distribution, etc., it is difficult to give more specific advice on that...
 * = Missing and/or overlapping/redundant indexes can both lead to bad execution plans.

KristoferA - Huagati.com
OK, so the problem goes away when I add the missing index that the execution plan suggested. A better plan is chosen, and while it's not parallelized like the non-parameterized query, the response time is sub-second.
Mystere Man