ansaurus

Question

SQL Server 2005 Scalar UDF performance

Answer 1

+2 A:

You can improve the performance of this UDF by NOT declaring variables and doing your calculations more in-line. This will likely improve performance a little but (but probably not much).

CREATE FUNCTION [dbo].[SquareDistance] 
(@Lat1 float, @Long1 float, @Lat2 float, @Long2 float)
RETURNS float
AS
BEGIN
    Return ( SELECT ((@Lat1 - @Lat2) * (@Lat1 - @Lat2)) + ((@Long1 - @Long2) * (@Long1 - @Long2)))
END

Even better would be to remove the function and put the calculations in the original query.

SELECT Lat, Long FROM Table   
WHERE (Lat BETWEEN 38 AND 42)   
  AND (Long BETWEEN 138 AND 142)  
  AND ((Lat - 40) * (Lat - 40)) + ((Long - 140) * (Long - 140))  < 4

There is a little bit of overhead with calling a user defined function. By removing the function, you are likely to gain a little in performance.

Also, I encourage you to check your execution plan just to make sure you are getting index seeks like you expect.

G Mastros 2008-12-22 15:28:59

Wow, I feel pretty dumb right now, for not making the leap to convert the calculation from UDF to direct SQL...I'll try this and check how it works.As for the indexes, it's definitely using them, it doesn't even touch the table, and it's seeking, not scanning. Thank you!!

Daniel Magliola 2008-12-22 16:38:11

Answer 2

+1 A:

There is a lot of overhead in using a UDF.

Even coding it in-line may not be good because an index can not be used, although here the BETWEEN clauses should reduce the data that needs crunched.

To extend G Mastros' idea, separate the select bit from the square bit. It may help the optimiser.

SELECT
    Lat, Long
FROM
    (
    SELECT
        Lat, Long
    FROM 
        Table   
    WHERE
        (Lat BETWEEN 38 AND 42)   
        AND
        (Long BETWEEN 138 AND 142)
    ) foo
WHERE
    ((Lat - 40) * (Lat - 40)) + ((Long - 140) * (Long - 140))  < 4

Edit: You may be able to reduce the actual calculations involved. This next idea may reduce the number of calcs from 7 to 5

    ...
    SELECT
        Lat, Long,
        Lat - 40 AS LatDiff, Long - 140 AS LongDiff
    FROM 
    ...
    (LatDiff * LatDiff) + (LongDiff * LongDiff)  < 4
    ...

Basically, try the 3 solutions offered and see what works. The optimiser may ignore the derived table, it may use it, or it may generate an even worse plan.

gbn 2008-12-22 15:58:04

It may help the optimizer, but probably won't. The optimizer is smart enough to recognize a derived table and optimize the query as though it weren't there.

G Mastros 2008-12-22 16:07:15

True, but it could help readability as well.Edited to add more work onto inner query for "Lat - 40", "Long - 40" once instead of twice for outer query

gbn 2008-12-22 16:25:07

Answer 3

A:

Updates:

GMastros: You were absolutely right. Doing the math in the query itself is infinitely faster than the UDF. I'm using the SQUARE() function to do the multiplication, which makes it a bit more concise, but performance is the same.

However, doing it this way is still twice as slow as doing the math in .Net.
I can't really understand that, but i've come to a compromise that is useful for my particular situation (which sucks, because I need to duplicate code, but it's the best scenario, unless we can find a way to make the circle calculation in SQL be faster)

Thanks!

Daniel Magliola 2008-12-22 18:02:22

Answer 4

A:

Check this article that describes why UDF in SQL Server are generically speaking a bad idea. Unless you're pretty sure the table you're invoking the UDF will not grow up a lot beware that UDF functions are always called on ALL the rows in your tables and not (as one can wrongly guess) only on resultset. This can give you a big performance hit when database grow.

The very good article linked details also some ways to overcome the problem but the real fact is that the SQL Server TSQL dialect misses a way to create a scalar function or a deterministic one (like Oracle does).

massimogentilini 2009-01-26 17:31:22

ansaurus

tags:

views:

answers:

SQL Server 2005 Scalar UDF performance

related questions