ansaurus

Question

Why are my spatial searches slower in SQL Server than PostGIS?

Answer 1

+1 A:

Here are some remarks about SQL Server's spatial extensions and how to ensure that the index is used efficiently:

http://sqlskills.com/BLOGS/BOBB/post/How-to-ensure-your-spatial-index-is-being-used.aspx

Apparently, the planner has difficulty building a good plan if it does not know the actual geometry at parse time. The author suggests wrapping the query in exec sp_executesql:

Replace:

-- does not use the spatial index without a hint
declare @latlonPoint geometry = geometry::Parse('POINT (45.518066 -122.767464)')
select a.id, a.shape.STAsText() 
from zipcodes a 
where a.shape.STIntersects(@latlonPoint)=1
go

with:

-- this does use the spatial index without using a hint
declare @latlonPoint geometry = geometry::Parse('POINT (45.518066 -122.767464)')
exec sp_executesql 
N'select a.id, a.shape.STAsText() 
from zipcodes a 
where a.shape.STIntersects(@latlonPoint)=1', N'@latlonPoint geometry', @latlonPoint
go

Luther Blissett 2010-08-12 17:15:42

My spatial index is being used though. I hit "Include actual execution plan" and it shows the spatial index being used.

Brendan Long 2010-08-12 17:32:09

I tried this suggestion just to be sure, and the times and execution plan were the same.

Brendan Long 2010-08-12 17:37:25

+1 informative, even if it did not solve the OP's problem

Peter 2010-08-28 04:03:25

Answer 2

A:

My gut reaction is "because Microsoft hasn't bothered to make it fast, because it's not an Enterprise Feature". Maybe I'm being cynical.

I'm not sure why you're migrating away from Postgres either.

tc. 2010-08-12 17:20:56

I suspect it has more to do with it being a new feature; I heard they're supposed to make it a lot better in the next version. What confuses me is that I haven't heard anything about it being slow, so I'm worried that I'm just missing something.

Brendan Long 2010-08-12 19:39:38

Answer 3

A:

I'm not familiar with spatial queries, but it could be a parameterized query problem.

Try writing a query (without parameters) that uses a fixed value, one that performs slowly with the parameterized query, and run it. Compare the times with the parameterized version. If it's much faster, then your problem is parameterized queries.

In that case, I would build your SQL string dynamically with the parameter values embedded in the string, so that parameters can no longer cause problems.

pete 2010-08-23 06:58:10
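
A minimal sketch of the comparison described in this answer. The asker's table, column, and test geometry are not shown in the thread, so the names below (zipcodes, shape, borrowed from the first answer) and the polygon are assumptions. The first query passes the shape through a variable; the second embeds the same WKT as a literal so the optimizer can see the value at compile time.

```sql
-- Hypothetical table/column names and test polygon; substitute your own.
set statistics time on;

-- Version 1: shape supplied through a variable
declare @box geometry = geometry::STGeomFromText(
    'POLYGON ((-105.2 39.6, -104.9 39.6, -104.9 40.6, -105.2 40.6, -105.2 39.6))', 0);
select a.id
from zipcodes a
where a.shape.STIntersects(@box) = 1;

-- Version 2: the same shape embedded as a literal, no variable involved
select a.id
from zipcodes a
where a.shape.STIntersects(geometry::STGeomFromText(
    'POLYGON ((-105.2 39.6, -104.9 39.6, -104.9 40.6, -105.2 40.6, -105.2 39.6))', 0)) = 1;

set statistics time off;
```

If the second query is consistently faster, the variable is the likely cause; if the timings match, as the asker reported for the sp_executesql variant, the problem is probably elsewhere.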

Answer 4

A:

Have you set up your spatial index correctly? Is your bounding box correct? Are all points inside it? In your case HHMM for GRIDS would probably work best (depending, again, on the bounding box).

Can you try sp_help_spatial_geometry_index to see what's wrong? http://msdn.microsoft.com/en-us/library/cc627426.aspx

Try using the Filter operation instead and tell us what perf numbers you get. It executes only the primary filter (the index lookup) without going through the secondary filter (the true spatial operation).

Something is wrong with your setup. Spatial is indeed a new feature, but it's not that bad.

Desinderlase 2010-08-26 16:58:01
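
For reference, a sketch of what the index setup and diagnostic call discussed in this answer might look like. The table, column, and index names are hypothetical. The bounding box is a rough longitude/latitude extent for Colorado, assumed only because a later comment says the data covers that state; use the real extent of your data.

```sql
-- Hypothetical names: zipcodes, shape, six_zipcodes_shape.
-- Bounding box: approximate lon/lat extent of Colorado (an assumption).
create spatial index six_zipcodes_shape
on zipcodes (shape)
using geometry_grid
with (
    bounding_box = (xmin = -109.1, ymin = 36.9, xmax = -102.0, ymax = 41.1),
    -- "HHMM": high density at levels 1-2, medium at levels 3-4
    grids = (level_1 = high, level_2 = high, level_3 = medium, level_4 = medium),
    cells_per_object = 16  -- tune alongside the grid levels
);
go

-- Report index statistics for a sample query shape
declare @sample geometry = geometry::STGeomFromText(
    'POLYGON ((-105.2 39.6, -104.9 39.6, -104.9 40.6, -105.2 40.6, -105.2 39.6))', 0);
exec sp_help_spatial_geometry_index
    @tabname = 'zipcodes',
    @indexname = 'six_zipcodes_shape',
    @verboseoutput = 1,
    @query_sample = @sample;
go
```

The table needs a clustered primary key before a spatial index can be created on it. Objects that fall outside the bounding box are still indexed, but the grid cannot discriminate between them, which is why the box should cover all of the data.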

I've tried every combination of two sizes (LLLL, LLMM, LLHH, MMLL, etc.) and the best was MMMM with 256 cells per object. `sp_help_spatial_geometry_index` said the primary filter was 90% efficient, which I think might be the problem (others were as low as 70%). `Filter` was much faster than `STIntersects` but still 2-5x slower than Postgres (and not as accurate).

Brendan Long 2010-08-26 17:47:33

We think the problem is that our data is fairly sparse with high-density regions, so the static-grid-size approach isn't helpful. If we set the grids to high, the index is too specific in sparse areas, but if we set it to low, the index is useless in high-density areas.

Brendan Long 2010-08-26 17:51:07

Then try setting up multiple spatial indexes, one around each high-density region. Or at least break the entire US into a few big areas. I expect most of your data is on the east and west coasts.

Desinderlase 2010-08-27 10:52:39

@Desinderlase, our data isn't the whole US, it's just Colorado. The problem is that users can select areas that cross the entire state. For example, my test query is a thin box from Fort Collins to Denver. This has two high-density regions separated by a low-density region, and I was under the impression that SQL Server will only use one spatial index at a time (and not indexing one of those two areas will be even worse than what I have now).

Brendan Long 2010-08-27 15:59:46

I can try loading this database up again on Monday to test it again. For the moment we just removed it, because this configuration is way more complicated than just using two databases.

Brendan Long 2010-08-27 16:00:58

Answer 5

A:

I believe STIntersects is better optimized for using the index and would perform better than STWithin, especially for larger shapes.

Giri 2010-08-26 17:31:48

We tried both and it made no difference.

Brendan Long 2010-08-26 17:37:59

Would it be possible for you to post the plan information for the STIntersects query after running set statistics profile on?

Giri 2010-08-26 17:45:17
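
A sketch of how that plan information could be captured. The table, column, and polygon are stand-ins (zipcodes and shape come from the first answer, not from the asker's schema).

```sql
-- Hypothetical table/column names and test polygon; substitute your own.
set statistics profile on;
go

declare @box geometry = geometry::STGeomFromText(
    'POLYGON ((-105.2 39.6, -104.9 39.6, -104.9 40.6, -105.2 40.6, -105.2 39.6))', 0);
select a.id
from zipcodes a
where a.shape.STIntersects(@box) = 1;
go

set statistics profile off;
go
```

With the option on, each executed statement returns an extra result set containing the plan operators along with actual row and execution counts, which is the part worth posting.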

Answer 6

A:

You might try breaking it down into two passes:

1. Select candidates into a temp table with .Filter().
2. Query the candidates with .STWithin().

e.g.:

SELECT * INTO #this FROM PointsTable WHERE Point.Filter(@Shape) = 1
SELECT * FROM #this WHERE Point.STWithin(@Shape) = 1

(replacing SELECT * with only the actual columns you need to reduce I/O)

This kind of micro-optimization shouldn't be necessary, but I have seen decent performance improvements before. Also, you will be able to gauge how selective your index is by the ratio of (1) to (2).

Peter 2010-08-28 04:11:56
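
One way to measure the selectivity mentioned in this answer is to count the rows each pass returns. This is only a sketch: PointsTable and Point follow the example above, and the polygon assigned to @Shape is an assumed test shape.

```sql
-- Hypothetical names (PointsTable, Point) and test polygon; substitute your own.
DECLARE @Shape geometry = geometry::STGeomFromText(
    'POLYGON ((-105.2 39.6, -104.9 39.6, -104.9 40.6, -105.2 40.6, -105.2 39.6))', 0);
DECLARE @candidates int, @matches int;

-- Pass 1: rows accepted by the index-only primary filter
SELECT @candidates = COUNT(*) FROM PointsTable WHERE Point.Filter(@Shape) = 1;

-- Pass 2: rows that pass the exact spatial test
SELECT @matches = COUNT(*) FROM PointsTable WHERE Point.STWithin(@Shape) = 1;

-- Fraction of index candidates that are true matches
SELECT @candidates AS candidates,
       @matches AS matches,
       CASE WHEN @candidates = 0 THEN NULL
            ELSE 1.0 * @matches / @candidates END AS fraction_kept;
```

A fraction close to 1 suggests the index is discarding most non-matching rows on its own; a low fraction means the exact test is doing most of the work. If no spatial index is used, Filter falls back to the same result as STIntersects, so the numbers are only meaningful when the index is actually in the plan.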
