Suppose we have a denormalized table with about 80 columns that grows at a rate of ~10 million rows (about 5GB) per month. We currently have 3 1/2 years of data (~400M rows, ~200GB).

To best suit how we retrieve data from the table, we create a clustered index on the following columns, which serve as our primary key...

    [FileDate] ASC, 
    [Region] ASC,
    [KeyValue1] ASC, 
    [KeyValue2] ASC

... because when we query the table, we always have the entire primary key.
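For reference, that primary key might be declared roughly like this (the constraint name is illustrative, and the partitioning clause and column types are omitted):

    -- Sketch only: the constraint name is an assumption; partitioning
    -- by FileDate is not shown here.
    ALTER TABLE dbo.HugeTable
    ADD CONSTRAINT PK_HugeTable
        PRIMARY KEY CLUSTERED
        (
            [FileDate]  ASC,
            [Region]    ASC,
            [KeyValue1] ASC,
            [KeyValue2] ASC
        );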

So these queries always result in clustered index seeks and are therefore very fast, and fragmentation is kept to a minimum. However, we do have a situation where we want to get the most recent FileDate for every Region, typically for reports, i.e.

    SELECT
     [Region]
    , MAX([FileDate]) AS [FileDate]
    FROM
     HugeTable
    GROUP BY
     [Region]

The "best" solution I can come up to this is to create a non-clustered index on Region. Although it means an additional insert on the table during loads, the hit isn't minimal (we load 4 times per day, so fewer than 100,000 additional index inserts per load). Since the table is also partitioned by FileDate, results to our query come back quickly enough (200ms or so), and that result set is cached until the next load.

However, I'm guessing that someone with more data warehousing experience might have a better solution, as this one, for some reason, doesn't "feel right".

+1  A: 

Another option would be to have another table (Region, FileDate) which holds the most recent FileDate for each Region. You would update this table during your load.
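As a rough sketch of what that could look like (the table name RegionLatestFileDate, the staging table StagingRows, and the column types are all illustrative assumptions, not names from the question):

    -- Sketch only: names and types are assumptions.
    CREATE TABLE dbo.RegionLatestFileDate
    (
        [Region]   int      NOT NULL PRIMARY KEY,  -- type assumed
        [FileDate] datetime NOT NULL               -- type assumed
    );

    -- After each load, fold the newly loaded batch into the summary table.
    MERGE dbo.RegionLatestFileDate AS t
    USING
    (
        SELECT [Region], MAX([FileDate]) AS [FileDate]
        FROM dbo.StagingRows            -- the batch just loaded (assumed name)
        GROUP BY [Region]
    ) AS s
        ON t.[Region] = s.[Region]
    WHEN MATCHED AND s.[FileDate] > t.[FileDate]
        THEN UPDATE SET t.[FileDate] = s.[FileDate]
    WHEN NOT MATCHED
        THEN INSERT ([Region], [FileDate]) VALUES (s.[Region], s.[FileDate]);

The report query then reads the small summary table instead of scanning HugeTable.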

AdamRalph
Another index on (Region, FileDate) would be simpler, no?
gbn
Yep, definitely simpler. I was thinking of a) data volume and b) update time. To get the latter benefit you would need to know Region, MAX(FileDate) independently of the data being loaded (otherwise you'd end up doing a conditional check on each insert anyway). If data volume is not an issue and/or the number of updates can't be reduced as I've described, then the additional covering index would be the best way.
AdamRalph
+1  A: 

I'd create the covering index (nonclustered) on (Region, FileDate), not just Region. However, it will be large because you have a wide clustered key.
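Something along these lines (the index name is just an example):

    -- Sketch only: the index name is an assumption.
    CREATE NONCLUSTERED INDEX IX_HugeTable_Region_FileDate
        ON dbo.HugeTable ([Region], [FileDate]);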

Otherwise, try AdamRalph's idea, but I think that adds overhead that outweighs the cost of another index.

gbn
That was a typo in my original question. Indeed, I couldn't think of anything better than an additional non-clustered index. Everything else involves too much pre-processing or post-processing to make it worthwhile.
The Lazy DBA
A: 

Any chance you could build a cube in Analysis Services, and run your aggregation query against the cube?

The queries should be faster, although there would be a delay from when your data changes until when the cube finishes updating.

RickNZ