The database I'm working with is currently over 100 GiB and promises to grow much larger over the next year or so. I'm trying to design a partitioning scheme that will work with my dataset but thus far have failed miserably. My problem is that queries against this database typically test the values of multiple columns in this one large table, producing result sets that overlap in an unpredictable fashion.

Everyone (the DBAs I'm working with) warns against having tables over a certain size. I've researched and evaluated the solutions I've come across, but they all seem to rely on a data characteristic that allows for logical table partitioning. Unfortunately, I do not see a way to achieve that given the structure of my tables.

Here's the structure of our two main tables to put this into perspective.

Table: Case
Columns:
Year
Type
Status
UniqueIdentifier
PrimaryKey
etc.

Table: Case_Participant
Columns:
Case.PrimaryKey
LastName
FirstName
SSN
DLN
OtherUniqueIdentifiers

Note that any of the columns above can be used as query parameters.

A: 

If you have no other choice you can partition by the key modulo the number of partition tables. Let's say that you want to partition into 10 tables. You will define the tables:
Case00
Case01
...
Case09

Then partition your data by UniqueIdentifier or PrimaryKey modulo 10 and place each record in the corresponding table (depending on how your UniqueIdentifier is generated, you might need to start allocating ids manually).
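
As a rough illustration of the routing step, here is a minimal sketch. The integer PrimaryKey, the dbo schema, the column types, and the literal values are all assumptions for the example, not details from the question:

DECLARE @PrimaryKey INT = 123457;
-- Bucket number 00..09 chosen by PrimaryKey modulo 10.
DECLARE @Bucket CHAR(2) = RIGHT('0' + CAST(@PrimaryKey % 10 AS VARCHAR(2)), 2);

-- Build the target table name (Case00..Case09) and insert through dynamic SQL.
DECLARE @Sql NVARCHAR(MAX) =
    N'INSERT INTO dbo.Case' + @Bucket +
    N' (PrimaryKey, [Year], [Type], [Status], UniqueIdentifier)
       VALUES (@pk, @yr, @tp, @st, @uid);';

EXEC sp_executesql @Sql,
    N'@pk INT, @yr INT, @tp VARCHAR(50), @st VARCHAR(50), @uid VARCHAR(50)',
    @pk = @PrimaryKey, @yr = 2009, @tp = 'Civil', @st = 'Open', @uid = 'C-123457';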

When performing a query, you will need to run the same query on all of the tables and use UNION to merge the result sets into a single result.
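
One way to avoid repeating the ten-way UNION in every query is to wrap it in a view. This is a sketch under the same assumptions; the view name AllCases and the column list are made up. UNION ALL is used instead of plain UNION only because the modulo buckets cannot overlap, so the duplicate-removal step is unnecessary:

CREATE VIEW dbo.AllCases AS
    SELECT PrimaryKey, [Year], [Type], [Status], UniqueIdentifier FROM dbo.Case00
    UNION ALL
    SELECT PrimaryKey, [Year], [Type], [Status], UniqueIdentifier FROM dbo.Case01
    -- ... and so on through Case08 ...
    UNION ALL
    SELECT PrimaryKey, [Year], [Type], [Status], UniqueIdentifier FROM dbo.Case09;
GO

-- Callers query one name; SQL Server expands the view and touches every underlying table.
SELECT PrimaryKey, UniqueIdentifier
FROM dbo.AllCases
WHERE [Year] = 2009 AND [Status] = 'Open';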

It's not as good as partitioning the tables based on some logical separation that corresponds to the expected queries, but it's better than hitting the size limit of a table.

Alex Shnayder
Not hitting the table size limit is definitely a goal, but I am also trying to preserve query performance.
Sugerman
+4  A: 

Rather than guess, measure. Collect statistics on usage (the queries actually run), look at the engine's own statistics like sys.dm_db_index_usage_stats, and then make an informed decision: the partitioning that best balances data size and gives the best affinity for the most frequently run queries will be a good candidate. Of course you'll have to compromise.
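
A starting point for that measurement is a query like the following (a sketch; the two table names are taken from the question, everything else is standard DMV/catalog querying):

-- Per-index read and write counts for the two tables in the current database.
-- Note: these counters reset when the instance restarts, so sample over a representative period.
SELECT  OBJECT_NAME(s.object_id)  AS table_name,
        i.name                    AS index_name,
        s.user_seeks, s.user_scans, s.user_lookups, s.user_updates,
        s.last_user_seek, s.last_user_scan
FROM    sys.dm_db_index_usage_stats AS s
JOIN    sys.indexes AS i
        ON i.object_id = s.object_id
       AND i.index_id  = s.index_id
WHERE   s.database_id = DB_ID()
  AND   OBJECT_NAME(s.object_id) IN (N'Case', N'Case_Participant')
ORDER BY s.user_seeks + s.user_scans + s.user_lookups DESC;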

Also don't forget that partitioning is per index (the 'table' itself is really just one of its indexes, the clustered index or the heap), not per table, so the question is not what to partition on, but which indexes to partition and what partitioning function to use. Your clustered indexes on the two tables are obviously the most likely candidates (there is not much sense in partitioning just a non-clustered index while leaving the clustered one unpartitioned), so unless you're considering a redesign of your clustered keys, the question is really what partitioning function to choose for your clustered indexes.
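
For reference, partitioning the clustered index looks roughly like this. It is only a sketch: the function, scheme, and index names are made up, [PRIMARY] is used as a stand-in for real filegroups, and Year is just one possible choice of partitioning column:

CREATE PARTITION FUNCTION pf_CaseYear (INT)
    AS RANGE RIGHT FOR VALUES (2006, 2007, 2008, 2009, 2010);

CREATE PARTITION SCHEME ps_CaseYear
    AS PARTITION pf_CaseYear ALL TO ([PRIMARY]);

-- Building the clustered index on the scheme is what partitions the table itself.
-- If a clustered index or primary key already exists, it has to be rebuilt onto the
-- scheme (e.g. with DROP_EXISTING = ON), and a unique clustered key must include
-- the partitioning column.
CREATE CLUSTERED INDEX CIX_Case_Year
    ON dbo.[Case] ([Year], PrimaryKey)
    ON ps_CaseYear ([Year]);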

If I were to venture a guess, I'd say that for any data that accumulates over time (like 'cases' with a 'year'), the most natural partitioning is a sliding window.
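
The sliding-window part then comes down to adding a boundary for each new year and switching the oldest partition out. Continuing the sketch above (dbo.Case_Archive is a hypothetical table with the same structure, on the same filegroup as the partition being switched):

-- Make room for the incoming year.
ALTER PARTITION SCHEME ps_CaseYear NEXT USED [PRIMARY];
ALTER PARTITION FUNCTION pf_CaseYear() SPLIT RANGE (2011);

-- Move the oldest partition out as a metadata-only operation, then drop its boundary.
ALTER TABLE dbo.[Case] SWITCH PARTITION 1 TO dbo.Case_Archive;
ALTER PARTITION FUNCTION pf_CaseYear() MERGE RANGE (2006);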

Remus Rusanu
A: 

Another possible thing to look at (before partitioning) is your model.

Is your database normalized? Could performance be improved by different choices in normalization, denormalization, or partial denormalization? Are there options to transform the data into a Kimball-style dimensional star model, which is optimal for reporting/querying?
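
To make that last option concrete, a Kimball-style reshaping for reporting might look something like this (a very rough sketch; every name and type below is hypothetical, not taken from the question):

CREATE TABLE dbo.DimCaseType   (CaseTypeKey   INT IDENTITY PRIMARY KEY, [Type]   VARCHAR(50) NOT NULL);
CREATE TABLE dbo.DimCaseStatus (CaseStatusKey INT IDENTITY PRIMARY KEY, [Status] VARCHAR(50) NOT NULL);

-- One narrow fact row per case, carrying surrogate keys into the dimensions.
CREATE TABLE dbo.FactCase (
    CaseKey          INT IDENTITY PRIMARY KEY,
    CaseTypeKey      INT NOT NULL REFERENCES dbo.DimCaseType (CaseTypeKey),
    CaseStatusKey    INT NOT NULL REFERENCES dbo.DimCaseStatus (CaseStatusKey),
    [Year]           INT NOT NULL,
    UniqueIdentifier VARCHAR(50) NOT NULL
);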

If you aren't going to drop partitions of the table (sliding window, as mentioned) or treat different partitions differently (you say any columns can be used in the query), I'm not sure what you are trying to get out of the partitioning that you won't already get out of your indexing strategy.
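
For a workload where any column can be a query parameter, that indexing strategy would typically mean a set of nonclustered indexes on the most commonly filtered columns, for example (a sketch; the column choices are guesses based on the structures in the question):

CREATE NONCLUSTERED INDEX IX_Case_Year_Type_Status
    ON dbo.[Case] ([Year], [Type], [Status])
    INCLUDE (UniqueIdentifier);

CREATE NONCLUSTERED INDEX IX_CaseParticipant_SSN
    ON dbo.Case_Participant (SSN)
    INCLUDE (LastName, FirstName);

CREATE NONCLUSTERED INDEX IX_CaseParticipant_Name
    ON dbo.Case_Participant (LastName, FirstName);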

I'm not aware of any limit on the number of rows in a table. AFAIK, the number of rows is limited only by available storage.

Cade Roux