Partition a table across multiple physical nodes | ansaurus

Q:

Partition a table across multiple physical nodes

Hello,

So I'm currently working on a project that involves collecting and storing some huge datasets (huge compared to what I'm used to working with, at least). The data essentially consists of meta information, and then actual values (where the values are trended over time).

The meta information itself is relatively large, but nothing huge. I would say it's probably going to grow to somewhere in the 10-50 million row range over the next couple of years. This seems manageable to me, and a single beefy SQL Server should be enough to provide quick access to this data if it is decently indexed (and the data is very easy to index, with very defined boundaries)...

However, the trending data is a completely different story. Within a year, we are VERY easily going to be pulling in 40-50 million rows every day, and that could realistically double yearly for the next 3 or 4 years.
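For a rough sense of scale, the growth described above can be worked out with a few lines of Python. This is only back-of-the-envelope arithmetic: the function name is made up for illustration, and the inputs are simply the ends of the ranges quoted above (40-50 million rows per day to start, doubling once a year for 3 or 4 years).

```python
def projected_daily_rows(start_rows_per_day: int, doublings: int) -> int:
    """Daily row count after the given number of yearly doublings."""
    return start_rows_per_day * 2 ** doublings


# Low and high ends of the starting estimate, after 3 and 4 doublings.
for start in (40_000_000, 50_000_000):
    for years in (3, 4):
        rows = projected_daily_rows(start, years)
        print(f"{start:,} rows/day doubling for {years} years -> {rows:,} rows/day")
```

That works out to somewhere between 320 million and 800 million rows per day at the end of the period, if the doubling estimate holds.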

This trending data also has very defined boundaries that would split it into MUCH more manageable sized chunks. I'm hoping I can set up some sort of partitioning mechanism that would spread this data across multiple physical database nodes. The data is essentially all contained in a single table. I looked into SQL Server table partitioning, but couldn't find a way to spread the data over multiple servers.

My question is whether there is some "relatively simple" way of implementing table partitioning over multiple physical nodes. I've also spent some time looking at SQL Server PDW, but it's difficult to find information online, and I don't want to pursue that until I've established that there is no simple way of implementing this sort of solution using features built into SQL Server.

Any advice would be greatly appreciated...

+1 A:

I'm no expert on this but I believe what you may be looking for is database 'sharding'. There's an interesting analysis of the problems and benefits of sharding here.

Ultimately, implementing a 'sharded' design is likely to be very costly, but if your data is going to be unmanageable in a single database then this could be a good solution.

There is also a small amount of information on the Wikipedia page, which includes a list of software that supports shards (e.g. the Hibernate ORM).
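To give a feel for what application-level sharding involves, here is a minimal sketch in Python. It is not a SQL Server feature and not a recommendation of a particular design. Everything in it is an assumption made for illustration: the `trend` table and its columns, the `series_id` key (standing in for whatever "defined boundary" splits the trending data), the node count, and the use of in-memory SQLite databases in place of separate physical servers.

```python
import hashlib
import sqlite3

# Hypothetical stand-ins for separate physical servers: each "node" is an
# in-memory SQLite database. A real setup would hold one connection per server.
NODE_COUNT = 4
nodes = [sqlite3.connect(":memory:") for _ in range(NODE_COUNT)]
for conn in nodes:
    conn.execute("CREATE TABLE trend (series_id TEXT, ts INTEGER, value REAL)")


def shard_for(series_id: str) -> int:
    """Map a series to a node index using a stable hash.

    Python's built-in hash() is randomized per process for strings, so a
    digest is used to keep the mapping the same across runs.
    """
    digest = hashlib.sha1(series_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % NODE_COUNT


def insert_point(series_id: str, ts: int, value: float) -> None:
    """Write one trend value to the node that owns this series."""
    conn = nodes[shard_for(series_id)]
    conn.execute("INSERT INTO trend VALUES (?, ?, ?)", (series_id, ts, value))


def read_series(series_id: str) -> list:
    """Read a whole series back; only one node has to be queried."""
    conn = nodes[shard_for(series_id)]
    return conn.execute(
        "SELECT ts, value FROM trend WHERE series_id = ? ORDER BY ts",
        (series_id,),
    ).fetchall()


insert_point("sensor-17", 2, 20.5)
insert_point("sensor-17", 1, 19.0)
insert_point("sensor-42", 1, 3.25)
print(read_series("sensor-17"))
```

Queries that stay within one series touch a single node, which is where the benefit comes from. Anything that spans series (aggregates, joins against the meta tables) has to be fanned out and merged by the application, and with plain modulo hashing, changing the number of nodes means moving most of the data. That is a large part of the cost.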

Dolbz 2010-03-01 08:30:50

Thanks for the reply. Not quite what I was hoping for, but I'll give you a +1 for the good reading... I'm thinking I may have to look into a distributed key-value store or something, just for the trending tables. That should be much easier to scale out than SQL Server.

LorenVS 2010-03-01 21:32:26
