views:

14

answers:

1

I have multiple servers containing the same table schema, but each responsible for a portion of the overall data... No one machine needs to contain all the data. Each machine only contains data relating to its department.

Is there a central database server which will contain all the data, or just a central db server which can be used to access the other servers?

if my question is not clear please refer to this link it describes the main purpose

help! please

http://stackoverflow.com/questions/3218452/i-have-to-do-the-following-to-the-database-have-an-idea-but-its-not-clear-help

+1  A: 

It sounds like what you are doing is partitioning the data. Remember the CAP Theorem states you cannot have accessibility and data consistency all the time in this configuration. Normally you'd have data severs (like in your example) and then different servers for read/writing using something like SQL. They can be distributed (just as long as they know about all the data servers). Just remember that 'in a distributed system there is no central arbiter of truth'.

We use the same came of setup in a production network, where the faster servers are a cluster with the last 2 years worth of data and then there are slower archive servers. Results can span the entire collection of servers depending on the query. If a report needs archived data it queries a separate server that performs a join behind the scenes.

Metalshark