ansaurus

Question

Best way/technology to implement a generic archive process

Answer 1

+3 A:

First, this sound like a task for PL-SQL. The non modular code issue can be enforced, and using PL-SQL will give you better results and easier to be written.

As for the concept itself, any solution you`ll do will have a problem if the schema will be updated - the sync will fails, or worse, it won't and the data will be corrupted.

What about having a replication server where you add a "delete old records" from the main server, and only perform the insert/update on the offline server? This will allow you to both have all the data and keep the live one smaller.

Am 2009-09-24 22:49:21

Answer 2

+4 A:

PL/SQL is NOT just for "quick-in and quick-out" data operations. There a very substantial apps built on it. There is nothing inherently wrong with PL/SQL for this kind of task. That said, if you anticipate a poorly written 10K line procedure in PL/SQL, don't use it. Let your programmers do what they do best.

DCookie 2009-09-24 22:52:10

Answer 3

+3 A:

However you "do it", it will need to be done by hand.

Retiring data in a RDBMS is fraught with peril. Because you typically can't just archive a single table. You need to archive all of it's dependent tables as well.

Then there's the schema change issue. Not so much keeping your archive in sync with you evolving schema, but keeping your tools in sync with obsolete schemas. It's not like you can point your current applications at the "old data" and expect it to necessarily work. Hard enough to keep your apps up to date with current data, much less having it behave reasonably with old data.

If you're doing select subsets of your data, it's just simply safer, and actually easier, to craft the select and insert statements by hand, ensuring integrity, checking values, etc. than to rely on some contrived tool. It may seem arduous up front, but it's really just tedious.

But once done, you'll have much more control over what and how data is being exported and merged.

Writing it in PL/SQL is smart simply because this is a database operation. Why drag all of the data out of the server just to stuff it back in to it. The PL/SQL stuff will likely have better overall performance when this is all said and done.

As for ensuring modularity, indention, etc., well, that's why baseball bats were invented.

Will Hartung 2009-09-24 22:58:43

Answer 4

A:

Maybe I'm not reading the requirements correctly but wouldn't a simple

create <dest_table> as select * from <source_table>;

suffice? with a drop first on the dest_table if it already exists?

Venr 2009-09-24 22:59:15

You can't drop the target table if it already exists, because it will contains all the historical, archive data, that is too old for the data warehouse but too good to throw away

Velika 2009-09-24 23:14:56

Answer 5

+1 A:

You say this is a data warehouse. Are you using partitioning? If so, does the partitioning scheme identify the rows you want to archive? If the answer to both questions is "yes" then partition exchange could be the feature you're searching for.

APC 2009-09-25 06:07:08

ansaurus

tags:

views:

answers:

Best way/technology to implement a generic archive process

related questions