Hi all. I am having trouble coming up with a good way to store a dataset that continually changes.

I want to track and periodically report on the contents of specific websites. For example, for a certain website I want to keep track of all the PDF documents that are available. Then I want to report periodically (say, quarterly) on the number of documents, PDF version numbers and various other statistics. In addition, I want to track how these metrics change over time. E.g. I want to graph the increase in the number of PDF documents offered on the website over time.

My input is basically a long list of URLs that point to all the PDF documents on the website. These inputs arrive intermittently, but they may not coincide with the dates I want to run the reports on. For example, in Q4 2010 I may get two lists of URLs, several weeks apart. In Q1 2011 I may get just one.

I am having trouble figuring out how to efficiently store this input data in a database of some sort so that I can easily generate the correct reports.

On the one hand, I could simply insert the complete list into a table each time I receive a new list, along with a date of import. But I fear that the table will grow quite big in a short time, and most of it will be duplicate URLs.

On the other hand, I fear that it may get quite complicated to maintain a list of unique URLs or documents, especially when documents are added, removed and then re-added over time. I fear I might get into the complexities of building a temporal database. And I shudder to think what happens when the document itself is updated but the URL stays the same (in that case the metadata might change, such as the PDF version, file size, etcetera).

Can anyone recommend a good way to store this data so I can generate reports from it? I would especially like the ability to generate reports retroactively. E.g. if I start tracking a new website in Q1 2011, I would like to be able to generate a report over the Q4 2010 data as well, even though the Q1 2011 data has already been imported.

Thanks in advance!

A: 

What about using a document database? Instead of saving each URL individually, you save a document that holds a collection of URLs. Then, whenever you run whatever process iterates over all the URLs, you fetch all of the documents that fall within the relevant time frame (or whatever other qualification you have) and run over the URLs in each of those documents.

This could also be emulated in SQL Server by serializing your object to JSON or XML and storing the output in a suitable column.
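A minimal T-SQL sketch of that approach, assuming SQL Server 2016 or later for the built-in JSON support; the table and column names here are just illustrative:

-- Hypothetical snapshot table: one row per imported URL list
CREATE TABLE url_snapshot (
    snapshot_id  INT IDENTITY PRIMARY KEY,
    import_date  DATE          NOT NULL,
    url_list     NVARCHAR(MAX) NOT NULL  -- JSON array of URLs
);

-- Store one imported list as a single JSON document
INSERT INTO url_snapshot (import_date, url_list)
VALUES ('2010-11-15', N'["http://example.com/a.pdf", "http://example.com/b.pdf"]');

-- Count the URLs in every snapshot that falls inside a reporting window
SELECT s.import_date, COUNT(*) AS url_count
FROM url_snapshot AS s
CROSS APPLY OPENJSON(s.url_list) AS j
WHERE s.import_date BETWEEN '2010-10-01' AND '2010-12-31'
GROUP BY s.import_date;

On older versions you can store the list in an XML or plain NVARCHAR(MAX) column the same way and parse it in application code instead.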

Chris Marisic
+1  A: 

Why not just a single table, called something like URL_HISTORY:

URL          VARCHAR  (PK)
START_DATE   DATE     (PK)
END_DATE     DATE
VERSION      VARCHAR

Have END_DATE as either NULL or a suitable dummy date (e.g. 31-Dec-9999) where the version has not been superseded; set END_DATE to the last valid date where the version has been superseded, and create a new record for the new version - e.g.

+------------------+-------------+--------------+---------+
|URL               | START_DATE  |  END_DATE    | VERSION |
+------------------+-------------+--------------+---------+
|..\Harry.pdf      | 01-OCT-2009 |  31-DEC-9999 | 1.1.0   |
|..\SarahJane.pdf  | 01-OCT-2009 |  31-DEC-2009 | 1.1.0   |
|..\SarahJane.pdf  | 01-JAN-2010 |  31-DEC-9999 | 1.1.1   |
+------------------+-------------+--------------+---------+
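A rough T-SQL sketch of that design, with a point-in-time report query on top of it; the column sizes and the quarterly cut-off date are placeholders:

-- Each row records the interval during which a given URL/version was live
CREATE TABLE URL_HISTORY (
    URL         VARCHAR(400) NOT NULL,
    START_DATE  DATE         NOT NULL,
    END_DATE    DATE         NOT NULL DEFAULT '9999-12-31',
    VERSION     VARCHAR(20),
    PRIMARY KEY (URL, START_DATE)
);

-- Report: documents (and their PDF versions) live at the end of Q4 2010
DECLARE @report_date DATE = '2010-12-31';

SELECT VERSION, COUNT(*) AS document_count
FROM URL_HISTORY
WHERE START_DATE <= @report_date
  AND END_DATE   >= @report_date
GROUP BY VERSION;

Because the report only filters on the date interval, it can be re-run retroactively for any past quarter from the same table.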
Mark Bannister
Thanks, this is what I'm going to do. Basically it's half of a temporal database, recording just the valid from-to times and not the transaction from/to time.
Sander Marechal