Differentiating update of entries (business logic objects) | ansaurus

Q:

Hello,

The situation I'm facing is as follows:

There is a large number of 'flat' files from which data is extracted by a C# app in order to create entries, which are in turn written to a database (MS SQL Server). A full release of the database comprises ~97 million entries across 220 GB.

The task is to create a differential update of the data in the database by parsing a new full release and finding out which of the existing entries have been updated. An entry is considered to be updated if any of its properties has been changed.
[UPDATE] Each entry has a unique ID.

The problem is that the data provider does not supply any indication of entry modification (a version number or a last modification date) - only full releases.

The solution I've come up with so far is to generate a hash sum for each entry and then compare the new hash to the old one.
What makes hash sums undesirable, though, is the combination of the size of the data and the number of entries - it's just staggering.
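
A minimal sketch of the hash comparison described above, in C#. It assumes each entry can be reduced to an ordered list of string property values and that the hashes of the previous release are available keyed by entry ID. `EntryDiff`, `ComputeHash`, `Classify` and `ChangeKind` are hypothetical names, and SHA-256 is only one reasonable choice of hash function.

```csharp
using System;
using System.Collections.Generic;
using System.Security.Cryptography;
using System.Text;

public enum ChangeKind { Unchanged, Updated, New }

public static class EntryDiff
{
    // Hashes the property values of one entry in a fixed order.
    // Each value is length-prefixed so that ("ab", "c") and ("a", "bc")
    // produce different hashes. Assumption: null and empty are treated alike.
    public static string ComputeHash(IEnumerable<string> propertyValues)
    {
        var sb = new StringBuilder();
        foreach (string value in propertyValues)
        {
            string v = value ?? string.Empty;
            sb.Append(v.Length).Append(':').Append(v);
        }

        using (SHA256 sha = SHA256.Create())
        {
            byte[] digest = sha.ComputeHash(Encoding.UTF8.GetBytes(sb.ToString()));
            return Convert.ToBase64String(digest);
        }
    }

    // Compares the hash of a freshly parsed entry with the stored hash
    // for the same ID (oldHashes is a hypothetical ID -> hash lookup).
    public static ChangeKind Classify(IDictionary<long, string> oldHashes, long id, string newHash)
    {
        string oldHash;
        if (!oldHashes.TryGetValue(id, out oldHash))
        {
            return ChangeKind.New;
        }
        return oldHash == newHash ? ChangeKind.Unchanged : ChangeKind.Updated;
    }
}
```

On the size concern: a SHA-256 digest is 32 bytes, so 97 million entries come to roughly 3.1 GB of raw hash data, which is small next to the 220 GB release. Only the IDs and hashes of the old release are needed for the comparison, for example in an extra column next to the ID.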

So, is there a better solution than this?

Any help with the case will be greatly appreciated!

All the best, Borislav

A:

Is there a key that you can use to uniquely identify a record?

If not, you can only find the records that are identical in both releases. You would then need to remove all existing records that are not matched in the new release, and add all the records from the new release that do not match an existing record.

Having a key would make things much easier though.
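
The keyless case described above amounts to two set differences. A rough sketch in C#, assuming each record has already been reduced to a comparable string (its full content or a hash of it); `KeylessDiff`, `ToRemove` and `ToAdd` are hypothetical names, and exact duplicate records collapse into one because sets are used:

```csharp
using System.Collections.Generic;

public static class KeylessDiff
{
    // Records that exist in the old release but not in the new one.
    public static HashSet<string> ToRemove(IEnumerable<string> oldRecords, IEnumerable<string> newRecords)
    {
        var result = new HashSet<string>(oldRecords);
        result.ExceptWith(newRecords);
        return result;
    }

    // Records that exist in the new release but not in the old one.
    public static HashSet<string> ToAdd(IEnumerable<string> oldRecords, IEnumerable<string> newRecords)
    {
        var result = new HashSet<string>(newRecords);
        result.ExceptWith(oldRecords);
        return result;
    }
}
```

Without a key, a changed record shows up as one removal plus one addition, so it cannot be told apart from a genuine delete and insert.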

Johann Blais 2010-09-09 10:58:24

Yes, there is a unique ID for each entry - I've updated the question. Removing an entry purely for the reason that it exists is fine in terms of performance, but updated entries need to be found and marked as such - that's what's been puzzling me.

Borislav T 2010-09-09 13:59:00
