ansaurus

Question

Design suggestions for real-time data aggregation?

Answer 1

+1 A:

The first alternative solution is to aggregate using underlying data changes - ie, when I update the a totals record, go and update the sum total too. To do it this way, you would need the old value however, also it then adds overhead to any changes you make to aggregated values. But if whole purpose of the data existing is to be aggregated it might be a viable option.

I do this with my bank balancing app, whenever I insert/modify/delete a transaction the logic also updates the account balance because the balance is searched many times and can soon become expensive to calculate when there are many transactions.

I think structurally too you may have problems if the sums are stored in the database - such as locking issues. I'd always keep these values in memory.

Update: another possible solution is to pass your data access code through a maintenance layer that keeps aggregated values in memory - this would be blistering quick and virtually 0 overhead on inserting/updating/deleting the underlying data. You could also get clever and have this layer transactable so if the data access action fails, you can rollback your aggregation change.

The only downside is that database changes must go through the layer to avoid invalidating the aggregation, and it will need initializing from the database on first run or restart.

Adam 2010-07-14 16:01:42

Answer 2

+1 A:

Have you had a look at Push Linq or Reactive Extensions (Rx)?

Although I haven't used either, I believe both allow you to use LINQ operators on streaming data.

Further info on Rx can be found on the DevLabs site:

http://msdn.microsoft.com/en-us/devlabs/ee794896.aspx

Winston Smith 2010-07-14 16:02:30

ansaurus

tags:

views:

answers:

Design suggestions for real-time data aggregation?

related questions