views: 443
answers: 3

My application includes a client, a load-balanced web tier, a load-balanced application tier, and a database tier. The web tier exposes services to clients and forwards calls on to the application tier. The application tier then executes queries against the database (using NHibernate) and returns the results.

Data is mostly read, but writes occur fairly frequently, particularly as new data enters the system. Much more often than not, data is aggregated and those aggregations are returned to the client - not the original data.

Typically, users will be interested in the aggregation of recent data - say, from the past week. Thus, to me it makes sense to introduce a cache that includes all data from the past 7 days. I cannot just cache entities as and when they are loaded because I need to aggregate over a range of entities, and that range is dictated by the client, along with other complications, such as filters. I need to know whether - for a given range of time - all data within that range is in the cache or not.

In my ideal fantasy world, my services would not have to change at all:

public AggregationResults DoIt(DateTime starting, DateTime ending, Filter filter)
{
    // execute HQL/criteria call and have it automatically use the cache where possible
}

There would be a separate filtering layer that would hook into NHibernate and intelligently and transparently determine whether the HQL/criteria query could be executed against the cache or not, and would only go to the database if necessary. If all the data was in the cache, it would query the cached data itself, kind of like an in-memory database.

However, on first inspection, NHibernate's second level cache mechanism does not seem appropriate for my needs. What I'd like to be able to do is:

  1. Configure it to always have the last 7 days worth of data in the cache. eg. "For this table, cache all records where this field is between 7 days ago and now."
  2. Have the ability to manually maintain the cache. As new data enters the system, it would be nice if I could just throw it straight into the cache rather than waiting until the cache is invalidated. Similarly, as data falls out of the time period, I'd like to be able to pull it from the cache.
  3. Have NHibernate intelligently understand when it can serve a query directly from the cache rather than hitting the database at all. eg. If the user asks for an aggregate of data over the past 3 days, that aggregation should be calculated directly from the cache rather than touching the DB.

Now, I'm pretty sure #3 is asking too much. Even if I can get the cache populated with all the data required, NHibernate has no idea how to efficiently query that data. It would literally have to loop over all entities in order to discriminate which are relevant to the query (which might be fine, to be honest). Also, it would require an implementation of NHibernate's query engine that executed against objects rather than a database. But I can dream, right?

Assuming #3 is asking too much, I would require some logic in my services like this:

public AggregationResults DoIt(DateTime starting, DateTime ending, Filter filter)
{
    if (CanBeServicedFromCache(starting, ending, filter))
    {
        // execute some LINQ to object code or whatever to determine the aggregation results
    }
    else
    {
        // execute HQL/criteria call to determine the aggregation results
    }
}

This isn't ideal because each service must be cache-aware, and must duplicate the aggregation logic: once for querying the database via NHibernate, and once for querying the cache.
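For illustration, here is a minimal sketch of what that duplication might look like. Everything here is hypothetical bookkeeping of my own (the cached window fields, Filter.CanBeAppliedInMemory, filter.Matches), not anything NHibernate provides:

    // Hypothetical bookkeeping - none of this is NHibernate API.
    private DateTime cacheWindowStart;   // start of the fully-cached range (e.g. 7 days ago)
    private List<Entry> cachedEntries;   // all entities within the cached window

    private bool CanBeServicedFromCache(DateTime starting, DateTime ending, Filter filter)
    {
        // Serviceable only if the requested range lies entirely inside the
        // cached window and the filter can be evaluated in memory.
        return starting >= cacheWindowStart
            && ending <= DateTime.Now
            && filter.CanBeAppliedInMemory;
    }

    private AggregationResults AggregateFromCache(DateTime starting, DateTime ending, Filter filter)
    {
        // The same aggregation logic again, this time in LINQ to Objects.
        var perDay = cachedEntries
            .Where(e => e.Timestamp >= starting && e.Timestamp <= ending)
            .Where(e => filter.Matches(e))
            .GroupBy(e => e.Timestamp.Date)
            .Select(g => new { Day = g.Key, Total = g.Sum(e => e.Value) });
        return new AggregationResults(perDay);
    }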

That said, it would be nice if I could at least store the relevant data in NHibernate's second level cache. Doing so would allow other services (that don't do aggregation) to transparently benefit from the cache. It would also ensure that I'm not doubling up on cached entities (once in the second level cache, and once in my own separate cache) if I ever decide the second level cache is required elsewhere in the system.

I suspect if I can get a hold of the implementation of ICache at runtime, all I would need to do is call the Put() method to stick my data into the cache. But this might be treading on dangerous ground...
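For the record, something along these lines appears possible (a rough sketch assuming NHibernate 2.x/3.x, where ISessionFactoryImplementor exposes GetSecondLevelCacheRegion; the region name, key, and value here are placeholders):

    // Rough sketch - dangerous ground, as noted. The second level cache stores
    // NHibernate's internal CacheEntry values keyed by its internal CacheKey
    // type, so hand-rolled Put() calls must replicate formats that NHibernate
    // considers private implementation details.
    var factory = (ISessionFactoryImplementor)sessionFactory;
    ICache cache = factory.GetSecondLevelCacheRegion("MyEntityRegion");
    cache.Put(someKey, someValue);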

Can anyone provide any insight as to whether any of my requirements can be met by NHibernate's second level cache mechanism? Or should I just roll my own solution and forgo NHibernate's second level cache altogether?

Thanks

PS. I've already considered a cube to do the aggregation calculations much more quickly, but that still leaves me with the database as the bottleneck. I may well use a cube in addition to the cache, but the lack of a cache is my primary concern right now.

+1  A: 

Define 2 cache regions "aggregation" and "aggregation.today" with a large expiry time. Use these for your aggregation queries for previous days and today respectively.

In DoIt(), make 1 NH query per day in the requested range using cacheable queries. Combine the query results in C#.
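A sketch of what that per-day loop might look like (the entity and property names are placeholders; SetCacheable and SetCacheRegion are NHibernate's standard query cache hooks):

    // One cacheable query per day; results are combined in memory.
    var dailyTotals = new List<decimal>();
    for (var day = starting.Date; day <= ending.Date; day = day.AddDays(1))
    {
        var region = day == DateTime.Today ? "aggregation.today" : "aggregation";
        var total = session.CreateQuery(
                "select sum(e.Value) from Entry e " +
                "where e.Timestamp >= :start and e.Timestamp < :end")
            .SetDateTime("start", day)
            .SetDateTime("end", day.AddDays(1))
            .SetCacheable(true)
            .SetCacheRegion(region)
            .UniqueResult<decimal?>() ?? 0m;
        dailyTotals.Add(total);
    }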

Prime the cache with a background process that calls DoIt() periodically with the date range you need cached. This process must run more often than the expiry time of the aggregation cache regions, so that entries are refreshed before they expire.

When today's data changes, clear the "aggregation.today" cache region. If you want that region repopulated quickly, either reload it immediately after clearing or run another, more frequent background process that calls DoIt() for today.
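Clearing and re-priming the volatile region is a one-liner each (EvictQueries is the standard ISessionFactory call for dropping a query cache region; Filter.None is a placeholder):

    // When today's data changes, drop only the volatile region...
    sessionFactory.EvictQueries("aggregation.today");
    // ...and optionally re-prime it straight away.
    DoIt(DateTime.Today, DateTime.Today, Filter.None);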

With query caching enabled, NHibernate will pull the results from the cache where possible. Cache hits are keyed on the query and its parameter values.

Lachlan Roche
Have re-read your answer several times to make sure I understand. I think what you're suggesting may at least allow me to leverage the second level cache, but I'd still have to write the aggregation logic twice - once for the cache query, and once for the database query. Right?
Kent Boogaart
My approach caches the aggregate queries directly, by using the exact same query (including cache settings) for cache loading. This lets NH do the actual cache put/get work.
Lachlan Roche
Understood, but that won't work for me because the parameter values - and even the parameters themselves (due to filters) - aren't known until a user makes a request. Still voted you up though, because I think I can at least use the second level cache to store the data, even if I have to manually query it myself.
Kent Boogaart
A: 

When analyzing the NHibernate cache details, I remember reading that you should not rely on the cache being there, which seems like good advice.

Instead of trying to make your O/R mapper cover your application's needs, I think rolling your own data/cache management strategy might be more reasonable.

Also, the 7-day caching rule you describe sounds business-related, which is something the O/R mapper should not know about.

In conclusion: make your app work without any caching at all, then use a profiler (or several - the .NET, SQL, and NHibernate profilers) to see where the bottlenecks are, and start improving the "red" parts by adding caching or other optimizations.

PS: about caching in general - in my experience one caching point is fine, two caches are a gray zone where you should have a strong reason for the separation, and more than two is asking for trouble.

hope it helps

eti
I already have the application working without caching and have already done the performance analysis. Adding a cache will give us 7 times the speed, and increase our scalability greatly since the DB won't be the bottleneck anymore.
Kent Boogaart
+1  A: 

Stop using your transactional (OLTP) data source for analytical (OLAP) queries and the problem goes away.

When a domain-significant event occurs (e.g. a new entity enters the system or is updated), fire an event (a la domain events). Wire up a handler for the event that takes the details of the created or updated entity and stores the data in a denormalised reporting store designed specifically for the aggregates you desire (most likely pushing the data into a star schema). Your reporting then becomes simple querying of aggregates (which may even be precalculated) along predefined axes, requiring nothing more than a select and a few joins. Querying can be carried out using something like L2SQL or even simple parameterised queries and data readers.
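As a rough sketch of the shape of this (the event, handler interface, and reporting store are all placeholder types, not any particular framework):

    // Placeholder types throughout - the shape matters, not the names.
    public interface IHandle<T> { void Handle(T domainEvent); }
    public interface IReportingStore { void IncrementDailyTotal(DateTime day, decimal amount); }

    public class EntryRecorded
    {
        public DateTime Timestamp { get; set; }
        public decimal Value { get; set; }
    }

    public class UpdateDailyAggregates : IHandle<EntryRecorded>
    {
        private readonly IReportingStore store;

        public UpdateDailyAggregates(IReportingStore store) { this.store = store; }

        public void Handle(EntryRecorded e)
        {
            // Fold the new entry into the pre-aggregated row for its day, so
            // reads become a single lookup instead of an on-the-fly aggregation.
            store.IncrementDailyTotal(e.Timestamp.Date, e.Value);
        }
    }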

Performance gains should be significant as you can optimise the read side for fast lookups across many criteria while optimising the write side for fast lookups by id and reduced index load on write.

Additional performance and scalability are also gained: once you have migrated to this approach, you can physically separate your read and write stores, running n read stores for every write store. This allows your solution to scale out to meet increased read demand while write demand grows at a lower rate.

Neal
As stated in my question, moving to OLAP is my secondary concern after caching. A cube will not eliminate the problem of the database being the inhibitor of scalability. It merely has the potential to improve query times, which is not my concern right now (they are actually quite speedy already).
Kent Boogaart
When you read through my response again, you will see that the second paragraph indicates how to separate the read and write stores so that you can scale out, but I'll edit the response to make it clearer. Additionally, separating read and write stores will allow you to create read-specific caches without burdening your write side with additional overhead.
Neal
+1 thanks. I hadn't considered the scalability of it, so good point on that. I'm still pondering whether an OLAP approach could meet all our requirements and whether it's worth it in terms of effort.
Kent Boogaart
You don't need to build a full OLAP-cube-style reporting store to gain benefits. Adding one or more tables that store the data you require in pre-aggregated form, designed specifically to serve the needs of those pages, will greatly simplify your reporting code. Using a domain-events-style approach lets you separate the processing of new data from the creation/updating of the aggregates, and therefore gives you the flexibility to move from relational -> hybrid -> OLAP reporting stores.
Neal