JPA cache up-to-dateness with concurrent access to the database

A service tier that implements its persistence based on JPA can profit hugely from the second-level cache that is managed transparently by the JPA provider (e.g. Hibernate, Toplink/Toplink Essentials etc.). When this cache is activated, it holds instances of the persistent classes as soon as they were loaded from the database for the first time. There may be vendor-specific extensions to configure the cache behaviour.

The JPA standard also supports optimistic locking by having a timestamp or version field that is used to avoid data corruption when concurrent updates occur. As this mechanism relies on the data contained in the database it can also be used when other applications or services want to update the data - just include the version field of the record in the update and you're done.

When it comes to caching, the behaviour seems to be that the JPA provider (at least Toplink Essentials) does not notice changes in the database that aren't performed using the EntityManager.

Is this really the default behaviour and the responsibility to update/invalidate the JPA provider cache is up to the application? If yes, this seems quite counter-intuitive to the fact that most databases are used by many different applications.

This behavior is also true for Hibernate (19.2. The Second Level Cache):

Be aware that caches are not aware of changes made to the persistent store by another application. They can, however, be configured to regularly expire cached data.

This sounds reasonable. How should your cache know about any changes done by another application if this one doesn't use the cache? The only possibility left for the cache/JPA provider would be to monitor the database (all tables, all data) for changes and update accordingly. This isn't really feasible.

I've also read that it's strongly discouraged to use the second-level cache when multiple applications change the same data in the database, for the above mentioned reasons.

Sure, I understand the implications on performance for monitoring the complete database. I wonder if it could be a good trade-off if the mechanism that is used for optimistic locking (version field) were used for "pessimistic caching" i.e. comparing the version of the objects in the cache with the DB and reloading them from the database if necessary as soon as they are requested by the application...

Martin Klinke 2009-10-09 11:37:46

ansaurus

tags:

views:

answers:

JPA cache up-to-dateness with concurrent access to the database

related questions