ansaurus

Question

Building "isolated" and "automatically updated" caches (java.util.List) in Java.

Answer 1

+2 A:

Here's what you want to do. Something akin to what's known as MVCC, Multi Version Currency Control.

Simply put you need to associate a transaction ID to your cache elements.

So a cache entry will look something like this:

public class CacheEntry {
    long transactionId;
    boolean deleted;
    Object value;
}

Your cache entries are stored in a list, in reverse transactionid order.

When you go to get a cache element, you look up the list (in your hash map). Then you search for the value that has the highest transaction ID that is LESS THAN or EQUAL to the transaction ID of YOUR transaction.

So, let's take the DELETE issue in to account.

You have Transaction 10, looking for "ABC". We'll assume ABC is already in the cache, and it was put in by Transaction 5.

So, T10 gets the list of entries for ABC, searches down the list and finds that at T5, there's the value "123". T5 is the highest transaction less than or equal to T10. T10 changes the value for ABC from 123 to 456.

Now T12 comes along and it looks for ABC. It will find the value to be 456 from T10. T12 decides to delete ABC, and so a "deleted" cache entry for T12 is placed in the cache entry list. If T10 tries to look up ABC again, it will find 456, because 12 > 10, and the highest transaction <= 10 is T10, so it does not see the deletion.

T14 comes along, searches for ABC, CAN'T find it (because it's "deleted"), and inserts a new value of 789. If T12 were to look it would still be deleted, if T10 were, it's still 456.

So, in the end, you cache list looks like this:

{tid: 14 deleted: false value: 789}
{tid: 12 deleted: true  value: nul}
{tid: 10 deleted: false value: 456}
{tid:  5 deleted: false value: 123}

The next problem you have is dealing with visibility across open transactions. i.e. can another transaction see data from another open transaction that's not committed. But that's not too hard, as it just adjusts the criteria when scanning the list of versions for an appropriate candidate. And you can keep a list of transaction IDs, with their status (open, committed, rolledback).

Finally you have to come up with a mechanism to clean up the loose ends. After you commit two transaction, with no other open transactions, the older records can be removed.

For example, if you have the data from T5 and T10, if both of them are committed, no one will ever be able to "see" T5's data again as T10's is now the "current" state. So, the T5 row can be deleted.

This is probably best done by simply iterating over the cache and removing obsolete transaction entries.

That's the gist of it, obviously the devil is in the details.

Will Hartung 2010-04-10 15:48:54

Thankyou for your answer! I'm quite shocked with myself that I didn't see this before.I wonder if there is anything "like" subversion I can use for java (I didnt realise the simillarity until you pointed it out).

Aidos 2010-04-11 03:01:40

ansaurus

tags:

views:

answers:

Building "isolated" and "automatically updated" caches (java.util.List) in Java.

related questions