tags:
views: 141
answers: 3

I'm working on a web crawler (please don't suggest an existing one; it's not an option). I have it working the way it's expected to. My only issue is that I'm currently using a sort of server/client model whereby the server does the crawling, processes the data, and then puts it in a central location.

This location is an object created from a class I wrote. Internally the class maintains a hashmap defined as HashMap<String, HashMap<String, String>>.

I store data in the map with the URL as the key (I keep these unique), and the inner hashmap stores the corresponding data fields for that URL, such as title, value, etc.
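A minimal sketch of that structure (class and method names are illustrative, not from the original code; a ConcurrentHashMap is used here since the spider is multi-threaded):

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// URL -> (field name -> field value), e.g. "http://..." -> {"title": "..."}
class CrawlStore {
    private final Map<String, Map<String, String>> pages = new ConcurrentHashMap<>();

    // Store one metadata field for a URL, creating the inner map on first use.
    void put(String url, String field, String value) {
        pages.computeIfAbsent(url, k -> new ConcurrentHashMap<>()).put(field, value);
    }

    // Return a field for a URL, or null if the URL or field is unknown.
    String get(String url, String field) {
        Map<String, String> fields = pages.get(url);
        return fields == null ? null : fields.get(field);
    }
}
```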

I occasionally serialize the internal objects, but the spider is multi-threaded, and as soon as I have, say, 5 threads crawling, the memory requirements go up exponentially.

So far the performance with the hashmap has been excellent, crawling 15K URLs in 2.5 minutes with about 30 seconds of CPU time, so I really don't need to be pointed in the direction of an existing spider like most forum users have suggested.

Can anyone suggest a fast disk-based solution that supports concurrent reading and writing? The data structure doesn't have to be the same; it just needs to be able to store related meta tag values together, etc.

Thanks in advance.

+3  A: 

I suggest using EhCache for this, even though what you're building isn't really a cache. EhCache allows you to configure the cache instance so that it overflows to disk storage, while keeping the most recent items in memory. It can also be configured to be disk-persistent, i.e. data is flushed to disk on shutdown and read back into memory at startup. On top of all that, it's key-value based, so it already fits your model. It supports concurrent access, and since the disk storage is managed as a separate thread, you shouldn't need to worry about disk access concurrency.
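For illustration, an ehcache.xml along those lines might look like this (cache name, sizes, and paths are assumptions, and this sketch uses the classic Ehcache 2.x configuration attributes):

```xml
<ehcache>
  <!-- Directory where the overflow/persistence files are written -->
  <diskStore path="java.io.tmpdir"/>

  <!-- One entry per crawled URL; entries beyond 10000 spill to disk,
       and diskPersistent keeps them across restarts -->
  <cache name="crawledPages"
         maxElementsInMemory="10000"
         eternal="true"
         overflowToDisk="true"
         diskPersistent="true"/>
</ehcache>
```

In code you would then obtain the cache from a CacheManager and put/get Element objects keyed by URL.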

Alternatively, you could consider a proper embedded database such as Hypersonic (or numerous others of a similar style), but that's probably going to be more work.

skaffman
I'm looking into it (EhCache, I mean). It reminds me of Oracle Berkeley DB in a way...
robinsonc494
I went for an embedded DB instead, using HSQLDB. Thanks for the suggestion, everyone; it was much appreciated.
robinsonc494
For future reference to anyone looking for something like this: I implemented HSQLDB and Hypersonic, but both caused seriously high CPU usage. I'm testing an implementation of SQLite from http://www.zentus.com/sqlitejdbc/ and so far having better results.
robinsonc494
A: 

What about using JPA in your class and persisting the data to a database (which can be file-based, like SQLite)? http://en.wikipedia.org/wiki/Java_Persistence_API

gersh
I'm already serializing, but the objects are loaded back onto the heap with their original size anyway, which defeats the whole point.
robinsonc494
No, JPA is not about serializing; it is about persisting objects to a database. Your object is persisted to the database and not loaded back (well, Java objects are not destroyed; they are garbage collected when they are dereferenced).
gersh
A: 

There is Tokyo Cabinet, which is a fast implementation of a disk-based hash table.

In your case, I think the best way to store values in such a setup would be to prefix the metadata keys with the URL:

[url]_[name] => [value]
[url]_[name2] => [value2]

Unfortunately, I'm not sure you can enumerate the metadata for a given URL, using this solution.
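As a sketch of that flat key scheme, here is the same idea in plain Java, with a TreeMap standing in for the disk store (names are illustrative). With an ordered store, such as Tokyo Cabinet's B+-tree database rather than its hash database, a range scan over the key prefix does let you enumerate a URL's metadata:

```java
import java.util.NavigableMap;
import java.util.SortedMap;
import java.util.TreeMap;

// Flat store of "[url]<sep>[name] => [value]" entries. A '\u0000' separator
// is used instead of '_' so URLs containing underscores cannot collide.
class FlatMetaStore {
    private final NavigableMap<String, String> store = new TreeMap<>();

    private static String key(String url, String field) {
        return url + '\u0000' + field;
    }

    void put(String url, String field, String value) {
        store.put(key(url, field), value);
    }

    // Enumerate all metadata for one URL via a range scan over the sorted keys.
    SortedMap<String, String> metadataFor(String url) {
        String lo = url + '\u0000';             // first possible key for this URL
        String hi = url + '\u0001';             // just past the last possible key
        return store.subMap(lo, hi);
    }
}
```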

If you want to use a more structured data store, there are also MongoDB and SQLite, which I would recommend.

SirDarius
As with the above, I'm looking into Tokyo Cabinet. Thanks for the suggestion; I'll get back to everyone after I see how it goes.
robinsonc494