ansaurus

Question

Recommend a fast & scalable persistent Map - Java

Answer 1

+1 A:

I'd likely use a local database. Like say Bdb JE or HSQLDB. May I ask what is wrong with this approach? You must have some reason to be looking for alternatives.

In response to comments: As the problem performance and I guess you are already using JDBC to handle this it might be worth trying HSQLB and reading the chapter on Memory and Disk Use.

mlk 2009-10-08 10:35:28

+1 agree. I would use a regular DB and write a nice API for the requirements so that the backend can be switched easily.

flybywire 2009-10-08 10:40:20

Once Bdb reaches the limits of what can be cached in memory i'm finding that it slows down unacceptably. This generally happens after about 1mm inserts.

Joel 2009-10-08 10:46:44

How about HSQLDB? I'm going to guess they both JDBC so you should be able to slot it in without modifying much of your existing code.Would be worth reading:http://hsqldb.org/doc/2.0/guide/deployment-chapt.html#deployment_mem_disk-sect

mlk 2009-10-08 11:24:28

BDBs slow down once you hit the point that you're thrashing your cache. BDBs essentially have a BTree in memory which tries to answer a request. If the request cannot be answered, the BDB pages in more data from disk. Once your working set is larger than your cache, you'll find trouble. There are JMX methods for monitoring the cache hit misses and cache size: use them to debug your application and if necessary increase the heap and give BDB more cache.

jasonmp85 2010-05-30 23:58:29

Also HSQLDB is **not** an acceptable solution. While it can store a lot of data on disk, it does **not** stream that data from disk when doing reads. It reads the entire `ResultSet` into memory rather than paging it in as you iterate through it. If you ever need to walk over a large portion of a table this will blow out your memory. BDBs handle this just fine. I also believe the the h2 database (http://www.h2database.com/html/main.html ) claims to solve this, though I've never used it.

jasonmp85 2010-05-31 00:00:57

@jasonmp85 - this is exactly what i've found - once the BDB BTree no longer fits in memory you're in trouble.

Joel 2010-06-25 14:58:29

Answer 2

A:

I think Hibernate Shards may easily fulfill all your requirements.

Boris Pavlović 2009-10-08 10:37:29

Answer 3

A:

memcached provides an excellent scalable map-based distributable cache. If you use this and back it with one of the databases mentioned to provide persistance, you may well solve your performance problems, at least for frequently hit keys (as long as you provide enough RAM to cache all the values that are frequently accessed).

Bill Michell 2009-10-08 11:00:36

Answer 4

+1 A:

SQLite does this. I wrote a wrapper for using it from Java: http://zentus.com/sqlitejdbc

As I mention in a comment, I have successfully used SQLite with gigabytes of data and tables of hundreds of millions of rows. If you think out the indexing properly, it's very fast.

The only pain is the JDBC interface. Compared to a simple HashMap, it is clunky. I often end up writing a JDBC-wrapper for the specific project, which can add up to a lot of boilerplate code.

David Crawshaw 2009-10-08 11:03:23

I seriously doubt sqlite would scale to this many records.

Omry 2009-10-08 12:02:53

I have successfully used SQLite with gigabytes of data and tables of hundreds of millions of rows. If you think out the indexing properly, it's very fast.

David Crawshaw 2009-10-08 22:44:26

Answer 5

A:

JBoss (tree) Cache is a great option. You can use it standalone from JBoss. Very robust, performant, and flexible.

james 2009-10-08 15:26:02

Is it persistent?

Seun Osewa 2010-08-09 12:25:43

Answer 6

+1 A:

You may want to look into OrientDB.

Juha Syrjälä 2010-06-12 09:54:16

Answer 7

A:

I've so far found Tokyo Cabinet to be the simplest persistent Hash/Map to integrate into my code.

This abbreviated example, taken from the docs, shows how simple it is to save and retrieve data from a persistent Hash:

    // create the object
    HDB hdb = new HDB();
    // open the database
    hdb.open("casket.tch", HDB.OWRITER | HDB.OCREAT);
    // add item 
    hdb.put("foo", "hop");
    hdb.close();

Joel 2010-09-27 12:21:48

ansaurus

tags:

views:

answers:

Recommend a fast & scalable persistent Map - Java

related questions