ansaurus

Question

Random-access container that does not fit in memory?

Answer 1

+1 A:

You could look into memory mapped files, and then access one of those too.

Liz Albin 2010-01-25 19:34:11

Answer 2

+7 A:

Consider using the STXXL:

The core of STXXL is an implementation of the C++ standard template library STL for external memory (out-of-core) computations, i.e., STXXL implements containers and algorithms that can process huge volumes of data that only fit on disks. While the compatibility to the STL supports ease of use and compatibility with existing applications, another design priority is high performance.

James McNellis 2010-01-25 19:34:48

This looks nice, but I don't know if it's possible to tell it to cache or preload certain results? For example, once I access element n it might be likely that I will access some from n-100 to n+100 soon, so it should load and store them in memory. Maybe I need my own custom solution in such a case?

Frank 2010-01-25 20:07:39

STXXL does not work for me, see the update in my question. Any other ideas?

Frank 2010-01-26 15:00:43

Answer 3

+1 A:

I would implement a basic cache. With this workingset size you will have the best results with a set-associative-cache with x byte cache-lines ( x == what best matches your access pattern ). Just implement in software what every modern processor already has in hardware. This should give you imho the best results. You could than optimize it further if you can optimize the accesspattern to be somehow linear.

DarthCoder 2010-01-25 19:38:06

Answer 4

A:

One solution is to use a structure similar to a B-Tree, indices and "pages" of arrays or vectors. The concept is that the index is used to determine which page to load into memory to access your variable.

If you make the page size smaller, you can store multiple pages in memory. A caching system based on frequency of use or other rule, will reduce the number of page loads.

Thomas Matthews 2010-01-25 22:07:45

Answer 5

A:

I've seen some very clever code that overloads operator[]() to perform disk access on the fly and load required data from disk/database transparently.

SF. 2010-01-26 14:01:54

Sure, I was asking if it's worth writing that code myself (and if so, what's the best approach: database access, etc.?) or if that code is available.

Frank 2010-01-26 14:53:58

ansaurus

tags:

views:

answers:

Random-access container that does not fit in memory?

related questions