views: 350
answers: 4
Following on from a previous question relating to heap usage restrictions, I'm looking for a good standard C++ class for dealing with big arrays of data in a way that is both memory efficient and speed efficient. I had been allocating the array using a single malloc/HeapAlloc, but after multiple tries with various calls I keep falling foul of heap fragmentation. So the conclusion I've come to, other than porting to 64 bit, is to use a mechanism that allows me to have a large array spanning multiple smaller memory fragments. I don't want an alloc per element as that is very memory inefficient, so the plan is to write a class that overrides the [] operator and selects the appropriate element based on the index. Is there already a decent class out there to do this, or am I better off rolling my own?

From my understanding, and some googling, a 32-bit Windows process should theoretically be able to address up to 2GB. Now assuming I have 2GB installed, and various other processes and services are hogging about 400MB, how much usable memory do you think my program can reasonably expect to get from the heap?
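
(For reference, the process can be asked directly how much of its own virtual address space is still free, which is a better guide than working back from installed RAM. A minimal sketch using the Win32 GlobalMemoryStatusEx call follows; the API is real, the reporting code around it is just illustrative, and fragmentation means the largest single allocation can be much smaller than the figure reported.)

#include <windows.h>
#include <cstdio>

void ReportAddressSpace()
{
    MEMORYSTATUSEX status;
    status.dwLength = sizeof(status);       // must be set before the call
    if (GlobalMemoryStatusEx(&status))
    {
        // ullTotalVirtual is the size of this process's user-mode address
        // space (~2GB for a normal 32-bit process); ullAvailVirtual is how
        // much of it is currently unreserved.
        printf("Total virtual: %I64u MB, available: %I64u MB\n",
               status.ullTotalVirtual / (1024 * 1024),
               status.ullAvailVirtual / (1024 * 1024));
    }
}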

I'm currently using various flavours of Visual C++.

Edit: As per Poita's post, I've tried a std::deque, using the following test on VS2008:

#include <deque>
using namespace std;
struct V                // 88 bytes per element
{
    double  data[11];
};

struct T                // 32 bytes per element
{
    long    data[8];
};


void    dequeTest()
{
    deque<V> VQ;
    deque<T> TQ;

    V defV;
    T defT;

    VQ.resize(4000000,defV);    // ~352MB of payload
    TQ.resize(8000000,defT);    // ~256MB of payload
}

The total memory for the above data comes out at 608MB were I to use a straight malloc or HeapAlloc, and takes < 1 second. The deque resizes took the process to 950MB originally, and then it slowly started dropping back. 15 minutes later, dequeTest() finished, with the process showing just 6MB of memory, which probably had more to do with the run-time than the container itself. I also tried populating the deque using various push options, but performance was so bad I had to break out early. I could possibly provide a better allocator than the default to get a much better response, but on the face of it deque is not the class for this job. Note this could also relate to the MS VS2008 implementation of deque, as there seems to be a lot in this class that is very implementation dependent when it comes to performance.

Time to write my own big array class, I reckon.

Second Edit: Allocating smaller amounts yielded 1.875GB immediately, using the following:

#define TenMB (1024*1024*10)

void    SmallerAllocs()
{
    size_t  Total = 0;
    LPVOID  p[200];                     // up to 200 x 10MB = ~2GB attempted
    for (int i = 0; i < 200; i++)
    {
        p[i] = malloc(TenMB);
        if (p[i])
            Total += TenMB;
        else
            break;
    }

    CString Msg;
    Msg.Format("Allocated %0.3lfGB", Total / (1024.0*1024.0*1024.0));
    AfxMessageBox(Msg, MB_OK);
}

Final edit: I have decided to accept Poita's post and the various comments following it, not because I'll be using the deque class directly, but more for the array-as-a-deck-of-cards notion in the comments that followed. This should be straightforward to implement with O(1) random element access, based on a fixed number of elements per block, which is what I need. Thanks to all for the feedback!
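
For anyone landing here later, here is a minimal sketch of the sort of block-backed array described above. The class and parameter names are mine, not from the finished implementation: each block is a separate heap allocation holding a fixed number of elements, so no single allocation needs to be huge, and operator[] stays O(1).

#include <cstddef>
#include <vector>

template <class T, std::size_t ElementsPerBlock = 65536>
class BlockArray
{
public:
    explicit BlockArray(std::size_t size) : size_(size)
    {
        // Round up; the last block may carry some unused slack.
        std::size_t blockCount = (size + ElementsPerBlock - 1) / ElementsPerBlock;
        blocks_.resize(blockCount);
        for (std::size_t b = 0; b < blockCount; ++b)
            blocks_[b] = new T[ElementsPerBlock];
    }

    ~BlockArray()
    {
        for (std::size_t b = 0; b < blocks_.size(); ++b)
            delete [] blocks_[b];
    }

    // One divide and one modulo locate the element; with a power-of-two
    // block size the compiler reduces these to shifts and masks.
    T& operator[](std::size_t i)
    {
        return blocks_[i / ElementsPerBlock][i % ElementsPerBlock];
    }
    const T& operator[](std::size_t i) const
    {
        return blocks_[i / ElementsPerBlock][i % ElementsPerBlock];
    }

    std::size_t size() const { return size_; }

private:
    BlockArray(const BlockArray&);              // non-copyable for brevity
    BlockArray& operator=(const BlockArray&);

    std::size_t size_;
    std::vector<T*> blocks_;
};

// Usage, mirroring the deque test above:
//     BlockArray<V> VQ(4000000);
//     VQ[1234567].data[0] = 1.0;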

+3  A: 

From the point of view of your program you always have 2GB available on startup, no matter what else is going on in the system. I don't believe Windows provides a way to detect whether your memory is being paged out to disk or not. As far as your data structures go, it sounds like you're describing something similar to how a deque is implemented in the STL.

tloach
I think you mean deque (http://www.cplusplus.com/reference/stl/deque/), not dequeue.
Bill
Right you are. Fixed.
tloach
+10  A: 

Have you tried using a std::deque? Unlike std::vector, which uses one huge heap allocation, deque usually allocates in small chunks, but still provides constant-time indexing via operator[].

Peter Alexander
I'll have a look at the deque implementation, but I'd be concerned about how small the chunks are. I'm dealing with many millions of relatively small structures, so any implementation that individually allocates memory on a per-item basis is liable to be prohibitively memory inefficient.
Shane MacLaughlin
I don't know what policy it uses for sizing, but it is certainly far more than one element per chunk. I doubt the memory overhead would be more than 10%, and I would expect it to be <5%.
Peter Alexander
After reading up on std::deque, specifically on using allocators, I can't find anything that states that it will not attempt to allocate all memory in a single contiguous block. See http://www.cplusplus.com/reference/std/memory/allocator/ Put another way, I can't find anything to suggest that adding 1GB of data to a deque is any more likely to succeed than using a HeapAlloc to do the same thing. Do you have any references to suggest that a deque will use multiple heap allocations for very large amounts of data, and if so how does it fragment them?
Shane MacLaughlin
IIRC, deques are implemented in blocks of a set size. So if the block size was 100 elements and you allocated 450 elements, you would have 5 blocks, which might happen to be contiguous but not necessarily so. The internal representation of each block would be contiguous, though.
graham.reeds
@Shane, it comes from the fact that deques require O(1) time to do a push_front, as well as doing operator[], and the only way to achieve that is by allocating in blocks.
Peter Alexander
@graham, Herb Sutter's argument supports what you are saying (http://www.gotw.ca/gotw/054.htm). Another useful post (http://www.dreamincode.net/forums/index.php?showtopic=35344) suggests that deque performance in terms of random access isn't that good, which is a big no-no for me.
Shane MacLaughlin
@Shane - Does acceptance of this answer imply that you tried this and it worked out for you after all? Your last comment was rather negative.
T.E.D.
@T.E.D., nope, std::deque didn't suit my exact requirements, but the answer and subsequent comments pushed me in the direction I needed to go. I'm currently implementing a solution as a virtual array represented internally as a secondary array of large blocks. Getting an element from a subscript is a simple matter of block number = subscript / number of elements per block, block position = subscript % number of elements per block. Deque-like but not a deque, and hopefully very fast and efficient. I'll post it up when finished and tested.
Shane MacLaughlin
Hmmm. It might be worth putting this in an answer yourself. I've done that for my own questions in the past. I'm pretty sure it's kosher.
T.E.D.
+4  A: 

Exactly how sparse is this array? If there are large amounts of empty (unused) space in it, you may want to take another approach. The answer to this question suggests an STL map.

If it isn't sparse (as mentioned in the comments), one thing you might look into since you are running on Windows is using a memory-mapped file. Although your OS may be 32-bit, your file system is not. This does of course mean there will be swapping going on though, which is liable to be quite a bit slower than if you could really put the whole darn thing in RAM.
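
As a rough illustration of the idea (the file name, sizes, and error handling are purely placeholders), you could back the data with a disk file and map a window of it at a time with the Win32 file-mapping API, remapping the view as you move through the data:

#include <windows.h>

void MappedFileSketch()
{
    const ULONGLONG totalBytes  = 4ULL * 1024 * 1024 * 1024;   // 4GB of data on disk
    const SIZE_T    windowBytes = 256 * 1024 * 1024;           // 256MB view at a time

    HANDLE hFile = CreateFile("bigarray.dat", GENERIC_READ | GENERIC_WRITE,
                              0, NULL, CREATE_ALWAYS, FILE_ATTRIBUTE_NORMAL, NULL);
    if (hFile == INVALID_HANDLE_VALUE)
        return;

    // The file is extended to totalBytes when the mapping is created.
    HANDLE hMap = CreateFileMapping(hFile, NULL, PAGE_READWRITE,
                                    (DWORD)(totalBytes >> 32),
                                    (DWORD)(totalBytes & 0xFFFFFFFF), NULL);
    if (hMap)
    {
        // Map one window; the offset must be a multiple of the system
        // allocation granularity (64KB). Remap a different window as needed.
        ULONGLONG offset = 0;
        void* view = MapViewOfFile(hMap, FILE_MAP_ALL_ACCESS,
                                   (DWORD)(offset >> 32),
                                   (DWORD)(offset & 0xFFFFFFFF), windowBytes);
        if (view)
        {
            // ... treat 'view' as ordinary memory for this 256MB slice ...
            UnmapViewOfFile(view);
        }
        CloseHandle(hMap);
    }
    CloseHandle(hFile);
}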

Also, you really ought to consider knocking the system's RAM up to the max (3GB on 32-bit Windows I believe) to see if that fixes it for you. That should only cost you about $100, and you are spending way more than that in man-hours just worrying about this.

T.E.D.
No unused space at all, unfortunately. The data in question is a TIN network comprising 3D coordinates and the triangles linking them together, jointly representing a large irregular surface. Thanks for the link in any case; it may well be useful elsewhere.
Shane MacLaughlin
Having 2GB installed != 2GB of physical RAM available. Even with 4GB installed you may not have more than 2.5GB available if you have a graphics card with a lot of memory (if you're doing 3D work then this is a distinct possibility). Personally, I'd make sure you have 4+GB installed and a 64-bit OS.
graham.reeds
@graham - He already said he was using a 32-bit OS, and was asking what he could do short of upgrading to 64bit. That's why I suggested going up to 3GB. That being said, I agree that using a 64-bit OS would probably be a good (and relatively cheap) solution.
T.E.D.
Porting to 64 bit is certainly on the cards for later this year, but it is only part of the solution. We have a lot of clients with 32-bit laptops out there who aren't likely to upgrade any time soon, and a good 32-bit solution that solves a problem on their current hardware has real value.
Shane MacLaughlin
+1  A: 

std::deque does exactly what you're describing, but usually at the granularity of the OS page size (that is, the chunks it allocates are usually 4 kB).

If you're unhappy with deque's default performance, you might be able to write a custom allocator that grabs bigger chunks--that is, gets 1 MB or more at a time.
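
One way to interpret that suggestion is an arena that grabs 1MB slabs from the heap and serves the container's many small allocations out of them. This is only a sketch of the idea, with hypothetical names, not a drop-in replacement: deallocation is deferred until the arena is destroyed, and an older MSVC standard library may place extra requirements on stateful allocators.

#include <cstddef>
#include <cstdlib>
#include <new>
#include <vector>

// Arena that bump-allocates from large slabs grabbed via malloc.
class SlabArena
{
public:
    explicit SlabArena(std::size_t slabBytes = 1024 * 1024)
        : slabBytes_(slabBytes), used_(slabBytes) {}

    ~SlabArena()
    {
        for (std::size_t i = 0; i < slabs_.size(); ++i) std::free(slabs_[i]);
        for (std::size_t i = 0; i < big_.size();   ++i) std::free(big_[i]);
    }

    void* allocate(std::size_t bytes)
    {
        bytes = (bytes + 15) & ~std::size_t(15);        // preserve 16-byte alignment
        if (bytes > slabBytes_)                         // oversized: give it its own block
        {
            void* p = std::malloc(bytes);
            if (!p) throw std::bad_alloc();
            big_.push_back(static_cast<char*>(p));
            return p;
        }
        if (used_ + bytes > slabBytes_)                 // current slab exhausted
        {
            char* s = static_cast<char*>(std::malloc(slabBytes_));
            if (!s) throw std::bad_alloc();
            slabs_.push_back(s);
            used_ = 0;
        }
        void* p = slabs_.back() + used_;
        used_ += bytes;
        return p;
    }

private:
    std::size_t slabBytes_, used_;
    std::vector<char*> slabs_, big_;
};

// Minimal C++03-style allocator that forwards to a shared SlabArena.
template <class T>
class arena_allocator
{
public:
    typedef T value_type;
    typedef T* pointer;               typedef const T* const_pointer;
    typedef T& reference;             typedef const T& const_reference;
    typedef std::size_t size_type;    typedef std::ptrdiff_t difference_type;
    template <class U> struct rebind { typedef arena_allocator<U> other; };

    explicit arena_allocator(SlabArena* a) : arena_(a) {}
    template <class U> arena_allocator(const arena_allocator<U>& rhs) : arena_(rhs.arena_) {}

    pointer allocate(size_type n, const void* = 0)
        { return static_cast<pointer>(arena_->allocate(n * sizeof(T))); }
    void deallocate(pointer, size_type) {}              // arena frees everything at once

    pointer address(reference x) const { return &x; }
    const_pointer address(const_reference x) const { return &x; }
    size_type max_size() const { return size_type(-1) / sizeof(T); }
    void construct(pointer p, const T& v) { new (p) T(v); }
    void destroy(pointer p) { p->~T(); }

    SlabArena* arena_;                                  // public so rebound copies can share it
};

template <class T, class U>
bool operator==(const arena_allocator<T>& a, const arena_allocator<U>& b) { return a.arena_ == b.arena_; }
template <class T, class U>
bool operator!=(const arena_allocator<T>& a, const arena_allocator<U>& b) { return a.arena_ != b.arena_; }

// Usage sketch:
//     SlabArena arena(1024 * 1024);           // 1MB slabs
//     arena_allocator<V> alloc(&arena);
//     std::deque<V, arena_allocator<V> > VQ(alloc);
//     VQ.resize(4000000, defV);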

As others have said, your process's virtual address space is completely independent of all other processes, so you can address 2GB no matter what else is going on in your system. The OS will swap your memory pages to/from disk as necessary to fit the constraints of the amount of installed memory and all the processes contending for it. This will happen at the 4 kB page size, independent of how big your chunks are.

Drew Hall