tags:

views:

342

answers:

3

I have an other active question HERE regarding some hopeless memory issues that possibly involve LOH Fragmentation among possibly other unknowns.

What my question now is, what is the accepted way of doing things? If my app needs to be done in Visual C#, and needs to deal with large arrays to the tune of int[4000000], how can I not be doomed by the garbage collector's refusal to deal with the LOH?

It would seem that I am forced to make any large arrays global, and never use the word "new" around any of them. So, I'm left with ungraceful global arrays with "maxindex" variables instead of neatly sized arrays that get passed around by functions.

I've always been told that this was bad practice. What alternative is there?

Is there some kind of function to the tune of System.GC.CollectLOH("Seriously") ? Are there possibly some way to outsource garbage collection to something other than System.GC?

Anyway, what are the generally accepted rules for dealing with large (>85Kb) variables?

+2  A: 

The first thing that comes to mind is to split the array up into smaller ones, so they don't reach the memory needed for the GC to put in it the LOH. You could spit the arrays into smaller ones of say 10,000, and build an object which would know which array to look in based on the indexer you pass.

Now I haven't seen the code, but I would also question why you need an array that large. I would potentially look at refactoring the code so all of that information doesn't need to be stored in memory at once.

Kevin
My application takes in images of potential size 1024x720, converts that to a matrix of "height" based on pixel intensity. Then it takes that matrix and renders a surface map in OpenGL. So, I really need all that data at the same time.
Gorchestopher H
+2  A: 

You get it wrong. You do NOT need to havean array size 4000000 and you definitely do not need to call the garbace collector.

  • Write your own IList implementation. Like "PagedList"
  • Store items in arrays of 65536 elements.
  • Create an array of arrays to hold the pages.

This allows you to access basically all your elements with ONE redirection only. And, as the individual arrays are smaller, fragmentation is not an issue...

...if it is... then REUSE pages. Dont throw them away on dispose, put them on a static "PageList" and pull them from there first. All this can be transparently done within your class.

The really good thing is that this List is pretty dynamic in the memory usage. You may want to resize the holder array (the redirector). Even when not, it is about 512kbdata per page only.

Second level arrays have basically 64k per byte - which is 8 byte for a class (512kb per page, 256kb on 32 bit), or 64kb per struct byte.

Technically:

Turn int[] into int[][]

Decide whether 32 or 64 bit is better as you want ;) Both ahve advantages and disadvantages.

Dealing with ONE large array like that is unwieldely in any langauge - if you ahve to, then... basically.... allocate at program start and never recreate. Only solution.

TomTom
So, a jagged array int[][] doesn't need to use continuous memory?Does an array such as int[,] behave the same way?
Gorchestopher H
I believe new int[,] would allocate one contiguous block. Jagged arrays are allocated separately.
Paul Williams
+4  A: 

Firstly, the garbage collector does collect the LOH, so do not be immediately scared by its prescence. The LOH gets collected when generation 2 gets collected.

The difference is that the LOH does not get compacted, which means that if you have an object in there that has a long lifetime then you will effectively be splitting the LOH into two sections — the area before and the area after this object. If this behaviour continues to happen then you could end up with the situation where the space between long-lived objects is not sufficiently large for subsequent assignments and .NET has to allocate more and more memory in order to place your large objects, i.e. the LOH gets fragmented.

Now, having said that, the LOH can shrink in size if the area at its end is completely free of live objects, so the only problem is if you leave objects in there for a long time (e.g. the duration of the application).

Strategies to avoid LOH fragmentation are:

  • Avoid creating large objects that hang around. Basically this just means large arrays, or objects which wrap large arrays (such as the MemoryStream which wraps a byte array), as nothing else is that big (components of complex objects are stored separately on the heap so are rarely very big). Also watch out for large dictionaries and lists as these use an array internally.
  • Watch out for double arrays — the threshold for these going into the LOH is much, much smaller — I can't remember the exact figure but its only a few thousand.
  • If you need a MemoryStream, considering making a chunked version that backs onto a number of smaller arrays rather than one huge array. You could also make custom version of the IList and IDictionary which using chunking to avoid stuff ending up in the LOH in the first place.
  • Avoid very long Remoting calls, as Remoting makes heavy use of MemoryStreams which can fragment the LOH during the length of the call.
  • Watch out for string interning — for some reason these are stored as pages on the LOH and can cause serious fragmentation if your application continues to encounter new strings to intern, i.e. avoid using string.Intern unless the set of strings is known to be finite and the full set is encountered early on in the application's life. (See my earlier question.)
  • Use Son of Strike to see what exactly is using the LOH memory. Again see this question for details on how to do this.

Edit: the LOH threshold for double arrays appears to be 8k.

Paul Ruane
This is most likely a bad question... but how can I get CDB and SOS? Google doesn't seem to know. All links I follow to "download CDB" point me to some MSDN search result page listing "DDK3" among other things that don't sound like CDB.
Gorchestopher H
Download 'Debugging Tools for Windows'. CDB is the console debugger and WinDbg is the 'graphical' equivalent (it's still text-mode but rendered in an MDI window). SOS comes with .NET: '%systmeroot%\microsoft.net\framework\v2.0.50727\sos.dll' (.NET 3 uses the .NET 2 SOS.dll, .NET 4 comes with its own version.)
Paul Ruane
Also, you can apparently load SOS from the Immediate window in Visual Studio (avoiding need for CDB), but I have never tried.
Paul Ruane
Well... maybe LOH Fragmentation isn't my problem after all. I don't see the word "free" next to anything...
Gorchestopher H