views:

109

answers:

3

I'm looking to build a library that needs to be very careful about memory management. Basically, I have to create a static factory to "disperse" instances of my tool to requesting objects. (I don't have a choice in this matter, I really do have to use a singleton) We'll call that class FooFactory. FooFactory defines a single method, getFoo(key:String):Foo.

getFoo looks in a private static flash.utils.Dictionary object for the appropriate Foo instance, and either lazy-instantiates it, or simply returns it. In any case, FooFactory MUST keep a reference to each Foo instance created, so all Foo instances can be updated by FooFactory using a method called updateFoos():void.

Here is some pseudo-code of what I'm talking about:

public class FooFactory {
    private static const foos:Dictionary = new Dictionary(true); //use weak keys for gc

    public static function getFoo(key:String):Foo {
        //search for the specified instance in the 'foos' dictionary
        if (foos[key] != null && foos[key] != undefined) {
            return foos[key];
        } else {
            //create foo if it doesn't exist. 
            var foo:Foo = new Foo(key);
            foos[key] = foo;
            return foo;
        }
    }

    public static function updateFoos():void {
        for (var key:String in foos) {
            if (foos[key] != null && foos[key] != undefined) {
                Foo(foos[key]).dispatchEvent(new Event("update"));
            }
        }
    }
}

The actual function and identity of Foo isn't too important.

What IS important is garbage collection in this situation. I created something similar to the above example in the past and had incredible garbage collection issues. (I did use an array rather than a dictionary, which could be part of the problem.) What would happen is that, in my Flex application, modules would never unload, since instances had a reference to a Foo instance which was referenced by the FooFactory, like so: (again, pseudocode)

<?xml version="1.0"?>
<s:Group>
    <fx:Script>
        <![CDATA[
            private static const foo:Foo = FooFactory.getFoo('myfoo');
        ]]>
    </fx:Script>
</s:Group>

What I want to know are the two following things:

  1. Is the pseudo-code above "garbage-collector safe?" IE: Will my modules unload properly and will instances of the Group subclass above get garbage collected?
  2. Is there a way in Flash Player (even in the debug player if need be) that can assist me in counting references so I can test if things are getting garbage collected or not?

I'm aware of the flash.sampler API, but I am not sure as to how to use it to count references.

+1  A: 

I don't think that the pattern you presented should give you problems GC-wise.

private static const foo:Foo = FooFactory.getFoo('myfoo');

Here, your module has a reference to a Foo instance. That means that this Foo instance won't be collectable as long as your module is not collectable. The module has a reference to foo, so here foo is reacheable (if the module is reachable). That's not true the other way round. Even if foo lives forever, it doesn't have a reference to the module, so it won't pint it down.

Of course there could be other stuff going on to prevent your module from being collectable, but foo is not the culprit here, unless foo gets a reference to the module somehow. For instance, the module adds a listener to foo, which for this matter, is the same as writting:

foo.addReference(this); // where this is your module

The fact that you declare the instance as const shouldn't change things per se, either. It only means that the reference stored cannot be changed at a later point. However, if you want to null out foo at some later point, you can't because that would be reassigning the reference; and you can't reassing a const reference (you should get a compiler error). Now, this does tie foo to module. As long as your module is alive it will have a reference to foo, so foo won't be collectable.

Regarding this line:

private static const foos:Dictionary = new Dictionary(true); //use weak keys for gc

It looks like you're trying to build some kind of cache. I'm not sure you want to use weak refs here. (I could be wrong here because I'm making an assumption, and they say assumption is the mother of all... mistakes, but I digress)

In any case, the effect of this is that if a module gets a Foo and at some point the module is successfully unloaded (I mean, cleaned up from memory), that instance of foo could be collected, provided that no one else has a ref to it (that is, the only way to reach it is through the dictionary key, but since the keys are weak referenced, this ref will not count for the purposes of the GC).

Regarding your second question, I'd recommend the FlexBuilder/FlashBuilder profiler, if FB is available to you. It's not the most intuitive tool, granted, but with some practice it could be really useful to track memory problems. Basically, it will let you know how many instances of a given class were created, how many of those are still allive, what objects have references to these instances and where were all these objects allocated (an option not checked by default when you launch the profiler, buy very handy to track a leak).

PS

Regarding your comment:

Perhaps the real issue is the static const reference bound by the Group instance? If that's an issue, I could simply abstract Foo to an interface, then create something called FooWeakReference which would use a weak dictionary to reference the actual Foo object. Thoughts?

Adding this extra layer of indirection only complicates things and makes your code less obvious for no gain here, I think. It's easier to consider the life-cycle of your module and define clear points of initialization and finalization. When it's finalized, make sure you remove any reference to the module added to the foo instance (i.e. if you have added listeners on foo, remove them, etc), so your module is collectable independently of the life-cycle of foo.

As a general rule, whenever a weak reference seems to solve a bug in your app, it's masking another one or covering up for a poor design; there are exceptions (and compromises that have to be made sometimes), but weak refs are abused gratuitously if you ask me; not everyone will agree, I know.

Also, weak-refs open a whole new kind of bugs: what happens if that instance you created lazily vanishes before you can use it or worse, while you are using it? Event listeners that stop working under not deterministically reproducible circumstances (e.g. you added a listener to an object that is gone), possible null references (e.g. you are trying to add a listener to an object that no longer exists), etc, etc. Don't drink the weak reference kool-aid ;).

Addedum

In conclusion, as one last question, is it true for me to say that no AS3 solution exists for counting references? I'm building a complete unit-testing suite for this library I'm building, and if I could do something like Assert.assertEquals(0, getReferenceCount(foo)), that would be rad.

Well, yes. You can't get the reference count of a given object from Actionscript. Even if it were possible, I'm not sure that would help, because reference counting is only a part of how GC works. The other one is a mark and sweep algorithm. So, if an object has a zero ref-count is collectable, but it could have, say, 3 references and still be collectable. To really determine whether an object is collectable or not, you should really be able to hook into the GC routine, I guess, and that's not possible from AS.

Also, this code will never work.

Assert.assertEquals(0, getReferenceCount(foo)

Why? Here you are trying to query some API to know whether an object is collectable or not. Since you can't know that, let's assume this tells you whether an object has been collected or not. The problem is, foo at that point is either null or not null. If it's null, it's not a valid reference, so you can't get any useful information out of it, for obvious reasons. If it's not null, it's a valid reference to an object, then you can access it and it's alive; so you already know the answer to the question you're asking.

Now, I think I undestand your goal. You want to be able to tell, programatically, if you certain objects are being leaked. Up to some extent that's possible. It involves using the flash.sampler API, as you mentioned in your original question.

I suggest you check out the Flash Preload Profiler by jpauclair:

I haven't used it, but it looks like it could be just as good as the FB profiler for memory watching.

Since this is Actionscript code (and since it's open source), you could to use it for what you want. I just skimmed through the code, but I've been able to get a very simple-minded proof of concept by monkey-patching the SampleAnalyzer class:

There's a lot of other things going on in this tool, but I just modified the memory analizer to be able to return a list of the alive objects.

So, I wrote a simple class that would run this profiler. The idea is that when you create an object, you can ask this class to watch it. This objects' allocation id will be looked up in the allocated objects table maintained by the memory profiler and a handle to it will be stored locally (only the id). This id handle will also be returned for convenience. So you can store this id handle and at a later point, use it to check whether the object has been collected or not. Also, there's a method that returns a list of all the handles you added and another one that returns a list of the added handles that point to live objects. A handle will allow you to access the original object (if it hasn't been collected yet), its class and also the allocation stack trace. (I'm not storing the object itself or the NewObjectSample object to avoid accidentally pinning it down)

Now, this is important: this queries for alive objects. The fact that an object is alive doesn't mean it's not collectable. So, this alone doens't mean there's a leak. It could be alive at this point but still it doesn't mean there's a leak. So, you should combine this with forcing GC to get more relevant results. Also, this could be of use if you are watching objects that are owned by you and not shared with other code (or other modules).

So, here's the code to the ProfileRunner, with some comments.

import flash.sampler.Sample;
import flash.sampler.NewObjectSample;
import flash.utils.Dictionary;

class ProfilerRunner {

    private var _watched:Array;

    public function ProfilerRunner() {
        _watched = [];
    }

    public function init():void {
        // setup the analyzer. I just copied this almost verbatim 
        // from SamplerProfiler... 
        // https://code.google.com/p/flashpreloadprofiler/source/browse/trunk/src/SamplerProfiler.as
        SampleAnalyzer.GetInstance().ResetStats();
        SampleAnalyzer.GetInstance().ObjectStatsEnabled = true;
        SampleAnalyzer.GetInstance().InternalEventStatsEnabled = false;         
        SampleAnalyzer.GetInstance().StartSampling();       
    }

    public function destroy():void {
        _watched = null;
    }

    private function updateSampling(hook:Function = null):void {
        SampleAnalyzer.GetInstance().PauseSampling();
        SampleAnalyzer.GetInstance().ProcessSampling();     

        if(hook is Function) {
            var samples:Dictionary = SampleAnalyzer.GetInstance().GetRawSamplesDict();
            hook(samples);
        }

        SampleAnalyzer.GetInstance().ClearSamples();
        SampleAnalyzer.GetInstance().ResumeSampling();          

    }

    public function addWatch(object:Object):WatchHandle {
        var handle:WatchHandle;
        updateSampling(function(samples:Dictionary):void {
            for each(var sample:Sample in samples) {
                var newSample:NewObjectSample;
                if((newSample = sample as NewObjectSample) != null) {
                    if(newSample.object == object) {
                        handle = new WatchHandle(newSample);
                        _watched.push(handle);
                    }
                }
            }           
        });
        return handle;
    }

    public function isActive(handle:WatchHandle):Boolean {
        var ret:Boolean;
        updateSampling(function(samples:Dictionary):void{
            for each(var sample:Sample in samples) {
                var newSample:NewObjectSample;
                if((newSample = sample as NewObjectSample) != null) {
                    if(newSample.id == handle.id) {
                        ret = true;
                        break;
                    }
                }
            }                   
        });
        return ret;
    }

    public function getActiveWatchedObjects():Array {
        var list:Array = [];
        updateSampling(function(samples:Dictionary):void {
            for each(var handle:WatchHandle in _watched) {
                if(samples[handle.id]) {
                    list.push(handle);
                }
            }               
        });
        return list;
    }

    public function getWatchedObjects():Array {
        var list:Array = [];
        for each(var handle:WatchHandle in _watched) {
            list.push(handle);
        }               
        return list;        
    }


}

class WatchHandle {

    private var _id:int;
    private var _objectProxy:Dictionary;
    private var _type:Class;
    private var _stack:Array;

    public function get id():int {
        return _id;
    }

    public function get object():Object {
        for(var k:Object in _objectProxy) {
            return k;
        }
        return null;
    }

    public function get stack():Array {
        return _stack;
    }

    public function getFormattedStack():String {
        return "\t" + _stack.join("\n\t");
    }
    public function WatchHandle(sample:NewObjectSample) {
        _id = sample.id;
        _objectProxy = new Dictionary(true);
        _objectProxy[sample.object] = true;
        _type = sample.type;
        _stack = sample.stack;
    }

    public function toString():String {
        return "[WatchHandle id: " + _id + ", type: " + _type + ", object: " + object + "]";
    }
}

And here's a simple demo of how you'd use it.

It initializes the runner, allocates 2 Foo objects and then, after 2 seconds, it finalizes itself. Note that in the finalizer, I'm nulling out one of the Foo objects and finalizing the profiler. There I try to force GC, wait for some time (GC is not synchronous) and then check if these objects are alive. The first object should return false, and the second true. So, this is the place were you'd put your assert. Keep in mind that all of this will only work in a debug player.

So, without any further addo, here's the sample code:

package {

    import flash.display.Sprite;
    import flash.sampler.NewObjectSample;
    import flash.sampler.Sample;
    import flash.system.System;
    import flash.utils.Dictionary;
    import flash.utils.setTimeout;

    public class test extends Sprite
    {

        private var x1:Foo;
        private var x2:Foo;

        private var _profiler:ProfilerRunner;

        private var _watch_x1:WatchHandle;
        private var _watch_x2:WatchHandle;

        public function test()
        {
            init();
            createObjects();
            setTimeout(finalize,2000);
        }

        public function init():void {
            initProfiler();
        }

        public function finalize():void {
            x1 = null;
            finalizeProfiler(); 
        }   

        private function initProfiler():void {
            _profiler = new ProfilerRunner();
            _profiler.init();
        }

        private function finalizeProfiler():void {
            //  sometimes, calling System.gc() in one frame doesn't work
            //  you have to call it repeatedly. This is a kind of lame workaround
            //  this should probably be hidden in the profiler runner
            var count:int = 0;
            var id:int = setInterval(function():void {
                System.gc();                
                count++;
                if(count >= 3) {
                    clearInterval(id);
                    destroyProfiler();
                }
            },100);
        }

        private function destroyProfiler():void {
            //  boolean check through saved handles
            trace(_profiler.isActive(_watch_x1));
            trace(_profiler.isActive(_watch_x2));
            //  print all objects being watched
            trace(_profiler.getWatchedObjects());   
            //  get a list of the active objects and print them, plus the alloc stack trace    
            var activeObjs:Array = _profiler.getActiveWatchedObjects();
            for each(var handle:WatchHandle in activeObjs) {
                trace(handle);
                trace(handle.getFormattedStack());
            }
            _profiler.destroy();

        }               

        private function createObjects():void {

            x1 = new Foo();
            x2 = new Foo();
                    // add them for watch. Also, let's keep a "handle" to
                    // them so we can query the profiler to know if the object
                    // is alive or not at any given time 
            _watch_x1 = _profiler.addWatch(x1);
            _watch_x2 = _profiler.addWatch(x2);

        }

    }
}

import flash.display.Sprite;

class Foo {

    public var someProp:Sprite;
}

Alternatively, a more light-weight approach for tracking alive objects is storing them in a weak-referenced dictionary, forcing GC and then checking how many objects are stil alive. Check out this answer to see how this could be implemented. The main difference is that this gives you less control, but maybe it's good enough for your purposes. Anyway, I felt like giving the other idea a shot, so I wrote this object watcher and kind of like the idea.

Juan Pablo Califano
You're right about the weak refs--the way the OP is using them has weak refs to the strings, not the objects. Not sure how string allocation works in AS, but watching for the strings being GC'd likely isn't the intention.
Michael Brewer-Davis
@Michael Brewer-Davis. Actually, I didn't realized there were strings! Good catch. Strings are kind of special in that they are immutable, so I'm not totally sure about how that affects the keys of the dict either.
Juan Pablo Califano
Awesome, response, thanks for that :) I have a theory that I'm posting as an answer, please feel free to comment on it and check if I'm making sense ;)
TK Kocheran
Win++. Thanks, as always. Makes things really clear! :)
TK Kocheran
@TK Kocheran. You're welcome! I'm glad you find it useful,
Juan Pablo Califano
A: 

Since you essentially want weak references, perhaps the best solution would involve one of the weak references available in AS3.

For example, have your method store Dictionaries rather than the actual objects. Something like this:

private var allFoos:Dictionary;

public function getFoo(key:String):Foo {
    var f:Foo = _getFoo(key);

    if (f == null) {
        f = _createFoo(key);
    }

    return f;
}

private function _createFoo(key:String):Foo {
    var f:Foo = new Foo();
    var d:Dictionary = new Dictionary(/* use weak keys */ true);
    d[f] = key;

    allFoos[key] = d;
}
Michael Brewer-Davis
A: 

With some intense thinking over the weekend, I believe I figured out what the problem is.

Essentially, we have this scenario:

.--------------.
| APP-DOMAIN 1 |
| [FooFactory] |
'--------------'
       | 
       | < [object Foo]
       |
.--------------.
| APP-DOMAIN 2 |
| [MyModule]   |
'--------------'

APP-DOMAIN 1 always stays in memory, since it's loaded in the highest app-domain possible: the original compiled code of a SWF. APP-DOMAIN 2 is loaded into and out of memory dynamically and must be able to completely sever itself from APP-DOMAIN 1. According to the genius answer above by Juan Pablo Califano, APP-DOMAIN 2 having a reference to [object Foo] doesn't necessarily tie APP-DOMAIN 2 into memory, though it could become tied into memory by [MyModule] adding an event listener to [object Foo], right?

Okay, so, with this in mind, an overkill solution would be to return a weak-reference-implementation of Foo from the getFoo method, since that's where things need to "break off" in case of "emergency." (Things need to be weak from this perspective so that APP-DOMAIN 1 can be garbage collected completely as it is unloaded.) Again, this is an overkill answer.

However, I do not need to keep a weak-ref to Foo in FooFactory, since FooFactory needs to have a surefire way of getting a hold of each created Foo object. In short, Juan Pablo Califano has the theory completely right, it just needs to be tested in the real world in order to prove everything definitively :)


All of this aside, I believe I have uncovered the real issue behind the scenes that caused a similar library I wrote in the past to never GC. The problem was not in the actual library I wrote, but it seems that it was in a reflection library I was using. The reflection library would "cache" every Class object I threw at it, since my original FooFactory.getFoo method took a Class parameter, rather than a String. Since the library seemed to be hard-referencing every Class object passed into memory, I'm pretty sure that was the memory leak.


In conclusion, as one last question, is it true for me to say that no AS3 solution exists for counting references? I'm building a complete unit-testing suite for this library I'm building, and if I could do something like Assert.assertEquals(0, getReferenceCount(foo)), that would be rad.

TK Kocheran
I've added an answer to your last question. Check my edit.
Juan Pablo Califano