views:

3059

answers:

6

The code looks like below:

using System;
using System.Collections.Generic;
using System.Linq;
using System.Text;

namespace Test
{
    public interface IMyClass
    {
     List<IMyClass> GetList();
    }


    public class MyClass : IMyClass
    {
     public List<IMyClass> GetList()
     {
      return new List<IMyClass>();
     }
    }
}

When I Run Code Analysis i get the following recommendation.

Warning 3 CA1002 : Microsoft.Design : Change 'List<IMyClass>' in 'IMyClass.GetList()' to use Collection<T>, ReadOnlyCollection<T> or KeyedCollection<K,V>

How should I fix this and what is good practice here?

+10  A: 

I would personally declare it to return an interface rather than a concrete collection. If you really want list access, use IList<T>. Otherwise, consider ICollection<T> and IEnumerable<T>.

Jon Skeet
Would the IList then extend the ICollection interface?
bovium
Yes, IList<T> extends ICollection<T>. I'll add documentation links.
Jon Skeet
@Jon: I know this is old, but can you comment on what Krzysztof says at http://blogs.msdn.com/b/kcwalina/archive/2005/09/26/474010.aspx? Specifically his comment, `We recommend using Collection<T>, ReadOnlyCollection<T>, or KeyedCollection<TKey,TItem> for outputs and properties and interfaces IEnumerable<T>, ICollection<T>, IList<T> for inputs.` CA1002 seems to go along with Krzysztof's comments. I can't imagine why a concrete collection would be recommended instead of an interface, and why the distinction between inputs/outputs.
Nelson
@Nelson: It's rare that you want to *require* callers to pass in an immutable list, but it's reasonable to *return* one so that they know it's definitely immutable. Not sure about the other collections though. Would be nice to have more detail.
Jon Skeet
It's not for a specific case. Obviously in general `ReadOnlyCollection<T>` wouldn't make sense for an input. Similarly, `IList<T>` as an input says, "I need Sort() or some other member that an IList has" which doesn't make sense for an output. But what I meant was, why would `ICollection<T>` be recommended as an input and `Collection<T>` as an output. Why *not* use `ICollection<T>` as the output also as you suggested?
Nelson
@Nelson: I don't know, to be honest. It would be good to get more information from Krzysztof.
Jon Skeet
I'm thinking it has to do with inambiguity. `Collection<T>` and `ReadOnlyCollection<T>` both derive from `ICollection<T>` (i.e. there is no `IReadOnlyCollection<T>`). If you return the interface, it's not obvious which one it is and whether it can be modified. Anyway, thanks for your input. This was a good sanity check for me.
Nelson
A: 

Well the Collection class is really just a wrapper class around other collections to hide their implementation details and other features. I reckon this has something to do with the property hiding coding pattern in object-oriented languages.

I think you shouldn't worry about it, but if you really want to please the code analysis tool, just do the following:

//using System.Collections.ObjectModel;

Collection<MyClass> myCollection = new Collection<MyClass>(myList);
DrJokepu
That won't work due to generic invariance :(
Jon Skeet
Sorry, that was a typo. I menat Collection<MyClass>. I am really looking forward to get my hands on C# 4 and generic covariance btw!
DrJokepu
A: 

I don't think anyone has answered the "why" part yet... so here goes. The reason "why" you "should" use a Collection instead of a List is because if you expose a "List", then anyone who gets access to your object can modify the items in the list. Whereas Collection is supposed to indicate that you are making your own "Add", "Remove", etc methods.

You likely don't need to worry about it, because you're probably coding the interface for yourself only (or maybe a few collegues). Here's another example that might make sense.

If you have a public array, ex: public int[] MyIntegers { get; }

You would think that because there is only a "get" accessor that no-one can mess with the values, but that's not true. Anyone can change the values inside there just like this: someObject.MyIngegers[3] = 12345;

Personally, I would just use List in most cases. But if you are designing a class library that you are going to give out to random developers, and you need to rely on the state of the objects... then you'll want to make your own Collection and lock it down from there :)

Timothy Khouri
You're pretty far off base.http://blogs.msdn.com/fxcop/archive/2006/04/27/faq-why-does-donotexposegenericlists-recommend-that-i-expose-collection-lt-t-gt-instead-of-list-lt-t-gt-david-kean.aspx
senfo
"If you return List<T> to the client code, you will not ever be able to receive notifications when client code modifies the collection." - FX COP... See also "http://msdn.microsoft.com/en-us/library/0fss9skc.aspx"... wow, seems I'm not off base after all :)
Timothy Khouri
+1  A: 

It's mostly about abstracting your own implementations away instead of exposing the List object to be manipulated directly.

It's not good practice to let other objects (or people) modify the state of your objects directly. Think property getters/setters.

Collection -> For normal collection
ReadOnlyCollection -> For collections that shouldn't be modified
KeyedCollection -> When you want dictionaries instead.

How to fix it depends on what you want your class to do and the purpose of the GetList() method. Can you elaborate?

chakrit
A: 

In these kind of case I usually try to expose the least amount of implemententation that is needed. If the consumers do not need to know that you are actually using a list then you don't need to return a list. By returning as Microsoft suggests a Collection you hide the fact that you are using a list from the consumers of your class and isolate them against an internal change.

Harald Scheirich
+28  A: 

To answer the "why" part of the question as to why not List<T>, The reasons are future-proofing and API simplicity.

Future-proofing

List<T> is not designed to be easily extensible by subclassing it; it is designed to be fast for internal implementations. You'll notice the methods on it are not virtual and so cannot be overridden, and there are no hooks into its Add/Insert/Remove operations.

This means that if you need to alter the behaviour of the collection in the future (e.g. to reject null objects that people try to add, or to perform additional work when this happens such as updating your class state) then you need to change the type of collection you return to one you can subclass, which will be a breaking interface change (of course changing the semantics of things like not allowing null may also be an interface change, but things like updating your internal class state would not be).

So by returning either a class that can be easily subclassed such as Collection<T> or an interface such as IList<T>, ICollection<T> or IEnumerable<T> you can change your internal implementation to be a different collection type to meet your needs, without breaking the code of consumers because it can still be returned as the type they are expecting.

API Simplicity

List<T> contains a lot of useful operations such as BinarySearch, Sort and so on. However if this is a collection you are exposing then it is likely that you control the semantics of the list, and not the consumers. So while your class internally may need these operations it is very unlikely that consumers of your class would want to (or even should) call them.

As such, by offering a simpler collection class or interface, you reduce the number of members that users of your API see, and make it easier for them to use.

Greg Beech
This response is dead on. Another good read on the subject can be found here: http://blogs.msdn.com/fxcop/archive/2006/04/27/faq-why-does-donotexposegenericlists-recommend-that-i-expose-collection-lt-t-gt-instead-of-list-lt-t-gt-david-kean.aspx
senfo
I see your first points, but I don't know if I agree on your API simplicity part.
borisCallens