Generically, a covariant type parameter is one which is allowed to vary down as the class is subtyped (alternatively, vary with subtyping, hence the "co-" prefix). More concretely:
trait List[+A]
List[Int]
is a subtype of List[AnyVal]
because Int
is a subtype of AnyVal
. This means that you may provide an instance of List[Int]
when a value of type List[AnyVal]
is expected. This is really a very intuitive way for generics to work, but it turns out that it is unsound (breaks the type system) when used in the presence of mutable data. This is why generics are invariant in Java. Brief example of unsoundness using Java arrays (which are erroneously covariant):
Object[] arr = new int[1];
arr[0] = "Hello, there!";
We just assigned a value of type String
to an array of type int[]
. For reasons which should be obvious, this is bad news. Java's type system actually allows this at compile time. The JVM will "helpfully" throw an ArrayStoreException
at runtime. Scala's type system prevents this problem because the type parameter on the Array
class is invariant (declaration is [A]
rather than [+A]
).
Note that there is another type of variance known as contravariance. This is very important as it explains why covariance can cause some issues. Contravariance is literally the opposite of covariance: parameters vary upward with subtyping. It is a lot less common partially because it is so counter-intuitive, though it does have one very important application: functions.
trait Function1[-P, +R] {
def apply(p: P): R
}
Notice the "-" variance annotation on the P
type parameter. This declaration as a whole means that Function1
is contravariant in P
and covariant in R
. Thus, we can derive the following axioms:
T1' <: T1
T2 <: T2'
---------------------------------------- S-Fun
Function1[T1, T2] <: Function1[T1', T2']
Notice that T1'
must be a subtype (or the same type) of T1
, whereas it is the opposite for T2
and T2'
. In English, this can be read as the following:
A function A is a subtype of another function B if the parameter type of A is a supertype of the parameter type of B while the return type of A is a subtype of the return type of B.
The reason for this rule is left as an exercise to the reader (hint: think about different cases as functions are subtyped, like my array example from above).
With your new-found knowledge of co- and contravariance, you should be able to see why the following example will not compile:
trait List[+A] {
def cons(hd: A): List[A]
}
The problem is that A
is covariant, while the cons
function expects its type parameter to be contravariant. Thus, A
is varying the wrong direction. Interestingly enough, we could solve this problem by making List
contravariant in A
, but then the return type List[A]
would be invalid as the cons
function expects its return type to be covariant.
Our only two options here are to a) make A
invariant, losing the nice, intuitive sub-typing properties of covariance, or b) add a local type parameter to the cons
method which defines A
as a lower bound:
def cons[B >: A](v: B): List[B]
This is now valid. You can imagine that A
is varying downward, but B
is able to vary upward with respect to A
since A
is its lower-bound. With this method declaration, we can have A
be covariant and everything works out.
Notice that this trick only works if we return an instance of List
which is specialized on the less-specific type B
. If you try to make List
mutable, things break down since you end up trying to assign values of type B
to a variable of type A
, which is disallowed by the compiler. Whenever you have mutability, you need to have a mutator of some sort, which requires a method parameter of a certain type, which (together with the accessor) implies invariance. Covariance works with immutable data since the only possible operation is an accessor, which may be given a covariant return type.