ansaurus

Question

Is there a programming language where types can be parametrized by values?

Answer 1

+3 A:

I think you're basically describing Dependent Types. There exist a number of (mostly research) languages that implement these, linked from the article. It tends to become intractable to automatically prove type inhabitation in the general case (i.e. type checking gets very hard, or is in the general case not decidable), but there have been some practical examples of their use.

Gian 2010-08-22 13:34:03

Answer 2

+1 A:

Ada95 supports generic formal parameters that are values. In the example on this page, Size is a generic formal parameter whose value is required to be a positive integer.

Stephen C 2010-08-22 13:43:31

Are these statically checkable somehow or do they end up as runtime assertions?

Gian 2010-08-22 13:52:42

I don't know ada, but at first glance this looks to be the same as Gabriel's template example, i.e. the value of Size needs to be known at compile time.

sepp2k 2010-08-22 14:18:50

Any type conversion involving constrained types can throw Constraint_Error, and I don't see any reason why conversion of values for generic formal parameters would be different. There is no mention in the (draft 4.0) Ada95 spec that expressions providing formal parameter values need to be compile-time evaluable.

Stephen C 2010-08-22 14:38:04

Answer 3

+4 A:

Types that are parametrized by values are called dependent types.¹ There has been a lot of research on the topic of dependent types, but little of it has reached “mainstream language”.

The big problem with dependent types is that if your types contains expressions, i.e., bits of code, then the type checker must be able to execute code. This can't be done in full generality: what if the code has side effects? what if the code contains an infinite loop? For example, consider the following program in a C-like syntax (error checking omitted):

int a, b;
scanf("%d %d", &a, &b);
int u[a], v[b];

How could the compiler know whether the arrays u and v have the same size? It depends on the numbers the user enters! One solution is to forbid side effects in expressions that appear in types. But that doesn't take care of everything:

int f(int x) { while (1); }
int u[f(a)], v[f(b)];

the compiler will go into an infinite loop trying to decide whether u and v have the same size.

^<expanded>
So let's forbid side effects inside types, and limit recursion and looping to provably terminating cases. Does it make type checking decidable? From a theoretic point of view, yes, it can. What you have might be something like a Coq proof term. The problem is that type checking is then easily decidable if you have enough type annotations (type annotations are the typing information that the programmer supplies). And here enough means a lot. An awful lot. As in, type annotations at every single language constructs, not just variable declarations but also function calls, operators and all the rest. And the types would represent 99.9999% of the program size. It would often be faster to write the whole thing in C++ and debug it than writing the whole program with all required type annotations.

Hence the difficulty here is to have a type system that requires only a reasonable amount of type annotations. From a theoretical point of view, as soon as you allow leaving off some of the type annotations, it becomes a type inference problem rather than a pure type checking problem. And type inference is undecidable for even relatively simple type systems. You can't easily have a decidable (guaranteed to terminate) static (operating at compile time) reasonable (not requiring an insane amount of type annotations) dependent type system.
_</expanded>

Dependent types do sometimes appear in a limited form in mainstream languages. For example, C99 allows arrays whose size is not a constant expression; the type of such an array is a dependent type. Unsurprisingly for C, the compiler is not required to check bounds on such an array, even when it would be required to check bounds for an array of constant size.

More usefully, Dependent ML is a dialect of ML with types that can be indexed by simple integer expressions. This allows the type checker to check most array bounds statically.

Another example of dependent type appears in module systems for ML. Modules and their signatures (also called interfaces) are similar to expressions and types, but rather than describing computations, they describe the structure of the program.

Dependent types turn up very often in languages that are not programming languages in the sense most programmers would recognize, but rather languages for proving mathematical properties of programs (or just mathematical theorems). Most of the examples in the wikipedia page are of that nature.

¹ _{More generally, type theorists classify type systems according to whether they have Higher-order types (types parametrized by types), polymorphism (expressions parametrized by types), and dependent types (types parametrized by expressions). This classification is called Barendregt's cube or the lambda cube. In fact it is a hypercube, but usually the fourth dimension (expressions parametrized by expressions, i.e., functions) goes without saying.}

Gilles 2010-08-22 22:44:12

Interesting! Maybe if the type checker could be limited to constant expressions and literals only, possibly with a non-turing complete subset of the language it could be an interesting addition to a language (such as enforcing basic preconditions and invariants). I'll investigate on this Dependent Types thing.

Gabriel Cuvillier 2010-08-23 07:18:01

@Gabriel: I see I went a bit fast in the middle there. Even with a fairly restricted subset of expressions available inside types, it can be hard to have a usable programming language with dependent types (see my edit). You're right about enforcing assertions, that's the big motivation behind dependent types. (Mind you, it's the big motivation behind types of any kind.)

Gilles 2010-08-23 22:38:01

ansaurus

tags:

views:

answers:

Is there a programming language where types can be parametrized by values?

related questions