Fun with Functional Dependencies <BR> <SMALL>or (Draft) Types as Values in Static Computations in Haskell</SMALL>

Abstract

This paper illustrates how Haskell's type class system can be used to express computations. Since computations on the type level are performed by the type checker, these computations are static (i.e., performed at compile-time), and, since the type system is decidable, they always terminate. Haskell thus provides a means to express static computations, and has a clear distinction between static and dynamic computations.

Instance declarations define predicates over types, or in the case of multi-parameter classes, relations between types. With functional dependencies, multi-parameter classes directly specify functions, and thanks to them you can get the type checker to compute the values of function applications, rather than just checking that the result of an application is what you say it is.

This way of expressing computation gives us the power of a small, first-order functional programming language, with pattern matching and structural recursion. We can easily define things like booleans, natural numbers, lists, and functions over these types. We give some examples of completely static computations, the most elaborate one being an implementation of insertion sort. We also give examples where static and dynamic computations are mixed.

Introduction

Concepts, such as programs, programming languages, computations, values and types, are probably familiar to most readers of this paper. But, to make a long story short, programming languages are used to express computations. Computations manipulate values. Typed programming languages distinguish between types and values. Types are related to values by a typing relation that says what values belong to what types, so one usually think of types as sets of values. Expressions, and other program parts, can be assigned types too, to indicate what kind of values the produce or manipulate. Types can thus be used do document programs (to clarify what kind of values are involved in a certain part of the program) and to help detect programmer mistakes.

In statically typed languages, the types are not seen as something that take part in computations, but rather something that allows a compiler to check that a program is type correct without actually running the program.

Seeing types as a way to organize values, one can ask the question if it would be meaningful to have a similar way to organize types? The answer is yes, and different programming languages have different ways to organize types. Most widely known is probably the way types are organized in class hierarchies in object oriented programming languages.

Haskell [haskell-homepage] also has a class system [wadler:overload] to organize types, originally introduced to allow a systematic treatment of overloading. Haskell classes are not quite like classes in object oriented languages: the relation between types and classes is similar to the relation between values and types, i.e., types can belong to classes. For example, the class Eq is used to group types that allow their values to be tested for equality and the Show class contain types whose values can be converted to strings.

The interesting observation for the following is that in Haskell we have three levels on which things are described. On the ground level we have values. The values belong to types, which form the second level, and the types belong to classes, which form the third level. We thus have two relations, one between values and types and one between types and classes. In the next section, we make some reflections on the similarities and differences between these two relations.

Values and types vs types and classes

Haskell has, unsurprisingly, ways to introduce values, types and classes, and to create relations between them.

Values and types are introduced together in data-type declarations. For example, the definition

Classes and their relations to types are introduced in a slightly different way. Classes are introduces by stating their names and parameters and giving the types of the overloaded operations that types belonging to the class should support. As an example, a class for types that support equality could be introduced with the following declaration:

To declare that booleans can be tested for equality we would give the following declaration:

Type definitions can be parameterized. A typical example is the definition of the list type, where the type parameter give the type of the elements of a list:

When parameterized types are declared as instances of classes, it is often useful to make some assumptions about the parameter types. For example, to define how lists are tested for equality, we need to refer to the equality test for the elements of the list. Instance declarations of this kind look like this:

An instance relation like this can be seen as a computation rule, that given an equality test for an arbitrary type, for example Bool, gives us an equality test for lists containing values of that type, for example List Bool. As we will see later, this gives us a way to express computations on the type level.

Computations

Dynamic computation

In Haskell, computations are usually expressed as functions from values to values. For example, if we define natural numbers (and an abbreviation for a sample number) as

In an interactive Haskell system, such as Hugs [hugs98], we can then ask for expressions to be computed:

Static computation

As mentioned earlier, some instance declarations in Haskell can be seen as computation rules. Since Haskell is statically typed, the computations expressed in this way will be static, i.e., performed at compile-time.

Predicates

To define what even and odd numbers are, a Prolog programmer could define the following predicates:

Note that the Prolog constants and predicates become types and classes, respectively. (Here, since we are not interested in values, but only types and classes, we have defined data types without any constructors and classes without any overloaded operations.) We then define the predicates using instance declarations:

The question now is: how do we ask the Haskell system to check if a number is even? The computations are performed by the type checker, and in Hugs, the only way to make the type checker work for us is to ask it to compute the type of an expression, or to check that an expression has a given type. Although the definitions given above are enough to express the desired computation, for practical reasons we have to make a small addition to them:

We are now saying that, if n is a type representing an even number, then there is an element of n, which can be referred to by the name isEven. The instance declarations can be left unchanged.

We got a type error because three is not an even number. An interpretation of the last line is that if zero were odd, then three would be even.

Relations

If a Prolog programmer wanted to define a relation corresponding more directly to the functions even and odd in section dynamic, the result would probably be the following:

Using multi-parameter classes [peyton-jones:multi-parameter-classes] we can again make a rather direct Haskell transcription. We start by declaring the constants we haven't used before:

And again, for practical reasons, we have included overloaded operations in the classes, although we are only interested in the types.

The queries now look a bit more complicated. In the first example, we asked if Three is related to True by the relation Odd, and Hugs replied that, indeed, that is the case. In the second example, we ask in the same way if three is false, and Hugs says that this would have be the case, if the program had contained an instance declaration like

Hugs' reply means that the result of applying Odd to Three can be any type a, provided the program contains instance declarations allowing us to derive that Odd (Succ (Succ (Succ Zero))) a holds. Hugs does not try to enumerate possible values of a, like a Prolog system would. With the given instance declarations, the only possible value for a is True, but since the instance relation is open, it is seen as a premature commitment to say that a must be True.

Functions

This says that the relation Even n b is actually a function from n to b. This prevents us from at the same time declaring both Even Zero True and Even Zero False, and allows b to be computed if n is a known number:

Now, having seen that these strange looking definitions actually can be used to compute something, we perhaps feel more motivated to go on and define some more functions on natural numbers. The following dynamic ones,

Note that we also introduced u as a convenient abbreviation of undefined. We can try some static additions and multiplications:

Note that the command :type asks Hugs to just infer the type of an expression, not to compute its value. No ordinary, dynamic Haskell computations are performed in the above examples.

Mixing static and dynamic computations

We have now seen that Haskell allows us to define dynamic functions (section dynamic), i.e., computations to be performed at run-time, and static functions (section static-functions), i.e., computations to be performed at compile-time. Can we mix the two, and define functions that are computed partly at compile-time, partly at run-time? The answer is: yes, definitely. It actually happens all the time, when overloaded functions are used in ordinary Haskell programs. Or, to be more precise, the compiler has the opportunity to perform some computations at compile-time, but can also choose to delay most of the work until run-time [jones:dictionary-free].

A common example used to illustrate static vs dynamic computations is the power function. The dynamic version could be defined as

Using the Haskell type Int for the dynamic part of the computation, we can define a version of the power function, where the base is dynamic and the exponent is static, as follows:

This simple example might seem a bit pointless in an interactive environment where compile-time and run-time coincide. The computation proceeds roughly as follows:

With an optimizing compiler, and the same function is used repeatedly, the possibility to move computations to compile-time could of course give a considerable speed-up.

A larger example of static computation

In the above sections we have presented a way to express static computations in Haskell, using the class system. We now show that this way of expressing static computations is not limited to the rather simple algorithms we have seen so far. We start with a representation of lists and conclude with an implementation of insertion sort.

Unfortunately, Hugs' type checker doesn't reduce the types as far as expected. The reason for this is at the time of this writing unknown...

Concluding remarks

The particular use of type classes explored in this paper are perhaps of the more esoteric kind, and probably not what they were intended for. But, as many people have already discovered, multi-parameter classes with functional dependencies can be very useful for more conventional programming tasks as well.

Haskell 98 [haskell98], the most recent version of Haskell, does not include multi-parameter classes and functional dependencies. GHC [ghc-webpage] and Hugs [hugs98] support these extensions to varying degree, though.

It appears that the limits of what can be done within Haskell-like type systems are yet to be found. Two recent examples of other tricks that seem to stretch the limits are [okasaki99:square_matrices] and [weirich00:type_safe_cast].

References

ghc00 The Glasgow Haskell Compiler.: http://www.haskell.org/ghc/, 2000.
JHe^+99 S. Peyton Jones, J. Hughes, (editors), L. Augustsson, D. Barton, B. Boutel, W. Burton, J. Fasel, K. Hammond, R. Hinze, P. Hudak, T. Johnsson, M. Jones, J. Launchbury, E. Meijer, J. Peterson, A. Reid, C. Runciman, and P. Wadler.: Report on the Programming Language Haskell 98, a Non-strict, Purely Functional Language.; Available from http://haskell.org, February 1999.
JJM97 Simon Peyton Jones, Mark Jones, and Erik Meijer.: Type classes: exploring the design space.; In Haskell Workshop. ACM SIGPLAN, 1997.
Jon94 Mark P. Jones.: Dictionary-free Overloading by Partial Evaluation.; In ACM SIGPLAN Workshop on Partial Evaluation and Semantics-Based Program Manipulation, Orlando, Florida, June 1994.
Jon00a Mark P. Jones.: Hugs 98.; http://www.haskell.org/hugs/, February 2000.
Jon00b Mark P. Jones.: Type Classes with Functional Dependencies.; In Proceedings of the 9th European Symposium on Programming, ESOP 2000, number 1782 in LNCS, Berlin, Germany, March 2000. Springer-Verlag.
Oka99 Chris Okasaki.: From Fast Exponentiation to Square Matrices: An Adventure in Types.; In International Conference on Functional Programming, pages 28--35, Paris, France, September 1999.
Pet97 J. Peterson.: The Haskell Home Page.; www.haskell.org, 1997.
WB89 P. Wadler and S. Blott.: How to make ad hoc polymorphism less ad hoc.; In Proceedings 1989 Symposium Principles of Programming Languages, pages 60--76, Austin, Texas, 1989.
Wei00 Stephanie Weirich.: Type-safe cast.; In International Conference on Functional Programming, Montréal, Canada, September 2000.

Fun with Functional Dependencies
or (Draft) Types as Values in Static Computations in Haskell

Thomas Hallgren

Abstract

Introduction

Values and types vs types and classes

Computations

Dynamic computation

Static computation

Predicates

Relations

Functions

Mixing static and dynamic computations

A larger example of static computation

Concluding remarks

References

Fun with Functional Dependencies or (Draft) Types as Values in Static Computations in Haskell

Thomas Hallgren

Abstract

Introduction

Values and types vs types and classes

Computations

Dynamic computation

Static computation

Predicates

Relations

Functions

Mixing static and dynamic computations

A larger example of static computation

Concluding remarks

References

Fun with Functional Dependencies
or (Draft) Types as Values in Static Computations in Haskell