Lambda Calculus
Lambda Calculus
Lambda Calculus
1
Lambda calculus
The lowercase lambda, the 11th letter of
the Greek alphabet, is used to symbolize
the lambda calculus.
Lambda calculus (also written as -calculus) is a formal system in
mathematical logic and computer science for expressing computation based
on function abstraction and application using variable binding and
substitution. First formulated by Alonzo Church to formalize the concept of
effective computability, lambda calculus found early successes in the area of
computability theory, such as a negative answer to Hilbert's
Entscheidungsproblem. Lambda calculus is a conceptually simple universal
model of computation (Turing showed in 1937
[1]
that Turing machines
equalled the lambda calculus in expressiveness). The name derives from the
Greek letter lambda () used to denote binding a variable in a function.
Because of the importance of the notion of variable binding and substitution,
there is not just one system of lambda calculus, and in particular there are
typed and untyped variants. Historically, the most important system was the
untyped lambda calculus, in which function application has no restrictions (so
the notion of the domain of a function is not built into the system). In the
ChurchTuring Thesis, the untyped lambda calculus is claimed to be capable
of computing all effectively calculable functions. The typed lambda calculus
is a variety that restricts function application, so that functions can only be
applied if they are capable of accepting the given input's "type" of data.
Today, the lambda calculus has applications in many different areas in mathematics, philosophy,
[2]
linguistics, and
computer science. It is still used in the area of computability theory, although Turing machines are also an important
model for computation. Lambda calculus has played an important role in the development of the theory of
programming languages. Counterparts to lambda calculus in computer science are functional programming
languages, which essentially implement the lambda calculus (augmented with some constants and datatypes).
Beyond programming languages, the lambda calculus also has many applications in proof theory. A major example
of this is the CurryHoward correspondence, which gives a correspondence between different systems of typed
lambda calculus and systems of formal logic.
Lambda calculus in history of mathematics
The lambda calculus was introduced by mathematician Alonzo Church in the 1930s as part of an investigation into
the foundations of mathematics.
[3][4]
The original system was shown to be logically inconsistent in 1935 when
Stephen Kleene and J. B. Rosser developed the KleeneRosser paradox.
Subsequently, in 1936 Church isolated and published just the portion relevant to computation, what is now called the
untyped lambda calculus.
[5]
In 1940, he also introduced a computationally weaker, but logically consistent system,
known as the simply typed lambda calculus.
[6]
Lambda calculus
2
Informal description
Motivation
Computable functions are a fundamental concept within computer science and mathematics. The -calculus provides
a simple semantics for computation, enabling properties of computation to be studied formally. The -calculus
incorporates two simplifications that make this semantics simple. The first simplification is that the -calculus treats
functions "anonymously", without giving them explicit names. For example, the function
can be rewritten in anonymous form as
(read as "the pair of and is mapped to "). Similarly,
can be rewritten in anonymous form as , where the input is simply mapped to itself.
The second simplification is that the -calculus only uses functions of a single input. An ordinary function that
requires two inputs, for instance the function, can be reworked into an equivalent function that accepts a
single input, and as output returns another function, that in turn accepts a single input. For example,
can be reworked into
This method, known as currying, transforms a function that takes multiple arguments into a chain of functions each
with a single argument.
Function application of the function to the arguments (5, 2), yields at once
,
whereas evaluation of the curried version requires one more step
to arrive at the same result.
The lambda calculus
The lambda calculus consists of a language of lambda terms, which is defined by a certain formal syntax, and a set
of transformation rules, which allow manipulation of the lambda terms. These transformation rules can be viewed as
an equational theory or as an operational definition.
As described above, all functions in the lambda calculus are anonymous functions, having no names. They only
accept one input variable, with currying used to implement functions with several variables.
Lambda calculus
3
Lambda terms
The syntax of the lambda calculus defines some expressions as valid lambda calculus expression and some as
invalid, just as some strings of characters are valid C programs and some are not. A valid lambda calculus expression
is called a "lambda term".
The following three rules give an inductive definition that can be applied to build all syntactically valid lambda
terms:
a variable, , is itself a valid lambda term
if is a lambda term, and is a variable, then is a lambda term (called a lambda abstraction);
if and are lambda terms, then is a lambda term (called an application).
Nothing else is a lambda term. Thus a lambda term is valid if and only if it can be obtained by repeated application
of these three rules. However, some parentheses can be omitted according to certain rules. For example, the
outermost parentheses are usually not written.
A lambda abstraction is a definition of an anonymous function that is capable of taking a single input and
substituting it into the expression . It thus defines an anonymous function that takes x and returns t. For example
is a lambda abstraction for the function using the term for t. The definition
of a function with a lambda abstraction merely "sets up" the function but does not invoke it. The abstraction binds
the variable in the term .
An application represents the application of a function to an input , that is, it represents the act of calling
function t on input s to produce .
There is no concept in lambda calculus of variable declaration. In a definition such as (i.e.
), the lambda calculus treats y as a variable that is not yet defined. The lambda abstraction
is syntactically valid, and represents a function that adds its input to the yet-unknown y.
Bracketing may be used and may be needed to disambiguate terms. For example, and
denote different terms (although coincidentally reduce to the same value.)
Functions that operate on functions
In lambda calculus, functions are taken to be 'first class values', so functions may be used as the inputs, or be
returned as outputs from other functions.
For example, represents the identity function, , and represents the identity function
applied to . Further, represents the constant function , the function that always returns , no
matter the input. In lambda calculus, function application is regarded as left-associative, so that means .
There are several notions of "equivalence" and "reduction" that allow lambda terms to be "reduced" to "equivalent"
lambda terms.
Alpha equivalence
A basic form of equivalence, definable on lambda terms, is alpha equivalence. It captures the intuition that the
particular choice of a bound variable, in a lambda abstraction, does not (usually) matter. For instance, and
are alpha-equivalent lambda terms, and they both represent the same function (the identity function). The
terms and are not alpha-equivalent, because they are not bound in a lambda abstraction. In many presentations,
it is usual to identify alpha-equivalent lambda terms.
The following definitions are necessary in order to be able to define beta reduction.
Lambda calculus
4
Free variables
The free variables of a term are those variables not bound by a lambda abstraction. The set of free variables of an
expression is defined inductively:
The free variables of are just
The set of free variables of is the set of free variables of , but with removed
The set of free variables of are the union of the set of free variables of and the set of free variables of .
For example, the lambda term representing the identity has no free variables, but the function has
a single free variable, .
Capture-avoiding substitutions
Suppose , and are lambda terms and and are variables. The notation indicates substitution
of for in in a capture-avoiding manner. This is defined so that:
;
if ;
;
;
if and is not in the free variables of . The variable is said
to be "fresh" for .
For example, , and
.
The freshness condition (requiring that is not in the free variables of ) is crucial in order to ensure that
substitution does not change the meaning of functions. For example, a substitution is made that ignores the freshness
condition: . This substitution turns the constant function
into the identity by substitution.
In general, failure to meet the freshness condition can be remedied by alpha-renaming with a suitable fresh variable.
For example, switching back to our correct notion of substitution, in the lambda abstraction can be
renamed with a fresh variable , to obtain , and the meaning of
the function is preserved by substitution.
Beta reduction
The beta reduction rule states that an application of the form reduces to the term . The notation
is used to indicate that beta reduces to . For example, for every ,
. This demonstrates that really is the identity. Similarly,
, which demonstrates that is a constant function.
The lambda calculus may be seen as an idealised functional programming language, like Haskell or Standard ML.
Under this view, beta reduction corresponds to a computational step. This step can be repeated by additional beta
conversions until there are no more applications left to reduce. In the untyped lambda calculus, as presented here,
this reduction process may not terminate. For instance, consider the term . Here
.
That is, the term reduces to itself in a single beta reduction, and therefore the reduction process will never terminate.
Another aspect of the untyped lambda calculus is that it does not distinguish between different kinds of data. For
instance, it may be desirable to write a function that only operates on numbers. However, in the untyped lambda
calculus, there is no way to prevent a function from being applied to truth values, strings, or other non-number
objects.
Lambda calculus
5
Formal definition
Definition
Lambda expressions are composed of
variables v
1
, v
2
, ..., v
n
, ...
the abstraction symbols lambda '' and dot '.'
parentheses ( )
The set of lambda expressions, , can be defined inductively:
1. If x is a variable, then x
2. If x is a variable and M , then (x.M)
3. If M, N , then (M N)
Instances of rule 2 are known as abstractions and instances of rule 3 are known as applications.
Notation
To keep the notation of lambda expressions uncluttered, the following conventions are usually applied.
Outermost parentheses are dropped: M N instead of (M N)
Applications are assumed to be left associative: M N P may be written instead of ((M N) P)
The body of an abstraction extends as far right as possible: x.M N means x.(M N) and not (x.M) N
A sequence of abstractions is contracted: x.y.z.N is abbreviated as xyz.N
Free and bound variables
The abstraction operator, , is said to bind its variable wherever it occurs in the body of the abstraction. Variables
that fall within the scope of an abstraction are said to be bound. All other variables are called free. For example, in
the following expression y is a bound variable and x is free: y.x x y. Also note that a variable is bound by its
"nearest" abstraction. In the following example the single occurrence of x in the expression is bound by the second
lambda: x.y (x.z x)
The set of free variables of a lambda expression, M, is denoted as FV(M) and is defined by recursion on the structure
of the terms, as follows:
1. 1. FV(x) = {x}, where x is a variable
2. 2. FV(x.M) = FV(M) \ {x}
3. FV(M N) = FV(M) FV(N)
An expression that contains no free variables is said to be closed. Closed lambda expressions are also known as
combinators and are equivalent to terms in combinatory logic.
Reduction
The meaning of lambda expressions is defined by how expressions can be reduced.
[7]
There are three kinds of reduction:
-conversion: changing bound variables (alpha);
-reduction: applying functions to their arguments (beta);
-conversion: which captures a notion of extensionality (eta).
We also speak of the resulting equivalences: two expressions are -equivalent, if they can be -converted into the
same expression, and /-equivalence are defined similarly.
Lambda calculus
6
The term redex, short for reducible expression, refers to subterms that can be reduced by one of the reduction rules.
For example, (x.M) N is a beta-redex in expressing N is substituted for x in M; if x is not free in M, x.M x is
an eta-redex. The expression to which a redex reduces is called its reduct; using the previous example, the reducts of
these expressions are respectively M[x:=N] and M.
-conversion
Alpha-conversion, sometimes known as alpha-renaming, allows bound variable names to be changed. For example,
alpha-conversion of x.x might yield y.y. Terms that differ only by alpha-conversion are called -equivalent.
Frequently in uses of lambda calculus, -equivalent terms are considered to be equivalent.
The precise rules for alpha-conversion are not completely trivial. First, when alpha-converting an abstraction, the
only variable occurrences that are renamed are those that are bound to the same abstraction. For example, an
alpha-conversion of x.x.x could result in y.x.x, but it could not result in y.x.y. The latter has a
different meaning from the original.
Second, alpha-conversion is not possible if it would result in a variable getting captured by a different abstraction.
For example, if we replace x with y in x.y.x, we get y.y.y, which is not at all the same.
In programming languages with static scope, alpha-conversion can be used to make name resolution simpler by
ensuring that no variable name masks a name in a containing scope (see alpha renaming to make name resolution
trivial).
Substitution
Substitution, written E[V := R], is the process of replacing all free occurrences of the variable V in the
expression E with expression R. Substitution on terms of the -calculus is defined by recursion on the structure of
terms, as follows (note: x and y are only variables while M and N are any expression).
x[x := N] N
y[x := N] y, if x y
(M
1
M
2
)[x := N] (M
1
[x := N]) (M
2
[x := N])
(x.M)[x := N] x.M
(y.M)[x := N] y.(M[x := N]), if x y, provided y FV(N)
To substitute into a lambda abstraction, it is sometimes necessary to -convert the expression. For example, it is not
correct for (x.y)[y := x] to result in (x.x), because the substituted x was supposed to be free but ended
up being bound. The correct substitution in this case is (z.x), up to -equivalence. Notice that substitution is
defined uniquely up to -equivalence.
-reduction
Beta-reduction captures the idea of function application. Beta-reduction is defined in terms of substitution: the
beta-reduction of ((V.E) E) is E[V := E].
For example, assuming some encoding of 2, 7, , we have the following -reduction: ((n.n2) 7)
72.
-conversion
Eta-conversion expresses the idea of extensionality, which in this context is that two functions are the same if and
only if they give the same result for all arguments. Eta-conversion converts between x.(f x) and f whenever
x does not appear free in f.
Lambda calculus
7
Normal forms and confluence
Main article: Normalization property (abstract rewriting)
For the untyped lambda calculus, -reduction as a rewriting rule is neither strongly normalising nor weakly
normalising.
However, it can be shown that -reduction is confluent. (Of course, we are working up to -conversion, i.e. we
consider two normal forms to be equal, if it is possible to -convert one into the other.)
Therefore, both strongly normalising terms and weakly normalising terms have a unique normal form. For strongly
normalising terms, any reduction strategy is guaranteed to yield the normal form, whereas for weakly normalising
terms, some reduction strategies may fail to find it.
Encoding datatypes
Main articles: Church encoding and MogensenScott encoding
The basic lambda calculus may be used to model booleans, arithmetic, data structures and recursion, as illustrated in
the following sub-sections.
Arithmetic in lambda calculus
There are several possible ways to define the natural numbers in lambda calculus, but by far the most common are
the Church numerals, which can be defined as follows:
0 := f.x.x
1 := f.x.f x
2 := f.x.f (f x)
3 := f.x.f (f (f x))
and so on. Or using the alternative syntax presented above in Notation:
0 := fx.x
1 := fx.f x
2 := fx.f (f x)
3 := fx.f (f (f x))
A Church numeral is a higher-order functionit takes a single-argument function f, and returns another
single-argument function. The Church numeral n is a function that takes a function f as argument and returns the
n-th composition of f, i.e. the function f composed with itself n times. This is denoted f
(n)
and is in fact the n-th
power of f (considered as an operator); f
(0)
is defined to be the identity function. Such repeated compositions (of
a single function f) obey the laws of exponents, which is why these numerals can be used for arithmetic. (In
Church's original lambda calculus, the formal parameter of a lambda expression was required to occur at least once
in the function body, which made the above definition of 0 impossible.)
We can define a successor function, which takes a number n and returns n + 1 by adding an additional
application of f,where '(mf)x' means the function 'f' is applied 'm' times on 'x':
SUCC := n.f.x.f (n f x)
Because the m-th composition of f composed with the n-th composition of f gives the m+n-th composition of f,
addition can be defined as follows:
PLUS := m.n.f.x.m f (n f x)
PLUS can be thought of as a function taking two natural numbers as arguments and returning a natural number; it
can be verified that
Lambda calculus
8
PLUS 2 3
and
5
are equivalent lambda expressions. Since adding m to a number n can be accomplished by adding 1 m times, an
equivalent definition is:
PLUS := m.n.m SUCC n
Similarly, multiplication can be defined as
MULT := m.n.f.m (n f)
Alternatively
MULT := m.n.m (PLUS n) 0
since multiplying m and n is the same as repeating the add n function m times and then applying it to zero.
Exponentiation has a rather simple rendering in Church numerals, namely
POW := b.e.e b
The predecessor function defined by PRED n = n 1 for a positive integer n and PRED 0 = 0 is
considerably more difficult. The formula
PRED := n.f.x.n (g.h.h (g f)) (u.x) (u.u)
can be validated by showing inductively that if T denotes (g.h.h (g f)), then T
(n)
(u.x) =
(h.h(f
(n1)
(x))) for n > 0. Two other definitions of PRED are given below, one using conditionals and the
other using pairs. With the predecessor function, subtraction is straightforward. Defining
SUB := m.n.n PRED m,
SUB m n yields m n when m > n and 0 otherwise.
Logic and predicates
By convention, the following two definitions (known as Church booleans) are used for the boolean values TRUE
and FALSE:
TRUE := x.y.x
FALSE := x.y.y
(Note that FALSE is equivalent to the Church numeral zero defined above)
Then, with these two -terms, we can define some logic operators (these are just possible formulations; other
expressions are equally correct):
AND := p.q.p q p
OR := p.q.p p q
NOT := p.a.b.p b a
IFTHENELSE := p.a.b.p a b
We are now able to compute some logic functions, for example:
AND TRUE FALSE
(p.q.p q p) TRUE FALSE
TRUE FALSE TRUE
(x.y.x) FALSE TRUE
FALSE
and we see that AND TRUE FALSE is equivalent to FALSE.
A predicate is a function that returns a boolean value. The most fundamental predicate is ISZERO, which returns
TRUE if its argument is the Church numeral 0, and FALSE if its argument is any other Church numeral:
Lambda calculus
9
ISZERO := n.n (x.FALSE) TRUE
The following predicate tests whether the first argument is less-than-or-equal-to the second:
LEQ := m.n.ISZERO (SUB m n),
and since m = n, if LEQ m n and LEQ n m, it is straightforward to build a predicate for numerical equality.
The availability of predicates and the above definition of TRUE and FALSE make it convenient to write
"if-then-else" expressions in lambda calculus. For example, the predecessor function can be defined as:
PRED := n.n (g.k.ISZERO (g 1) k (PLUS (g k) 1)) (v.0) 0
which can be verified by showing inductively that n (g.k.ISZERO (g 1) k (PLUS (g k) 1))
(v.0) is the add n 1 function for n > 0.
Pairs
A pair (2-tuple) can be defined in terms of TRUE and FALSE, by using the Church encoding for pairs. For example,
PAIR encapsulates the pair (x,y), FIRST returns the first element of the pair, and SECOND returns the second.
PAIR := x.y.f.f x y
FIRST := p.p TRUE
SECOND := p.p FALSE
NIL := x.TRUE
NULL := p.p (x.y.FALSE)
A linked list can be defined as either NIL for the empty list, or the PAIR of an element and a smaller list. The
predicate NULL tests for the value NIL. (Alternatively, with NIL := FALSE, the construct l
(h.t.z.deal_with_head_h_and_tail_t) (deal_with_nil) obviates the need for an explicit
NULL test).
As an example of the use of pairs, the shift-and-increment function that maps (m, n) to (n, n + 1) can be
defined as
:= x.PAIR (SECOND x) (SUCC (SECOND x))
which allows us to give perhaps the most transparent version of the predecessor function:
PRED := n.FIRST (n (PAIR 0 0)).
Recursion and fixed points
Main article: Fixed-point combinator
See also: SKI combinator calculus Self-application_and_recursion
Recursion is the definition of a function using the function itself; on the face of it, lambda calculus does not allow
this (we can't refer to a value which is yet to be defined, inside the lambda term defining that same value, as all
functions are anonymous in lambda calculus). However, this impression is misleading: in (x.x x) y both x 's
refer to the same lambda term, y, so it is possible for a lambda expression here y to be arranged to receive itself
as its argument value, through self-application.
Consider for instance the factorial function F(n) recursively defined by
F(n) = 1, if n = 0; else n F(n 1).
In the lambda expression which is to represent this function, a parameter (typically the first one) will be assumed to
receive the lambda expression itself as its value, so that calling it applying it to an argument will amount to
recursion. Thus to achieve recursion, the intended-as-self-referencing argument (called r here) must always be
passed to itself within the function body, at a call point:
Lambda calculus
10
G := r. n.(1, if n = 0; else n (r r (n1)))
with r r x = F x = G r x to hold, so r = G and
F := G G = (x.x x) G
The self-application achieves replication here, passing the function's lambda expression on to the next invocation as
an argument value, making it available to be referenced and called there.
This solves it but requires re-writing each recursive call as self-application. We would like to have a generic
solution, without a need for any re-writes:
G := r. n.(1, if n = 0; else n (r (n1)))
with r x = F x = G r x to hold, so r = G r =: FIX G and
F := FIX G where FIX g := (r where r = g r) = g (FIX g)
so that FIX G = G (FIX G) = (n.(1, if n = 0; else n ((FIX G)
(n1))))
Given a lambda term with first argument representing recursive call (e.g. G here), the fixed-point combinator FIX
will return a self-replicating lambda expression representing the recursive function (here, F). The function does not
need to be explicitly passed to itself at any point, for the self-replication is arranged in advance, when it is created, to
be done each time it is called. Thus the original lambda expression (FIX G) is re-created inside itself, at
call-point, achieving self-reference.
In fact, there are many possible definitions for this FIX operator, the simplest of them being:
Y := g.(x.g (x x)) (x.g (x x))
In the lambda calculus, Y g is a fixed-point of g, as it expands to:
Y g
h.((x.h (x x)) (x.h (x x))) g
(x.g (x x)) (x.g (x x))
g ((x.g (x x)) (x.g (x x)))
g (Y g)
Now, to perform our recursive call to the factorial function, we would simply call (Y G) n, where n is the number
we are calculating the factorial of. Given n = 4, for example, this gives:
(Y G) 4
G (Y G) 4
(r.n.(1, if n = 0; else n (r (n1)))) (Y G) 4
(n.(1, if n = 0; else n ((Y G) (n1)))) 4
1, if 4 = 0; else 4 ((Y G) (41))
4 (G (Y G) (41))
4 ((n.(1, if n = 0; else n ((Y G) (n1)))) (41))
4 (1, if 3 = 0; else 3 ((Y G) (31)))
4 (3 (G (Y G) (31)))
4 (3 ((n.(1, if n = 0; else n ((Y G) (n1)))) (31)))
4 (3 (1, if 2 = 0; else 2 ((Y G) (21))))
4 (3 (2 (G (Y G) (21))))
4 (3 (2 ((n.(1, if n = 0; else n ((Y G) (n1)))) (21))))
4 (3 (2 (1, if 1 = 0; else 1 ((Y G) (11)))))
Lambda calculus
11
4 (3 (2 (1 (G (Y G) (11)))))
4 (3 (2 (1 ((n.(1, if n = 0; else n ((Y G) (n1)))) (11)))))
4 (3 (2 (1 (1, if 0 = 0; else 0 ((Y G) (01))))))
4 (3 (2 (1 (1))))
24
Every recursively defined function can be seen as a fixed point of some suitably defined function closing over the
recursive call with an extra argument, and therefore, using Y, every recursively defined function can be expressed as
a lambda expression. In particular, we can now cleanly define the subtraction, multiplication and comparison
predicate of natural numbers recursively.
Standard terms
Certain terms have commonly accepted names:
I := x.x
K := x.y.x
S := x.y.z.x z (y z)
B := x.y.z.x (y z)
C := x.y.z.x z y
W := x.y.x y y
U := x.x x
:= x.x x
:=
Y := g.(x.g (x x)) (x.g (x x))
Typed lambda calculi
Main article: Typed lambda calculus
A typed lambda calculus is a typed formalism that uses the lambda-symbol ( ) to denote anonymous function
abstraction. In this context, types are usually objects of a syntactic nature that are assigned to lambda terms; the exact
nature of a type depends on the calculus considered (see kinds below). From a certain point of view, typed lambda
calculi can be seen as refinements of the untyped lambda calculus but from another point of view, they can also be
considered the more fundamental theory and untyped lambda calculus a special case with only one type.
Typed lambda calculi are foundational programming languages and are the base of typed functional programming
languages such as ML and Haskell and, more indirectly, typed imperative programming languages. Typed lambda
calculi play an important role in the design of type systems for programming languages; here typability usually
captures desirable properties of the program, e.g. the program will not cause a memory access violation.
Typed lambda calculi are closely related to mathematical logic and proof theory via the CurryHoward isomorphism
and they can be considered as the internal language of classes of categories, e.g. the simply typed lambda calculus is
the language of Cartesian closed categories (CCCs).
Lambda calculus
12
Computable functions and lambda calculus
A function F: N N of natural numbers is a computable function if and only if there exists a lambda expression f
such that for every pair of x, y in N, F(x)=y if and only if f x=