
Automation of Higher-Order Logic

Authors: Christoph Benzmüller and Dale Miller
Readers: Peter Andrews, Jasmin Blanchette, William Farmer, and Herman
Geuvers
Venue: The Handbook of the History of Logic, eds. D. Gabbay & J. Woods
Volume 9: Logic and Computation, editor Jörg Siekmann
Contents

1 Introduction
1.1 Formalizing set comprehension as λ-abstraction
1.2 Packing more into inference rules
1.3 Plan of this chapter
2 Formalization of quantificational logic
2.1 Earliest work on higher-order logic
2.2 Different notions of higher-order logic
3 Church’s simple theory of types (classical higher-order logic)
3.1 The λ-calculus as computation (middle and late 1930s)
3.2 Mixing λ-calculus and logic
3.3 Simple types and typed λ-terms
3.4 Formulas as terms of type o
3.5 Elementary type theory
3.6 Simple type theory
3.7 Variants of elementary and simple type theory
3.8 An example
3.9 Church used different syntax not adopted here
4 Meta-theory
4.1 Semantics and cut-elimination
4.2 Cut-simulation properties
4.3 Higher-order substitutions and normal forms
4.4 Encodings of higher-order logic into first-order logic
5 Skolemization and unification
5.1 Skolemization
5.2 Unification of simply typed λ-terms
5.3 Mixed prefix unification problems
5.4 Pattern unification
5.5 Practical considerations
6 Challenges for the automation
6.1 Instantiation of predicate variables
6.2 Induction invariants
6.3 Equality, extensionality, and choice
7 Automated theorem provers
7.1 Early systems
7.2 The TPTP THF initiative
7.3 TPTP THF0 compliant higher-order theorem provers
7.4 Recent applications of automated THF0 provers
8 Conclusion

Preprint submitted to Elsevier February 3, 2014
1. Introduction
Early efforts to formalize mathematics in order to make parts of it more rig-
orous and to show its consistency started with the codification of parts of logic.
There was work on the logical connectives by, for example, Boole and Peirce,
and later work to additionally formalize quantifiers (Frege, Church, Gentzen,
etc.). Once the basic concepts of logic—logical connectives and (first-order)
quantification—were formalized and proved consistent (by, say, Gödel’s com-
pleteness theorem and Gentzen’s cut-elimination theorem), logicians turned to
the formalization of the objects of mathematics, such as real numbers, sets,
groups, etc., by building them on top of logic.
There are several ways to undertake such formalizations. One early and
successful approach involved building various theories for, say, Zermelo-Fraenkel
set theory, as first-order logic extended with axioms postulating the existence
of certain sets from the existence of other sets. Instead of sets, one could also
explore the use of algebra and universal properties to develop a categorical
theory of mathematics.
This chapter addresses still another approach to formalizing mathematical
concepts on top of quantificational logic: one can generalize first-order quantifi-
cation to higher-order. In the syntax of first-order logic, there are terms and
predicates: the terms denote individuals of some intended domain of discourse,
and predicates denote subsets of that domain. Inspired by set theory, it
is also natural to ask if certain predicates hold of other predicates, e.g., is a
given binary relation transitive or not. Other natural notions, such as Leibniz’s
equality—which states that x is equal to y if every predicate true of x is also
true of y—would naturally be written as the formula ∀P (P x ⊃ P y).¹
¹ Occasionally we use dot notation in this paper to better separate the body of quantified
formulas or λ-abstractions from the binder, and parentheses may be avoided if the formula
structure is obvious. An alternative notation for ∀P. P x ⊃ P y would thus be ∀P (P x ⊃ P y).
Such higher-order quantification was developed first by Frege and then by
Russell in his ramified theory of types, which was later simplified by others,
including Chwistek and Ramsey, Carnap, and finally Church in his simple theory
of types (STT), also referred to as classical higher-order logic.
1.1. Formalizing set comprehension as λ-abstraction
Church’s STT (Church, 1940), which is the focus of this chapter, bases
both terms and formulas on simply typed λ-terms, and the equality of terms
and formulas is given by the equality of such λ-terms. The use of the λ-calculus
had at least two major advantages. First, λ-abstraction allowed naming sets
and predicates given as formulas, something that is achieved in set theory via
the comprehension axioms. For example, if ϕ(x) is a formula with one free
variable x, then the set comprehension axiom provides the existence of the set
{x ∈ A | ϕ(x)}, for some set A. Typed λ-abstraction achieves this in a simple
step by providing the term λx.ϕ(x): here, the variable x is given a type that, in
principle, can be identified with the same set A. Second, the complex rules for
quantifier instantiation at higher types are completely explained via the rules of
λ-conversion (the so-called rules of α- and β-conversion), which were proposed
earlier by Church (1932, 1936).
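As a brief illustrative sketch (this display is ours: α is a type playing the role of the set A, and ϕ is a schematic formula), the correspondence and the role of β-conversion can be pictured as:

```latex
\{x \in A \mid \varphi(x)\} \;\rightsquigarrow\; \lambda x_{\alpha}.\,\varphi(x)
\qquad\text{and}\qquad
(\lambda x_{\alpha}.\,\varphi(x))\,t \;\rightarrow_{\beta}\; \varphi(t)
```

Membership in the comprehended set thus reduces to β-conversion of an application.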
Higher-order substitution can be seen (from the inference-step point of view)
as one step, but it can have a significant computational influence on a for-
mula: normalization of λ-terms can be explosive (and expressive), in particular
since λ-terms may contain logical connectives and quantifiers. Bindings are also
treated uniformly for all structures and terms that have bindings. For example,
if p is a variable of predicate type and A is the formula ∀p B(p), then the uni-
versal instantiation of A with a term, say, t, namely the formula [t/p]B(p), can
be a formula with many more occurrences of logical connectives and quantifiers
than there are in the formula B(p).
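To make the possible explosion concrete, here is a small hypothetical instance (the constants a, b and the binary predicate q are our own illustration): take B(p) to be p a ∧ p b and instantiate p with a quantified term t.

```latex
t = \lambda z.\,\forall y\,(q\,z\,y \supset q\,y\,z)
\qquad
[t/p](p\,a \wedge p\,b) \;\rightarrow_{\beta}\;
\forall y\,(q\,a\,y \supset q\,y\,a) \;\wedge\; \forall y\,(q\,b\,y \supset q\,y\,b)
```

A single instantiation followed by normalization introduces two quantifiers and two implications that were not present in B(p).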
1.2. Packing more into inference rules
Given that fewer axioms are needed in STT than in axiomatic set theory,
and that the term and formula structure is enriched, some earlier researchers
in automated theorem proving were attracted to STT since traditionally such
early provers were not well suited to deal with large numbers of axioms. If
small theories could be achieved by making the notions of terms and formulas
more complex, this seemed like a natural choice. Thus, if one extended, say,
first-order resolution with a more complex form of unification (this time, on
λ-terms), then one might be addressing theorems that would otherwise need
explicit notions of sets and their axioms.
Gödel (1936) pointed out that higher-order logic (actually, higher-order
arithmetic) can yield much shorter proofs than are possible in first-order logic.
Parikh (1973) proved this result years later: in particular, he proved that there
exist arithmetical formulas that are provable in first-order arithmetic, but whose
shortest proofs in second-order arithmetic are arbitrarily smaller than any proof
in first-order arithmetic. Similarly, Boolos (1987) presented a theorem of first-order logic
comprising about 60 characters but whose shortest proof (allowing the intro-
duction and use of lemmas) is so large that the known size of the universe could
not be used to record it: on the other hand, a proof of a few pages is possible
using higher-order logic.
Church’s approach of merging λ-conversion with inference rules is related to
what is a rather modern approach to making inference rules more expressive
by adding computation to them. One simple approach to allowing large scale
inference is available in most modern interactive theorem provers: libraries of
lemmas are built up and these lemmas can be applied as essentially new infer-
ence steps. Such new inference rules are often hand-designed and may or may
not allow for significant automated uses beyond the obvious “try using all of
them any number of times . . .”. A more modern updating of Church’s merge
of λ-conversion and inference is the work on deduction modulo (Dowek et al.,
2003; Cousineau and Dowek, 2007) where functional programming style com-
putations on formulas and terms are permitted within inference steps. Indeed,
the connection between Church’s approach to using higher-order quantification
and λ-terms can be closely simulated using deduction modulo (Burel, 2011a).
Recent developments in using focused proof systems (Andreoli, 1992; Liang and
Miller, 2009) also allow one to rethink the nature of inference rules and allow
them to be large scale reasoning steps involving, possibly non-deterministic,
computation (Miller, 2011).
1.3. Plan of this chapter
We refer the reader looking for more details about higher-order logic and
STT to the textbook of Andrews (2002) and to the handbook and encyclope-
dia articles by Andrews (2001), Andrews (2009), Enderton (2012), and Leivant
(1994). Another recommended article has been contributed by Farmer (2008).
Here we shall focus on the issues surrounding theorem proving, particularly,
automated theorem proving in subsets and variants of Church’s STT. In par-
ticular, in Section 2, we describe some of the history and the background to a
treatment of quantification and the closely associated notions of binding and
substitution. In Section 3, we present the technical details of STT. The meta-
theory of STT, including general models and cut-elimination, are addressed in
Section 4. Skolemization, unification, pre-unification, and pattern unification,
which are central for proof automation, are discussed in Section 5. Section 6
then addresses core challenges for higher-order automated theorem provers, such
as substitutions for predicate variables and the automation of extensionality. An
overview of implemented higher-order theorem proving systems is presented in
Section 7. The focus there is on interactive and, most importantly,
automated provers for (fragments of) STT. The latter systems have made sig-
nificant progress, in particular in the last few years.
2. Formalization of quantificational logic
Quantificational expressions are key phenomena of natural language and
their treatment has been widely studied by linguists and logicians. A core inter-
est has been to appropriately match informal use of quantificational expressions
in natural language(s) with their formal treatment in logic formalisms.
In this text quantification also plays a pivotal role. However, the focus is on
the widely adopted traditional notion of universal and existential quantification
only.² A crucial question is what kind of objects an existential or universal
quantifier may range over, or, in other words, what kind of objects the universe
may contain. In classical first-order logic these objects are strictly of an elementary
nature, like the person ‘Bob’ or the number ‘5’; we call them first-order objects.
Not allowed are quantifications over properties of objects (these are identified
with sets of objects) or functions. In higher-order logic quantification is not
restricted to only elementary objects and quantification over sets or functions of
objects is generally allowed. Peano’s induction principle for the natural numbers
is a good example. Using quantification over first-order objects (x) and over
properties (i.e., sets) of first-order objects (P), this principle can be elegantly
expressed in higher-order logic by the following axiom:
∀P (P 0 ⊃ (∀x (P x ⊃ P (s x)) ⊃ ∀y P y))
This formula belongs to second-order logic, since the variable P ranges only
over sets of first-order objects. In higher-order logic one may move arbitrarily
up in this hierarchy; that is, quantifications over sets of sets of objects, etc.,
are allowed. First-order and second-order logic are, in this sense, fragments of
higher-order logic.
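As a standard illustration of such a second-order instantiation (our example, not part of the chapter’s axiomatics), one can substitute for P the property of being zero or a successor, P := λz (z = 0 ∨ ∃w (z = s w)); β-reduction then turns the induction axiom into the first-order statement:

```latex
(0 = 0 \vee \exists w\,(0 = s\,w)) \supset
\Bigl(\forall x\,\bigl((x = 0 \vee \exists w\,(x = s\,w)) \supset (s\,x = 0 \vee \exists w\,(s\,x = s\,w))\bigr)
\supset \forall y\,(y = 0 \vee \exists w\,(y = s\,w))\Bigr)
```

Different choices of P yield different first-order consequences, which is exactly what makes finding predicate instantiations central to automation.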
There are significant theoretical and practical differences between first-order
logic and higher-order logic regarding, for example, expressive power and proof
theory. These fundamental differences—some of which will be outlined in the
next sections—have alienated many logicians, mathematicians, and linguists. A
(rather unfortunate) consequence has been that the community largely focused
on the theory and mechanization of first-order logic. In particular, the automation
of higher-order logic, the topic of this article, has often been considered too
challenging, at least until recently.
2.1. Earliest work on higher-order logic
Frege’s (1879) Begriffsschrift is commonly considered to mark the birth of
modern symbolic logic. The Begriffsschrift presents a 2-dimensional formal no-
tation for predicate calculus and develops an adequate and still relevant notion of
formal proof. Frege’s notation for universal quantification appropriately marks
both the bound variable and the scope of quantification. Most importantly,
² Traditional quantification and generalized quantification are contrasted in the text by
Westerståhl (2011); see also the references therein.
quantification in Frege’s notation is not restricted to first-order objects, and in
his notation the above induction axiom can be formalized as:3
[Frege’s two-dimensional Begriffsschrift diagram: assuming P(0) and, for all x, the passage from P(x) to P(s(x)), it asserts P(y) for all y, for every P.]
Thus, quantification over predicate, relation, and function symbols is ex-
plicitly allowed in the language of the Begriffsschrift. A representative example
is Frege’s statement (76) in the Begriffsschrift, which addresses the transitive
closure of a relation f.
The fact that Frege’s logic of the Begriffsschrift is indeed higher-order can
also be retraced by following some of his substitutions for relation symbols.
For example, in his derivation of statement (70), Frege substitutes a relation
symbol f with a function of one argument. In modern notation his concrete
instantiation can be expressed by the lambda term λz (F z ⊃ ∀a (f z a ⊃ F a)). The
support for such higher-order substitutions is another distinctive feature
of higher-order logic. Frege carries out this substitution, and he implicitly ap-
plies normalization. However, he does not give a sufficiently precise definition
of how such substitutions are applied.
It was Bertrand Russell (1902; 1903) who first pointed out that unrestricted
quantification as considered by Frege, in connection with the comprehension
principles,4enables the encoding of paradoxes and leads to inconsistency. The
most prominent example is the set of all sets that do not contain themselves, widely known
as Russell’s paradox. As a possible solution, Russell (1908) suggested a few years
later a ramified theory of types as a basis for the formalization of mathematics
that differentiates between objects and sets (or functions) consisting of these
kinds of objects. On the one hand, Russell was trying to avoid the paradoxes
that had plagued earlier work. Russell attributed some of the paradoxes to a
vicious circle principle in which some mathematical objects are defined in terms
of collections that include the object being defined. In modern terms, Russell
wanted to disallow impredicative definitions. Russell’s types were ramified in
an effort to avoid such impredicativity. On the other hand, Russell was trying
to reduce mathematics to logic. The ramification which avoids impredicative
definitions makes it difficult to encode mathematics. Russell’s solution was to
add an axiom of reducibility which essentially collapses the ramifications and
allows one to make impredicative definitions after all (Ramsey, 1926; Chwistek,
³ Here the concavity binding y over P(y) corresponds to ∀y P y, and the conditional stroke
from P(x) to P(s(x)) corresponds to P x ⊃ P (s x); the rest is obvious.
The vertical bar on the left marks that the entire statement is asserted.
⁴ The comprehension principles assure the existence of abstractions over formula
expressions; an example of a type-restricted comprehension axiom (schema) is
∃u_{α₁→⋯→αₙ→o} ∀x₁ ⋯ ∀xₙ ((u x₁ … xₙ) = B_o).
1948).
The ramified theory of types was subsequently selected by Russell & White-
head as the logical foundation for the Principia Mathematica (Whitehead and
Russell (1910, 1912, 1913)). They shared the philosophical standpoint of logi-
cism, initiated by the work of Frege, and they wanted to show that all math-
ematics can be reduced to logic. The Principia succeeded to a significant ex-
tent, and an impressive number of theorems of set theory and of finite and infinite
arithmetic, for example, were systematically derived from the Principia’s log-
ical foundations. However, the Principia was also criticized for its use of the
axiom of reducibility and its use of the axiom of infinity, which asserts that a
set with infinitely many objects exists. It thus remained debatable what the
Principia actually demonstrated: was it a reduction of mathematics to logic or
a reduction of mathematics to some (controversial) set theory?
In the 1920s, a number of people suggested a simple theory of types as an
alternative to Russell’s ramified type theory.⁵ These suggestions led to the
seminal paper by Church (1940), which will be addressed in some detail in
the next section. The terms simple type theory and classical higher-order logic
typically refer to Church (1940).
It should be remarked that the idea of employing a type hierarchy can,
at least to some extent, be attributed to Frege: in his writings he usually
explicitly mentions the kind of objects—predicates, predicates of predicates,
etc.—that a quantified variable represents (cf. Quine (1940)).
In summary, higher-order logic is an expressive formalism that supports
quantification over predicate, relation, and function variables and that supports
complex substitutions of such variables. Such a rich language has several pitfalls
about which one must be careful. One such pitfall involves providing a technically
precise and sound notion of substitution involving bindings. Another (more
important) pitfall involves the careful treatment of self-referential, impredicative
definitions since these may lead to inconsistencies. A possible solution to the
latter pitfall is to consider syntactical restrictions based on type hierarchies and
to use these to rule out problematic impredicative definitions.
2.2. Different notions of higher-order logic
The notion of higher-order, when applied to logical formalisms, is generally not
as unambiguous as the above text might suggest. We mention below three
different groups of people who appear to use this term in three different ways.
Philosophers of mathematics often distinguish between first-order logic and
second-order logic only. The latter logic, which is used as a formal basis for all of
mathematics, involves quantification over the domain of all possible functions.
In particular Kurt Gödel’s work draws an important theoretical line between
first- and second-order logic. Shortly after proving completeness of first-order
⁵ In Church (1940), Church attributes the simple theory of types to Chwistek, Ramsey,
and, ultimately, Carnap.
logic (1929; 1930), Gödel presented his celebrated first incompleteness theo-
rem (1931). From this theorem it follows that second-order logic is necessarily
incomplete, that is, truth in higher-order logic cannot be recursively axiom-
atized. Thus, higher-order logic interpreted in this sense consists largely of a
model-theoretic study, typically of the standard model of arithmetic (cf. Shapiro
(1985)).
Proof-theoreticians take logic to be synonymous with a formal system that
provides a recursive enumeration of the notion of theoremhood. A higher-order
logic is understood no differently. The distinctive characteristic of such a logic,
instead, is the presence of predicate quantification and of comprehension (i.e.,
the ability to form abstractions over formula expressions). These features, es-
pecially the ability to quantify over predicates, profoundly influence the proof-
theoretic structure of the logic. One important consequence is that the simpler
induction arguments of cut-elimination that are used for first-order logic do not
carry over to the higher-order setting and more sophisticated techniques, such
as the “candidats de réductibilité” due to Jean-Yves Girard (1971), must be
used. Semantical methods can also be employed, but the collection of models
must now include non-standard models that use restricted function spaces in
addition to the standard models used for second-order logic.
Implementers of deduction systems usually interpret higher-order logic as
any computational logic that employs λ-terms and quantification at higher-order
types, although not necessarily at predicate types. Notice that if quantification
is extended only to non-predicate function variables,⁶ then the logic is similar
to a first-order one in that the cut-elimination process can be defined using
an induction involving the sizes of (cut) formulas. However, such a logic may
incorporate a notion of equality based on the rules of λ-conversion, and the
implementation of theorem proving in it must use (some form of) higher-order
unification.
3. Church’s simple theory of types (classical higher-order logic)
3.1. The λ-calculus as computation (middle and late 1930s)
The λ-calculus is usually thought of as a framework for computing functions.
In the setting of STT, however, where the discipline of simple types is applied,
those functions are greatly limited in what they can compute. A typical use
of λ-conversion in STT is to provide the function over syntax that instantiates
a quantified formula with a term. If one wants to describe the function that,
for example, returns the smallest prime divisor of an integer, one would specify
relational specifications of primality, division, etc., and then show that such
relations are, in fact, total functions. Thus, the foundation that STT provides
for mathematics is based on relations: this is in contrast to, say, the function-
centric foundation of Martin-Löf type theory (Martin-Löf, 1982). It is worth
⁶ This is meant to also exclude nested predicates as in F_{(ι→o)→ι}.
pointing out that although typed λ-calculi are considered the quintessential
proof structure for intuitionistic logic, Church, as the father of the λ-calculus,
showed little interest in intuitionistic logic itself: in particular, his development
of STT was based on classical logic.
3.2. Mixing λ-calculus and logic
Church applied his λ-calculus to the description of not only quantificational
structures and higher-order substitution but also many familiar mathematical
constructions. For example, the usual notation for membership, x ∈ P, i.e., “x
is a member of the set P”, can be written instead using the notation P x, which
is familiar from first-order logic, i.e., “the predicate P is true of x”. Thus, the
concept of set is represented not as a primitive concept itself but is constructed
using logic. Of course, to allow interesting constructions involving sets, we need
a higher-order logic that allows predicates to be combined to yield new sets. For
example, if the predicates A and B denote sets, then the expressions λx (Ax ∧ Bx)
and λx (Ax ∨ Bx) denote the intersection and union of the sets described by these
predicates. Furthermore, λC ∀x (Cx ⊃ Ax) describes the power set of A (i.e., the
set of sets C that are subsets of A). We can even use λ-abstractions to make
even more abstractions possible: the notion of set union, for example, can be
defined as the λ-abstraction λA λB λx (Ax ∨ Bx) and the notion of power set
can be λA λC ∀x (Cx ⊃ Ax).
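The predicates-as-sets reading can be mirrored in an ordinary programming language. The following sketch (our own illustration; names like `intersection` and `power_set_of` are not from the chapter) represents a set by its characteristic function and checks the subset quantifier over a finite universe:

```python
# A "set" is its characteristic predicate: a function from elements to bool.

def intersection(A, B):
    # λA λB λx (Ax ∧ Bx)
    return lambda x: A(x) and B(x)

def union(A, B):
    # λA λB λx (Ax ∨ Bx)
    return lambda x: A(x) or B(x)

def power_set_of(A, universe):
    # λA λC ∀x (Cx ⊃ Ax): C belongs to the power set of A iff C ⊆ A.
    # The universal quantifier is checked over a given finite universe.
    return lambda C: all((not C(x)) or A(x) for x in universe)

even = lambda n: n % 2 == 0
small = lambda n: n < 4
universe = range(10)

assert intersection(even, small)(2) and not intersection(even, small)(6)
assert union(even, small)(3)
assert power_set_of(even, universe)(lambda n: n in (0, 2))      # {0, 2} ⊆ evens
assert not power_set_of(even, universe)(lambda n: n in (1, 2))  # {1, 2} ⊄ evens
```

Unlike in STT, the quantifier here is only decidable because the universe is finite; the point is merely that intersection, union, and power set need no set-theoretic primitives once sets are predicates.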
As Kleene and Rosser (1935) discovered, a direct mixing of the λ-calculus
with logic can lead to an inconsistent logic. One of the simplest presentations
of an inconsistency arising from mixing the untyped λ-calculus with (classical)
logic is called Curry’s paradox (Curry, 1942). Let y be a formula and let r be the
λ-abstraction λx (x x ⊃ y). Via λ-conversion, r r is equal and, hence, equivalent to
r r ⊃ y. Hence, we have the two implications r r ⊃ (r r ⊃ y) and (r r ⊃ y) ⊃ r r.
From the former we get (by contracting assumptions) r r ⊃ y, and hence, by
modus ponens with the latter, we know r r. By a further modus ponens step we
thus get y. Since y was an arbitrary formula, we have proved a contradiction.
One way to avoid inconsistencies in a logic extended with the λ-calculus
is to adopt a variation of Russell’s use of types (thereby eliminating the self-
application r r in the above counterexample). When Church modified Russell’s
ramified theory of types to a “simple theory” of types, the core logic of this
chapter was born (Church, 1940). Mixing λ-terms and logic as is done in STT
permits capturing many aspects of set theory without direct reference to axioms
of set theory.
There are costs to using the strict hierarchy of sets enforced by typing: no
set can contain both a member of A and a subset of A. The definition of
subset is based on a given type: asking if a set of integers is a subset of another
such set is necessarily different from asking if a binary relation on integers is a
subset of another such relation.
3.3. Simple types and typed λ-terms
The primitive types are of two kinds: o is the type of propositions and the
rest are the types of basic domains of individuals: thus we are adopting a many-
sorted approach to logic. However, analogously to Church (1940), we shall just
admit one additional primitive type, namely ι, for the domain of individuals,
which corresponds to the domain of first-order variables. (Admitting more than
this one additional primitive type is no challenge.) The set of type expressions
is the least set containing the primitive types and such that (γ → τ) is a type
expression whenever γ and τ are type expressions. Here, types of the form (γ → τ)
denote the type of functions with domain type γ and codomain type τ. If
parentheses are omitted, we shall assume that the arrow constructor associates
to the right: i.e., δ → γ → τ denotes (δ → (γ → τ)). The order ord(τ) of a
type τ is defined by recursion on its structure⁷: if τ is a primitive type, then
ord(τ) = 0; otherwise ord(γ → τ) = max(ord(γ) + 1, ord(τ)). Thus, the order
of ι → ι → ι is 1 and the order of (ι → ι) → ι is 2.
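The order calculation can be written down directly. In this small sketch (the representation is our own), a type is either a primitive name or a tagged pair ('->', γ, τ) standing for γ → τ:

```python
# ord(τ) = 0 for primitive τ; ord(γ → τ) = max(ord(γ) + 1, ord(τ)).

def order(ty):
    if ty in ('o', 'i'):          # primitive types o and ι
        return 0
    _, g, t = ty                  # ('->', g, t) encodes g → t
    return max(order(g) + 1, order(t))

# ι → ι → ι associates to the right, i.e., ι → (ι → ι): order 1.
assert order(('->', 'i', ('->', 'i', 'i'))) == 1
# (ι → ι) → ι takes a function argument: order 2.
assert order(('->', ('->', 'i', 'i'), 'i')) == 2
```

The asymmetry between the two recursive calls is what makes the order count the depth of arrows to the *left* of an arrow only.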
Let Σ be a set of typed constants, i.e., tokens with a subscript that is a
simple type, and denote by Σ_τ the subset of Σ of constants with subscript τ.
Lowercase letters with subscripts, e.g., c_τ, are syntactic variables ranging over
constants with subscript τ. For each type τ, let V_τ be an infinite set of variables
x¹_τ, x²_τ, . . ., all with subscript τ. Uppercase letters with subscripts, e.g., X_τ, are
syntactic variables ranging over the particular variables xⁱ_τ in V_τ. Subscripts of
syntactic variables may be omitted when they are obvious or irrelevant in a
particular context. Given the constants in Σ and variables in V_τ (for each type τ), we
can now define the binary relation of a term with a type as the least relation
satisfying the following clauses.
1. If c_τ ∈ Σ then c_τ is a term of type τ.
2. If X_τ ∈ V_τ then X_τ is a term of type τ.
3. If X_τ is a variable with subscript τ and M_γ is a term of type γ, then
(λX_τ M_γ) is a term of type (τ → γ).
4. If F_{τ→γ} is a term of type (τ → γ), and A_τ is a term of type τ, then
(F_{τ→γ} A_τ) is a term of type γ.
Uppercase letters with subscripts, e.g., M_τ, are syntactic variables ranging over
terms of type τ. The parentheses in (λX_τ M_γ) and (F_{τ→γ} A_τ) can be omitted
if clear in a given context.
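The four clauses translate into a short type-checking function. In this sketch (ours; the tagged-tuple encoding is an assumption, not the chapter’s notation), every variable and constant carries its type, Church-style:

```python
# Terms: ('var', name, ty), ('const', name, ty),
#        ('abs', var, body) for λX_τ M, and ('app', f, a) for (F A).
# Types: primitive names, or ('->', g, t) for g → t.

def typeof(term):
    tag = term[0]
    if tag in ('var', 'const'):            # clauses 1 and 2
        return term[2]
    if tag == 'abs':                       # clause 3: (λX_τ M_γ) : τ → γ
        _, var, body = term
        return ('->', var[2], typeof(body))
    if tag == 'app':                       # clause 4: (F_{τ→γ} A_τ) : γ
        _, f, a = term
        fty = typeof(f)
        if fty[0] != '->' or fty[1] != typeof(a):
            raise TypeError('ill-typed application')
        return fty[2]
    raise ValueError('unknown term')

x = ('var', 'x', 'i')
f = ('const', 'f', ('->', 'i', 'i'))
# λx_ι (f x) has type ι → ι:
assert typeof(('abs', x, ('app', f, x))) == ('->', 'i', 'i')
```

Since the type of a term is determined by the fixed types of its variables and constants, no typing context needs to be threaded through the recursion.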
Each occurrence of a variable in a term is either bound by a λ or free. We
consider two terms A and B to be equal (and write A ≡ B) if they are the
same up to a systematic change of bound variable names (i.e., we consider α-
conversion implicitly). A term A is closed if it has no free variables.
A substitution of a term A_α for a variable X_α in a term B_β is denoted by
[A/X]B. Since we consider α-conversion implicitly, we assume that the bound
variables of B are changed as necessary to avoid variable capture. For
example, [x¹_ι/x²_ι](λx¹_ι x²_ι) is equal to, say, λx³_ι x¹_ι, which is not equal to λx¹_ι x¹_ι.
⁷ Different notions of ‘order’ have actually been discussed in the literature. We may, e.g.,
start with ord(ι) = 0 and ord(o) = 1.
Two important relations on terms are given by β-reduction and η-reduction.
A β-redex (λX A) B (i.e., the application of an abstraction to an argument) β-
reduces to [B/X]A (i.e., the substitution of an actual argument for a formal
argument). If X is not free in C, then λX (C X) is an η-redex and it η-reduces
to C. If A β-reduces to B then we say that B β-expands to A. Similarly, if
A η-reduces to B then we say that B η-expands to A. For terms A and B of
the same type, we write A ≡_β B to mean A can be converted to B by a series
of β-reductions and expansions. Similarly, A ≡_βη B means A can be converted
to B using both β- and η-conversion. For each simply typed λ-term A there is
a unique β-normal form (denoted A↓_β) and a unique βη-normal form (denoted
A↓_βη). From this fact we know A ≡_β B (A ≡_βη B) if and only if A↓_β ≡ B↓_β
(A↓_βη ≡ B↓_βη).
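These conversions can be animated with a small normalizer. The sketch below (our illustration, using a hypothetical tagged-tuple term representation) performs capture-avoiding substitution and repeatedly contracts β-redexes; on simply typed terms this always reaches the β-normal form:

```python
import itertools

# Terms: ('var', n, ty), ('const', n, ty), ('abs', var, body), ('app', f, a).
_fresh = itertools.count()

def free_vars(t):
    if t[0] == 'var':
        return {t[1]}
    if t[0] == 'const':
        return set()
    if t[0] == 'abs':
        return free_vars(t[2]) - {t[1][1]}
    return free_vars(t[1]) | free_vars(t[2])

def subst(t, name, repl):
    """[repl/name]t, renaming bound variables to avoid capture."""
    if t[0] == 'var':
        return repl if t[1] == name else t
    if t[0] == 'const':
        return t
    if t[0] == 'abs':
        v, body = t[1], t[2]
        if v[1] == name:                   # the substituted name is shadowed
            return t
        if v[1] in free_vars(repl):        # α-convert the binder first
            w = ('var', v[1] + str(next(_fresh)), v[2])
            body = subst(body, v[1], w)
            v = w
        return ('abs', v, subst(body, name, repl))
    return ('app', subst(t[1], name, repl), subst(t[2], name, repl))

def beta_normal(t):
    """Repeatedly contract β-redexes (λX A) B ⇒ [B/X]A."""
    if t[0] in ('var', 'const'):
        return t
    if t[0] == 'abs':
        return ('abs', t[1], beta_normal(t[2]))
    f, a = beta_normal(t[1]), beta_normal(t[2])
    if f[0] == 'abs':
        return beta_normal(subst(f[2], f[1][1], a))
    return ('app', f, a)

x = ('var', 'x', 'i')
y = ('var', 'y', 'i')
c = ('const', 'c', 'i')
assert beta_normal(('app', ('abs', x, x), c)) == c     # (λx x) c ⇒ c
# (λx λy x) y ⇒ λy′ y: the bound y is renamed, not captured.
out = beta_normal(('app', ('abs', x, ('abs', y, x)), y))
assert out[0] == 'abs' and out[2] == y and out[1][1] != 'y'
```

On untyped terms this loop need not terminate (consider (λx (x x)) (λx (x x))), which is precisely why the simple types matter here.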
The simply typed λ-terms of Church (1940) are essentially the ones in com-
mon use today (cf. Barendregt et al. (2013)). One subtlety is that all variables
and constants carry their type as part of their name: that is, constants
and variables are not associated with a type which could vary with different type
contexts. Instead, constants and variables have fixed types just as they have
fixed names: thus, the variable f_{ι→ι} has the name f_{ι→ι} and the type ι → ι.
This handling of type information is also called Church-style (as opposed to
Curry-style). In this paper we often omit the type subscript if it can easily be
inferred in the given context.
3.4. Formulas as terms of type o
In most presentations of first-order logic, terms can be components of for-
mulas but formulas are never components of terms. Church’s STT allows for
the latter possibility as well: formulas and logical connectives are allowed within
terms. Such a possibility greatly increases both the expressive strength of the
logic and the complexity of automated reasoning in the logic. STT achieves
this intertwining of terms and formulas by using the special primitive type o
to denote those simply typed terms that are the formulas of STT. Thus, we
introduce logical connectives as specific constants whose types are built from o. Since
Church’s version of STT was based on classical logic, he chose as the primitive
logical connectives ¬_{o→o} for negation, ∨_{o→o→o} for disjunction, and, for each type
γ, a symbol ∀_{(γ→o)→o}. Other logical connectives, such as ∧_{o→o→o}, ⊃_{o→o→o}, and
∃_{(γ→o)→o} (for every type γ), can be introduced by either also treating them as
primitive or via definitions. A formula or proposition of STT is a simply typed
λ-term of type o and a sentence of STT is a closed proposition (i.e., containing
no free variables). In order to make the syntax of λ-terms converge more to
the conventional syntax of logical formulas, we shall adopt the usual conven-
tions for quantified expressions by using the abbreviations ∀x_γ.B and ∃x_γ.B for
∀_{(γ→o)→o} λx_γ.B and ∃_{(γ→o)→o} λx_γ.B, respectively. Similarly, the familiar infix
notations B ∨ C, B ∧ C, and B ⊃ C abbreviate the expressions ((∨_{o→o→o} B) C),
((∧_{o→o→o} B) C), and ((⊃_{o→o→o} B) C), respectively.
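For instance (with assumed constants P and Q of type ι → o; the example is ours), the abbreviation conventions unfold as follows:

```latex
\forall x_{\iota}.\, P\,x \supset Q\,x
\quad\text{abbreviates}\quad
\forall_{(\iota\to o)\to o}\,\bigl(\lambda x_{\iota}.\,((\supset_{o\to o\to o} (P\,x))\,(Q\,x))\bigr)
```

The quantifier is thus an ordinary constant applied to a λ-abstraction, which is exactly what lets λ-conversion explain quantifier instantiation.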
Beyond the logical connectives and quantifiers of classical logic, STT also
contains the constant ι_{(γ→o)→γ} for each simple type γ. This constant is axiom-
atized in STT so that ι_{(γ→o)→γ} B denotes a member of the set that is described
by the expression B of type γ→o. Thus, ι_{(γ→o)→γ} is used variously as a
description operator or a choice function, depending on the precise set of ax-
ioms assumed for it. Choice selects a member from B if B is non-empty, while
description selects a member from B only if B is a singleton set.
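For instance, writing c for a constant of type γ, applying the description operator to the singleton predicate λx_γ. x ≐ c picks out c itself. This small example (ours, not Church's) illustrates how a term may contain a formula:

```latex
\iota_{(\gamma\to o)\to\gamma}\,(\lambda x_\gamma.\, x \doteq c) \;\doteq\; c
```

Here the argument of ι is a predicate (a term of type γ→o) whose body is itself a formula.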
3.5. Elementary type theory
Church (1940) gives a Frege-Hilbert style logical calculus for deriving formu-
las. The inference rules can be classified as follows.
I–III. One step of α-conversion, β-reduction, or β-expansion.
IV. Substitution: From F_{τ→o} X_τ, infer F_{τ→o} A_τ, provided X is not free in F.
V. Modus Ponens: From A ⊃ B and A, infer B.
VI. Generalization: From F_{τ→o} X_τ, infer ∀_{(τ→o)→o} F_{τ→o}, provided X is not free in F.
In addition to the inference rules, Church gives various axiom schemas. Consider
first the following axiom schemas.
1–4. Classical propositional axioms.
5_τ. For every simple type τ, ∀_{(τ→o)→o} F_{τ→o} ⊃ F X.
6_τ. For every simple type τ, ∀X_τ (p_o ∨ F_{τ→o} X) ⊃ (p ∨ ∀_{(τ→o)→o} F).
These axioms (together with the inference rules above) describe the theorems
of what is often called elementary type theory (ETT) (Andrews, 1974): these
axioms simply describe an extension of first-order logic with quantification at
all simple types and with the term structure upgraded to that of all simply typed λ-
terms. In the last century, much of the work on the automation of higher-order
logic focused on the automation of elementary type theory.
3.6. Simple type theory
In order to encode mathematical concepts, additional axioms are needed. In
order to describe these, we need to introduce expressions for denoting equality
and natural numbers.
Equality. Equality for terms of type τ, written A_τ ≐_τ B_τ, is defined using Leibniz's
formula ∀P_{τ→o}. P A ⊃ P B. By A_τ ≠_τ B_τ we mean ¬(A_τ ≐_τ B_τ).
Natural numbers. An individual natural number n is denoted by the Church
numeral encoding n-fold iteration (Church, 1936). Thus, the following denote
the λ-calculus encodings of zero, one, two, and three (here, τ is any simple type).
λf_{τ→τ} λx_τ. x
λf_{τ→τ} λx_τ. f x
λf_{τ→τ} λx_τ. f (f x)
λf_{τ→τ} λx_τ. f (f (f x))
Notice that if we denote by τ̂ the type (τ→τ)→τ→τ, then all these terms
are of type τ̂. The λ-abstraction λn_τ̂ λf_{τ→τ} λx_τ. f (n f x) is denoted S_{τ̂→τ̂} and
has the property that it computes the successor of a number encoded in this
fashion. The set of all natural numbers (based on iteration of functions over
type τ) can be defined as the λ-abstraction
λn_τ̂ ∀p_{τ̂→o}. (p 0_τ̂ ∧ ∀x_τ̂ (p x ⊃ p (S_{τ̂→τ̂} x))) ⊃ p n
of type τ̂→o. This expression uses higher-order quantification to define the
set of all natural numbers as the set of terms n that are members of all
sets that contain zero and are closed under successor. It is unfortunate that this
encoding of numbers depends on a specific type: in other words, there is a
different set of natural numbers for every type τ. The polymorphic type system
of Girard (1971, 1986) and Reynolds (1974) fixed this problem by admitting
within λ-terms the ability to abstract and apply types.
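These numeral encodings transcribe directly into any functional language. The following Python sketch (the names are ours, not Church's) represents a numeral as a function iterating f exactly n times:

```python
# Church numerals as n-fold iteration: a numeral takes f and x and
# returns f applied n times to x.
zero = lambda f: lambda x: x
succ = lambda n: lambda f: lambda x: f(n(f)(x))   # λn λf λx. f (n f x)

def to_int(n):
    """Decode a Church numeral by iterating integer successor from 0."""
    return n(lambda k: k + 1)(0)

three = succ(succ(succ(zero)))
assert to_int(zero) == 0
assert to_int(three) == 3
```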
Adding the following axioms to those of the elementary type theory yields
Church's simple theory of types (STT).
7. There exist at least two individuals: ∃X_ι ∃Y_ι. X ≠_ι Y.
8. Infinity: The successor function on Church numerals at type (ι→ι)→ι→ι
is injective.
9_τ. Description: (∃X_τ ∀Y_τ. F Y ≡ X ≐ Y) ⊃ F (ι_{(τ→o)→τ} F).
10_{τγ}. Functional extensionality: (∀X_τ. F X ≐ G X) ⊃ F ≐_{τ→γ} G.
11_τ. Choice: (∃X_τ. F X) ⊃ F (ι_{(τ→o)→τ} F).
Church also mentions the possibility of including an additional axiom of
extensionality of the form (P ≡ Q) ⊃ P ≐_o Q. In fact, Henkin (1950) includes
this axiom as part of his Axiom 10 (and he excludes Axioms 7–9_τ). We follow
Henkin and include the following axiom as part of Axiom 10.
10_o. Boolean extensionality: (P ≡ Q) ⊃ P ≐_o Q.
The description axioms (Axioms 9_τ) allow us to use ι_{(τ→o)→τ} to extract the
unique element of any singleton set. If we assume the description axiom, then
we can prove that every functional relation corresponds to a function. That is,
we can prove
(∀x_τ ∃y_γ. r_{τ→γ→o} x y ∧ ∀z_γ. r x z ⊃ y ≐ z) ⊃ ∃f_{τ→γ} ∀x_τ. r x (f x)
This fact may be used to justify adopting a purely relational perspective for the
foundation of mathematics, since functions are derivable from relations in STT.
The choice axioms (Axioms 11_τ) are strictly stronger than the description ax-
ioms (see Andrews (1972a)). Many interactive theorem provers include a choice
operator, but the systematic inclusion of choice into automated procedures has
only happened recently (see also Section 6).
Finally, Axioms 7 and 8 guarantee that there will be infinitely many indi-
viduals. There are many ways to add an axiom of infinity and Church’s choice
is convenient for developing some basic number theory using Church numerals.
3.7. Variants of elementary and simple type theory
Besides the various subsets of STT that involve choosing different subsets of
the axioms to consider, other important variants of ETT and STT have been
developed.
Adding to ETT the axioms of Boolean and functional extensionality (Ax-
ioms 10_o and 10_{τγ}), and possibly choice, gives a theory sometimes called
extensional type theory (ExTT): equivalently, STT without description, infinity,
and Axiom 7. This is the logic studied by Henkin (1950), and it is the logic
that is automated by the theorem provers described in Section 7.3. One can-
not prove from ETT alone that η-conversion preserves the equality of terms: a
fact that is provable, however, in ExTT. Also, Boolean extensionality can
be considered without including functional extensionality and vice versa. Most
modern implementations of ETT treat the equality of typed λ-terms
up to βη-conversion. By doing so, some weak form of extensionality is au-
tomatically guaranteed. However, this should not be confused with supporting
full functional and Boolean extensionality (cf. the discussion of non-functional
models and extensionality in Section 4.1).
While Church axiomatized the logical connectives of STT in a rather conven-
tional fashion (using, for example, negation, conjunction, and universal quantifi-
cation as the primitive connectives), Henkin (1963) and Andrews (1972a, 2002)
provided a formulation of STT in which the sole logical connective was equality
(at all types). Not only was a formulation of logic using just this one logical
connective perspicuous, it also improved on the semantics of Henkin’s general
models (Henkin, 1950).
Probably the most significant other variant of STT is one that replaces the
classical logic underlying it with intuitionistic logic: several existing higher-
order logic systems are based on such a variant of STT. Intuitionistic variants
of STT are easily achieved by changing the logical axioms of STT from those
for classical logic to those for intuitionistic logic.
A logic of unity. ETT (and analogously ExTT or STT) provides a framework
for considering propositional logic and first-order logic as fragments of higher-
order logic. In the era of computer implementations of logic, this unifying aspect
of ETT is of great value: an implementation of aspects of ETT can immediately
be seen as an implementation that is also effective for these two kinds of simpler
logics.
3.8. An example
Consider formalizing the most basic notions of point-set topology in STT.
First, we formalize some simple set-theoretic concepts using the following typed
constants and λ-expressions.
empty set       ∅_{ι→o}                 = λx. x ≠ x   (or λx. ⊥)
membership      ∈_{ι→(ι→o)→o}           = λx λC. C x
subset          ⊆_{(ι→o)→(ι→o)→o}       = λA λB. ∀x. A x ⊃ B x
intersection    ∩_{(ι→o)→(ι→o)→ι→o}     = λA λB λx. A x ∧ B x
family union    ⋃_{((ι→o)→o)→ι→o}       = λD λx. ∃C. D C ∧ C x
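Over a small finite universe these λ-definitions can be mirrored directly by Python predicates. The finite universe and the finite family of sets are our own illustrative choices (quantifiers over predicates cannot, of course, be enumerated in general):

```python
# Sets of individuals as predicates (characteristic functions), mirroring
# the λ-definitions above; ∀x ranges over a finite sample universe and
# ∃C over a fixed finite family of sets.
UNIVERSE = [0, 1, 2, 3]

evens = lambda x: x % 2 == 0
small = lambda x: x < 2
FAMILY = [evens, small]                      # a finite family of sets

empty = lambda x: x != x                     # λx. x ≠ x
member = lambda x: lambda C: C(x)            # λx λC. C x
subset = lambda A: lambda B: all((not A(x)) or B(x) for x in UNIVERSE)
inter = lambda A: lambda B: lambda x: A(x) and B(x)
union_fam = lambda D: lambda x: any(D(C) and C(x) for C in FAMILY)

assert member(0)(evens)
assert subset(inter(evens)(small))(evens)    # evens ∩ small ⊆ evens
assert [x for x in UNIVERSE if union_fam(lambda C: True)(x)] == [0, 1, 2]
```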
We now define the symbol open_{((ι→o)→o)→(ι→o)→o} so that (open C S) holds
when C is a topology (a collection of open sets) on S. Informally, this should
hold when C is a set of subsets of S such that C contains the empty set as well
as the set S, and it is closed under (binary) intersections and arbitrary unions.
Formally, the symbol open can be defined as the λ-abstraction
λC λS. (∅ ∈ C) ∧ (S ∈ C) ∧ [∀A ∀B. (A ∈ C ∧ B ∈ C) ⊃ (A ∩ B) ∈ C]
∧ [∀B. (B ⊆ C) ⊃ (⋃B) ∈ C]
A simple fact about open sets is the following. Assume that C is a topology for
S. If G is a subset of S and every element of G is also a member of an open set
(i.e., of a member of C) that is a subset of G, then G itself is open. We can
formalize this theorem as the following formula in STT.
∀C ∀S. (open C S) ⊃ [∀x. x ∈ G ⊃ ∃S. S ∈ C ∧ x ∈ S ∧ S ⊆ G] ⊃ (G ∈ C)
This formula is provable in STT if we employ the functional extensionality axiom
10_{ιo} in order to show that the two predicates G and
⋃(λH. (H ∈ C) ∧ (H ⊆ G))
(both of type ι→o) are equal. Since it is an easy matter to prove that this
second expression is in C, Leibniz's definition of equality immediately yields
that G must also be in C.
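A finite sanity check of this fact, with frozensets standing in for predicates of type ι→o (the particular topology is our own toy example):

```python
# Every point of G lies in an open subset of G, and G equals the union
# of those open subsets; hence G is open in this finite example.
S = frozenset({0, 1, 2, 3})
C = {frozenset(), frozenset({0}), frozenset({1, 2}),
     frozenset({0, 1, 2}), S}                # a topology on S
G = frozenset({0, 1, 2})

opens_inside_G = [A for A in C if A <= G]
assert all(any(x in A for A in opens_inside_G) for x in G)
cover = frozenset().union(*opens_inside_G)
assert cover == G and G in C                 # hence G is open
```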
One weakness of using STT for formalizing an abstract notion of topology is
that in the definition above open sets were sets of individuals: that
is, they were of type ι→o. Of course, it might be interesting to consider
topologies on other types, for example, on sets of sets. We could adopt the
technique used in (Church, 1940) of indexing most notions with types. Clearly,
more expressive logics with richer treatments of
types and their quantification—for example, Girard's System F (Girard, 1986),
Reynolds's polymorphic λ-calculus (Reynolds, 1974), and Andrews's transfinite
type system (Andrews, 1965)—provide important techniques for abstracting
over types.
3.9. Church used different syntax not adopted here
Church’s introduction of λas a prefix operator to denote function abstrac-
tion and the use of juxtaposition to denote function application is now well
established syntax. On the other hand, Church used a number of syntactic
conventions and choices that appear rather odd to the modern reader. While
Church used a simplification of the dot notation of Whitehead and Russell
(1910, 1912, 1913), most uses of dots in syntax have been dropped in modern sys-
tems, although a dot is sometimes retained to separate a bound variable from its
body. Church similarly used concatenation to denote function types, but most
modern systems use an arrow. The use of omicron as the type for propositions
survives in some systems while many other systems use Prop (the latter is used
more frequently in systems for ETT, while the former seems more prominent in
systems for ExTT). Similarly, the connectives Π_{(γ→o)→o} and Σ_{(γ→o)→o} are often
replaced by the binders ∀ and ∃, respectively, although Π and Σ are often used to
denote quantification at the level of types in certain strong type systems.
4. Meta-theory
Church (1940) proved that the deduction theorem holds for the proof sys-
tem consisting of the axioms and inference rules described in Section 3. The
availability of the deduction theorem means that the familiar style of reasoning
from assumptions is valid in STT. Church also proved a number of theorems
regarding natural numbers and the possibility of defining functions using prim-
itive recursive definitions. The consistency of STT and a formal model theory
of STT were left open by Church.
4.1. Semantics and cut-elimination
We outline below several major meta-theoretic results concerning STT and
closely related logics.
Standard models. Gödel's incompleteness theorem (Gödel, 1931) can be ex-
tended directly to ETT (and STT) since second-order quantification can be
used to define Peano arithmetic: that is, there is a "true" formula of ETT that
is not provable. The notion of truth here, however, is that arising from what is
called the standard model of ETT (resp. STT), in which a functional type, say
γ→τ, contains all functions from the type γ to the type τ. Moreover, the type
o is assumed to contain exactly two truth values, namely true and false.
Henkin models. Henkin (1950) introduced a broader notion of general model
in which a type contains “enough” functions but not necessarily all functions.
Henkin then showed soundness and completeness. More precisely, he showed
that provability in ExTT coincides with truth in all general models (the stan-
dard one as well as the non-standard ones). Andrews (1972b) provided an
improvement on Henkin’s definition of general models by replacing the notion
that there be enough functions to provide denotations for all formulas of ETT
with a more direct means to define general models based on combinatory logic.
Andrews (1972a) points out that Henkin's definition of general models technically
was in error, since it admitted models in which the
axiom of functional extensionality (10_{τγ}) does not hold. Andrews then showed
that there is a rather direct way to fix this problem by shifting the underly-
ing logical connectives away from the usual Boolean connectives and quantifiers
to a type-indexed family of connectives {Q_{τ→τ→o}}_τ in which Q_{τ→τ→o} denotes
equality at type τ.
Non-functional models and extensionality. Henkin models are fully extensional,
i.e., they validate the functional and Boolean extensionality axioms 10_{τγ} and
10_o. The construction of non-functional models for ETT was pioneered
by Andrews (1971). In Andrews' so-called v-complexes, which are based on
Schütte's semi-valuation method (Schütte, 1960), both the functional and the
Boolean extensionality principles fail. Assuming β-equality, functional exten-
sionality 10_{τγ} splits into two weaker and independent principles: η (F ≐
λX. F X, if X is not free in the term F) and ξ (from ∀X. F ≐ G infer λX. F ≐ λX. G,
where X may occur free in F and G). Conversely, βη-conversion, which is built
in to many modern implementations of ETT, together with ξ implies functional
extensionality. Boolean extensionality, however, is independent of all of these
principles. A whole landscape of model structures for ETT
between Andrews' v-complexes and Henkin semantics, further illustrating and
clarifying the above connections, is developed in Benzmüller et al. (2004); Brown
(2004); Benzmüller (1999a), and an alternative development and discussion has
been contributed by Muskens (2007).
Takeuti's conjecture. Takeuti (1953) defined GLC ("generalized logical calculus")
by extending Gentzen's LK with (second-order) quantification over predicates.
He conjectured cut-elimination for the GLC proof system and showed that
this conjecture implies the consistency of analysis (second-order arithmetic).
Schütte (1960) presented a simplified version of Takeuti's GLC and gave a se-
mantic characterization of the Takeuti conjecture. This important conjecture
was proved by Tait (1966) for the second-order fragment using Schütte's se-
mantic results. The higher-order version of the conjecture was later proved by
Takahashi (1967) and by Prawitz (1968). The proof of strong normalization for
System F given by Girard (1971) also proves Takeuti’s conjecture as a conse-
quence. Andrews (1971) used the completeness of cut-free proofs (but phrased in
the contrapositive form as the abstract consistency principle (Smullyan, 1963))
in order to give a proof of the completeness of resolution in ETT. Takeuti (1975)
presented a cut-free sequent calculus with extensionality that is complete for
Henkin's general models. The abstract consistency proof technique, as used by
Andrews, has been further extended and applied to obtain cut-elimination re-
sults for different systems between ETT and ExTT by Brown (2004), Benzmüller
et al. (2004, 2008a), and Brown and Smolka (2010).
Candidates of reducibility. In the setting of the intuitionistic variants of STT,
the proofs themselves are of interest since they can be seen as programs that
carry the computational content of constructive proofs. Girard (1971, 1986)
proved the strong normalization of such proofs (expressed as richly typed λ-
terms). To achieve this strong normalization result, Girard introduced the can-
didats de réductibilité technique, which has become a common technique for proving
results such as cut-elimination for higher-order logics.
Herbrand’s theorem for ETT. In (Andrews et al., 1984), Andrews introduced a
notion of proof called a development that resembles Craig-style linear reason-
ing, in which a formula can be repeatedly rewritten until a tautologous formula
is encountered. Three kinds of formula rewritings are possible: instantiate a
top-level universal quantifier (with an eigenvariable), instantiate a top-level ex-
istential quantifier (with a term), or duplicate a top-level existential quantifier.
Completeness of developments for ETT can be taken as a kind of Herbrand
theorem for ETT. Miller (1983, 1987) presented the rewrites of developments
as a tree instead of a line. The resulting proof structure, called expansion trees,
provides a compact presentation of proofs for higher-order classical logic. Ex-
pansion trees are a natural generalization of Herbrand disjunctions to formulas
which might not be in prenex normal form and where higher-order quantification
might be involved.
4.2. Cut-simulation properties
Cut-elimination in first-order logic gives rise to the subformula property:
that is, cut-free proofs are arrangements of formulas which are just subformulas
of the formulas in the sequent at the root of the proof. In ETT (and ExTT or
STT), however, cut-free proofs do not necessarily satisfy this subformula prop-
erty. To better understand this situation, remember that predicate variables may
be instantiated with terms that introduce new formula structure. For this rea-
son, the subformula property may break (cf. Section 6.1). At the same
time, however, this offers the opportunity to mimic cut-introductions by appropriately se-
lecting such instantiations for predicate variables. For example, a cut formula ϕ
may be introduced by instantiating the law of excluded middle ∀P. P ∨ ¬P with
ϕ and by applying disjunction elimination. In other words, one may trivially
eliminate cut-rule applications by instead working with the axiom of excluded
middle.⁸ As shown by Benzmüller et al. (2009), effective cut-simulation is also
supported by other prominent axioms, including comprehension, induction, ex-
tensionality, description, and choice. Arbitrary (positive) Leibniz equations
can also be employed for the task.
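Schematically, instantiating excluded middle with the intended cut formula φ recreates the effect of a cut on φ (a natural-deduction-style sketch of ours, not a rule of any calculus discussed here):

```latex
\frac{\varphi \lor \neg\varphi \qquad
      \dfrac{\;[\varphi]\;}{\psi} \qquad
      \dfrac{\;[\neg\varphi]\;}{\psi}}
     {\psi}\;(\lor\text{-elim})
```

The two hypothetical branches play the roles of the two premises of a cut on φ.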
Cut-simulations have in fact been used extensively in the literature. For exam-
ple, Takeuti showed that a conjecture of Gödel could be proved without cut
by using the induction principle instead (Takeuti, 1960); McDowell and Miller
(2002) illustrate how the induction rule can be used to hide the cut rule; and
Schütte (1960) used excluded middle to similarly mask the cut rule.
In higher-order logic, cut-elimination and cut-simulation should always be
considered in combination: a pure cut-elimination result may indeed mean little
if at the same time axioms are assumed that support effective cut-simulation.
Church's use of the λ-calculus to build comprehension principles into the
language can therefore be seen as a first step in the program to eliminate the
need for cut-simulating axioms. Further steps have recently been achieved:
tableaux and resolution calculi have been presented that employ primitive equal-
ity and which provide calculus rules (as opposed to an axiomatic treatment) for
extensionality and choice (cf. Section 6.3). These calculus rules do not support
cut-simulation.
⁸For automating higher-order logic it is thus very questionable to start with intuitionistic
logic first and to simply add the law of excluded middle to arrive at classical logic.
4.3. Higher-order substitutions and normal forms
One of the challenges posed by higher-order substitution is that the many
normal forms on which theorem provers often rely are not stable under such sub-
stitution. Clearly, a formula in βη-normal form may no longer be in βη-normal
form after a λ-term instantiates a higher-order free variable in it. Similarly,
many other normal forms—e.g., negation normal, conjunctive normal, Skolem
normal—are not preserved under such substitutions. In general, this instability
is not a major problem since often one can re-normalize after performing such
substitutions. For example, one often immediately places terms into βη-normal
form after making a substitution. Since there can be an explosion in the size of
terms when such normalization is made, there are compelling reasons to delay
such normalization (Liang et al., 2005). Andrews (1971), for example, inte-
grates the production of conjunctive normal and Skolem normal forms within
the process of doing resolution.
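The instability under substitution can be seen concretely with a toy term representation (this representation and naive normalizer are our own sketch, not any prover's internals):

```python
# Why β-normal form is not stable under substitution for a
# higher-order variable.
def subst(term, var, repl):
    """Replace free occurrences of the variable `var` by the closed term
    `repl` (closedness of `repl` lets us ignore variable capture)."""
    kind = term[0]
    if kind == 'var':
        return repl if term[1] == var else term
    if kind == 'app':
        return ('app', subst(term[1], var, repl), subst(term[2], var, repl))
    if kind == 'lam':
        if term[1] == var:                 # bound occurrence shadows `var`
            return term
        return ('lam', term[1], subst(term[2], var, repl))
    return term                            # a constant

def beta_normalize(term):
    """Naive β-normalization, adequate for these tiny terms."""
    kind = term[0]
    if kind == 'app':
        f, a = beta_normalize(term[1]), beta_normalize(term[2])
        if f[0] == 'lam':                  # contract the redex (λx. b) a
            return beta_normalize(subst(f[2], f[1], a))
        return ('app', f, a)
    if kind == 'lam':
        return ('lam', term[1], beta_normalize(term[2]))
    return term

t = ('app', ('var', 'F'), ('const', 'a'))            # F a is β-normal
two_fold = ('lam', 'y', ('app', ('const', 'f'),
                         ('app', ('const', 'f'), ('var', 'y'))))
inst = subst(t, 'F', two_fold)             # instantiating F creates a redex
assert inst[1][0] == 'lam'
renorm = beta_normalize(inst)              # re-normalizing yields f (f a)
assert renorm == ('app', ('const', 'f'),
                  ('app', ('const', 'f'), ('const', 'a')))
```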
4.4. Encodings of higher-order logic into first-order logic
Given the expressiveness of first-order logic and that theoremhood in both
first-order logic and ETT (and ExTT or STT) is recursively enumerable, it
is not a surprise that provability in the latter can be formalized in first-order
logic. Some of the encodings have high-enough fidelity to make it possible to
learn something structural about ETT from its encoding. For example, Dowek
(2008) and Dowek et al. (2001) use an encoding of ETT in first-order logic
along with Skolemization in first-order logic in order to explain the nature of
Skolemization in ETT.
Mappings of second-order logic into many-sorted first-order logic have been
studied by Enderton (1972). Henschen (1972) presents a mapping from higher-
order logic and addresses the handling of comprehension axioms. For (type
restricted) ExTT with Henkin-style semantics, complete translations into many-
sorted, first-order logic have been studied by Kerber (1991, 1994).
Modern interactive theorem provers such as Isabelle nowadays employ trans-
lations from polymorphic higher-order logic into (unsorted or many-sorted) first-
order logic in order to employ first-order theorem provers to help prove subgoals.
Achieving Henkin completeness is thereby typically not a main issue; the fo-
cus is rather on practical effectiveness. Even soundness may be abandoned if
other techniques, such as subsequent proof reconstruction, can be employed
to identify unsound proofs. Relevant related work has been presented by Hurd
(2003), Meng and Paulson (2008), and Blanchette et al. (2013b).
5. Skolemization and unification
In the latter sections of this chapter, we describe a number of theorem provers
for various subsets of STT. They all achieve elements of their automation in
ways that resemble provers for first-order logic. In particular, when quantifiers
are encountered, they are either instantiated with eigenvariables (in the sense
of Gentzen (1969)) or, dually, instantiated with logic variables whose actual val-
ues are determined later via unification. To simplify the relationship between
eigenvariables and logic variables and, hence, simplify the implementation of
unification, it is customary to simplify quantifiers prior to performing proof
search. In classical first-order logic theorem provers, Skolemization provides
such simplification and unification does not need to deal with eigenvariables at
all.
While such a simplification of quantificational structure is possible in classi-
cal higher-order theorem provers, some important issues arise concerning quan-
tifier alternation, Skolemization, and term unification that are not genuine issues
in a first-order setting. We discuss these differences below.
5.1. Skolemization
A typical approach to simplifying the alternation of quantifiers in first-order
logic is to use Skolemization. This technique replaces an assumption of the
form, say, ∀x_τ ∃y_δ. P x y with the assumption ∀x_τ. P x (f x), where f is a new con-
stant of type τ→δ. If the original assumption is satisfiable then the Skolemized
formula is also satisfiable: in a model of the Skolemized formula, the meaning
of the Skolem function f is a suitable choice function.
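For the first-order case, the rewriting can be sketched as follows (the tuple representation and the single Skolem name 'f' are our own illustrative choices, not any prover's syntax):

```python
# Skolemizing a prenex first-order assumption: ∀x∃y. P x y
# becomes ∀x. P x (f x).
def subst(t, v, r):
    """Replace the variable v by the term r inside a tuple-structured term."""
    if t == v:
        return r
    if isinstance(t, tuple):
        return tuple(subst(s, v, r) for s in t)
    return t

def skolemize(formula, universals=()):
    """Drop each ∃, replacing its variable by a Skolem term applied to the
    universally quantified variables in whose scope it occurs."""
    kind = formula[0]
    if kind == 'forall':
        _, v, body = formula
        return ('forall', v, skolemize(body, universals + (v,)))
    if kind == 'exists':
        _, v, body = formula
        sk = ('f',) + universals           # one Skolem name suffices here
        return skolemize(subst(body, v, sk), universals)
    return formula                          # an atom: nothing left to do

phi = ('forall', 'x', ('exists', 'y', ('P', 'x', 'y')))
assert skolemize(phi) == ('forall', 'x', ('P', 'x', ('f', 'x')))
```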
Lifting Skolemization into higher-order logic is problematic for a number of
reasons. First, the λ-abstraction available in STT can internalize the choice
functions named by Skolem function symbols: such internalization can occur
even for fragments of STT in which the axiom of choice is not assumed. For
example, the resolution system for ETT introduced by Andrews (1971) used
Skolem functions to simplify quantifier alternations. While Andrews was able
to prove that resolution was complete for ETT, he did not provide the converse
result of soundness since some versions of the axiom of choice could be proved
(Andrews, 1973). As was shown by Miller (1983, 1992), the soundness of Skolem
functions can be guaranteed by placing suitable restrictions on the occurrences
of Skolem functions within λ-terms. In particular, consider an assumption of the
form ∀x_τ ∃y_{δ→θ}. P x y and its Skolemized version ∀x_τ. P x (f x), where f is a new
Skolem function of type τ→δ→θ. In order for a proof not to internalize the
choice function named by f, every substitution term t used in that proof must
be restricted so that every occurrence of f in t has at least one argument,
and any free variable occurrences in that argument must also be free in t. Thus
it is not possible to form an abstraction involving the Skolemization-induced
argument and, in that way, the Skolem function cannot be used as a general choice
function.
A second problem with using Skolemization is that there are situations where
a type may have zero or one inhabitant prior to Skolemization but can have
an infinite number of inhabitants after Skolemization (Miller, 1992). Such a
change in the nature and number of the terms inhabiting types
is a serious problem when a prover
wishes to present its proofs in forms that do not use Skolemization (such as
natural deduction or sequent calculus).
A third problem with using Skolemization is that, in the unification of typed λ-
terms, the treatment of λ-abstractions and the treatment of eigenvariables are
intimately related. For example, the unification problems ∃w_ι. (λx_ι. x) = (λx_ι. w)
and ∃w_ι ∀x_ι. x = w are essentially the same: since the second (purely first-order)
problem is not provable in first-order logic (it is true only in a singleton
domain), the original unification problem also has no solutions. Explaining the
non-unification of the terms λx_ι. x and λx_ι. w in terms of Skolemization and choice
functions seems rather indirect.
5.2. Unification of simply typed λ-terms
Traditionally, the unification of simply typed λ-terms can be described as
proving a formula of the form
∃x¹_{τ₁} … ∃xⁿ_{τₙ}. t₁ = s₁ ∧ ⋯ ∧ tₘ = sₘ   (n, m ≥ 0).
If we make the additional assumption that no variable in the quantifier pre-
fix is free in any of the terms s₁, …, sₘ, then this formula is also called a
matching problem. The order of the unification problem displayed above is
1 + max{ord(τ₁), …, ord(τₙ)}; thus, if n = 0 that order is 1. Andrews showed
(Andrews, 1974, Theorem 2) that such a formula is provable in ETT if and only
if there is a substitution θ for the variables in the quantifier prefix such that
for each i = 1, …, m, the terms tᵢθ and sᵢθ have the same normal form. Such a
substitution θ is called a unifier for that unification problem. Such unifica-
tion problems have been studied in which the common normal form is computed
using just β-conversion or with βη-conversion: thus one speaks of unification
or matching modulo β or modulo βη. This theorem immediately generalizes a
similar theorem for first-order logic.
Although Guard and his student Gould investigated higher-order versions
of unification as early as 1964 (Guard, 1964; Gould, 1966), it was not until 1972
that the undecidability of such unification was demonstrated independently by
Huet (1973a) and Lucchesi (1972). Those two papers showed that third-order
unification was undecidable; later Goldfarb (1981) showed that second-order
unification was also undecidable. The decidability of higher-order matching
was shown after several decades of effort: it was first shown for second-order
matching in Huet and Lang (1978); for third-order matching in Dowek (1992);
and for fourth-order matching in Padovani (2000). Finally, Stirling (2009) has
shown that matching at all orders is decidable.
Following such undecidability results for unification, the search for unifica-
tion procedures for simply typed λ-terms focused on the recursive enumeration
of unifiers. The first such enumeration was presented in (Pietrzykowski and
Jensen, 1972; Pietrzykowski, 1973; Jensen and Pietrzykowski, 1976). Their enu-
meration was intractable in implemented systems since when it enumerated a
unifier, subsequent unifiers in the enumeration would often subsume it, thus
leading to a highly redundant search for unifiers.
Huet (1975) presented a different approach to the enumeration of unifiers.
Instead of solving all unification problems, some unification pairs (the so-called
flex-flex pairs) were deemed too unconstrained to schedule for solving. In such
problems, the head of all terms in all equalities are existentially quantified. For
example, the unification problem fiigii.f a =g a is composed of only
flex-flex pairs and it has a surprising number of unifiers. In particular, let t
be any βη-normal closed term of type iand assume that the constant ahas
noccurrences in t. There are 2ndifferent ways to abstract afrom tand by
assigning one of these to fand possibly another to gwe have a unifier for
this unification problem. Clearly, picking blindly from this exponential set of
choices on an arbitrary term is not a good idea. An important design choice
in the semi-decision procedure of Huet (1975) is the delay of such unification
problems. In particular, Huet’s procedure computed “pre-unifiers”; that is,
substitutions that can reduce the original unification problem to one involving
only flex-flex equations. Huet showed that the search for pre-unifiers could be
done, in fact, without redundancy. He also showed how to build a resolution
procedure for ETT on pre-unification instead of unification by making flex-flex
equations into “constraints” on resolution (Huet, 1972, 1973b). The earliest
theorem provers for various supersets of ETT—TPS (Andrews et al., 1996),
Isabelle (Paulson, 1989), and λProlog (Nadathur and Miller, 1988; Miller and
Nadathur, 2012)—all implemented rather directly Huet’s search procedure for
pre-unifiers.
The unification of simply typed λ-terms does not have the most-general-
unifier property: that is, there can be two unifiers such that neither is an instance of
the other. Let g be a constant of type ι→ι→ι and a a constant of type ι. Then
the second-order unification problem ∃f_{ι→ι}. f a = g a a has four unifiers, in which f
is instantiated with λw. g w w, λw. g w a, λw. g a w, and λw. g a a. A theorem prover
that encounters such a unification problem may need to explore all four of these
unifiers during the search for a proof. It is also possible for a unification problem
to have an infinite number of unifiers that are not instances of one another. Such
is the case for the unification problem ∃f_{ι→ι}. λx. f (h x) = λx. h (f x), where h is
a constant of type ι→ι. All of the following instantiations for f yield a unifier:
λw. w, λw. h w, λw. h (h w), λw. h (h (h w)), ….
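A quick sanity check of these unifiers (not a unification algorithm), representing the constants a, g, h as symbolic term builders of our own devising:

```python
# Verify the unifiers listed above by evaluating both sides symbolically.
a = 'a'
g = lambda x, y: ('g', x, y)
h = lambda x: ('h', x)

# The four pairwise incomparable unifiers of  ∃f. f a = g a a:
for f in (lambda w: g(w, w), lambda w: g(w, a),
          lambda w: g(a, w), lambda w: g(a, a)):
    assert f(a) == g(a, a)

# Infinitely many unifiers of  ∃f. λx. f (h x) = λx. h (f x): f ↦ λw. hⁿ w.
def iterate_h(n):
    return (lambda w: w) if n == 0 else (lambda w: h(iterate_h(n - 1)(w)))

for n in range(5):
    f = iterate_h(n)
    x = 'x'                    # compare both bodies at a sample variable
    assert f(h(x)) == h(f(x))
```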
For more details about Huet’s search procedure for unifiers, we recommend
Huet’s original paper (Huet, 1975) as well as the subsequent papers by Snyder
and Gallier (1989) and Miller (1992), and the handbook chapter by Dowek
(2001). Here we illustrate some of the complexities involved with this style of
unification.
5.3. Mixed prefix unification problems
As we motivated above, it is natural to generalize unification problems away
from a purely existential quantifier prefix to one that has a mixed quantifier
prefix, i.e., a unification problem will be a formula of the form
Q₁x¹_{τ₁} … Qₙxⁿ_{τₙ}. t₁ = s₁ ∧ ⋯ ∧ tₘ = sₘ   (n, m ≥ 0).
Here, Qᵢ is either ∀ or ∃ for i = 1, …, n. There is, in fact, a simple technique
available in higher-order logic that is not available in first-order logic and that
can simplify quantifier alternation in such unification problems. In particular, if
∀x_τ ∃y_σ occurs within the prefix of a unification problem, it is a simple matter to
"rotate" the ∀x to the right: this requires "raising" the type of the ∃y quantifier.
That is, ∀x_τ ∃y_σ can be replaced by ∃h_{τ→σ} ∀x_τ if all occurrences of y in the scope
of ∃y are substituted by (h x). Thus, the unification problem ∀x_ι ∀y_ι ∃z_ι. f z x =
f y z (for some constant f of type ι→ι→ι) can be rewritten to the unification
problem
∃h_{ι→ι→ι} ∀x_ι ∀y_ι. f (h x y) x = f y (h x y).
This latter problem can be replaced by the equivalent unification problem
hιιι.λxλy.f (hxy)x=λxλy.f y(hxy). Using the technique of raising, any
unification problem with a mixed quantifier prefix can be rewritten to one with
a prefix of the form ∃∀. Furthermore, the block of quantifiers can be removed
from the prefix if they are converted to a block of λ-bindings in front of all terms
in all the equations. In this way, a mixed prefix can be rewritten to an equiva-
lent one involving only existential quantifiers. Details of performing unification
under a mixed prefix can be found in (Miller, 1992). The notion of ∀-lifting
employed by the Isabelle prover can be seen as a combination of backchaining
and raising (Miller, 1991; Nipkow, 1993).
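The raising rotation is easy to mechanize. The following sketch is our own illustration (not code from the literature): it works on an untyped, first-order-style term representation, with λ-binders and types omitted.

```python
# Terms: a variable or constant is a string; ('app', f, a) is application.
# Illustrative sketch of the raising rotation; lambda-binders and types
# are omitted for simplicity.

def app(f, *args):
    """Build the curried application f a1 ... an."""
    t = f
    for a in args:
        t = ('app', t, a)
    return t

def subst(t, v, s):
    """Replace every occurrence of the variable named v in t by the term s."""
    if t == v:
        return s
    if isinstance(t, tuple):
        return ('app', subst(t[1], v, s), subst(t[2], v, s))
    return t

def raise_prefix(prefix, eqs):
    """Rotate each adjacent pair (forall x)(exists y) to (exists h)(forall x),
    substituting y |-> (h x), until the prefix has the form exists.. forall.."""
    prefix, count = list(prefix), 0
    changed = True
    while changed:
        changed = False
        for i in range(len(prefix) - 1):
            (q1, x), (q2, y) = prefix[i], prefix[i + 1]
            if q1 == 'forall' and q2 == 'exists':
                count += 1
                h = 'h%d' % count                  # fresh raising variable
                prefix[i], prefix[i + 1] = ('exists', h), ('forall', x)
                eqs = [(subst(l, y, app(h, x)), subst(r, y, app(h, x)))
                       for l, r in eqs]
                changed = True
                break
    return prefix, eqs

# The example from the text: forall x forall y exists z. f z x = f y z
prefix, eqs = raise_prefix(
    [('forall', 'x'), ('forall', 'y'), ('exists', 'z')],
    [(app('f', 'z', 'x'), app('f', 'y', 'z'))])
# prefix is now exists h2, forall x, forall y, and the equation reads
# f (h2 x y) x = f y (h2 x y), as in the text (with h renamed).
```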
5.4. Pattern unification
There is a small subset of unification problems, first studied by Miller (1991),
whose identification has been important for the construction of practical sys-
tems. Call a unification problem a pattern unification problem if every occur-
rence of an existentially quantified variable, say, M, in the prefix is applied to a
list of arguments that are all distinct variables bound by either a λ-binder or a
universal quantifier in the scope of the existential quantifier. Thus, existentially
quantified variables cannot be applied to general terms but only to a restricted set
of bound variables. For example,
∃M ∃N. λxλy. f (M x y) = λxλy. N y        ∃M ∀x ∀y. f (M x y) = f y
∃M ∀x. λy. M x y = λy. M y x        ∃M ∃N. ∀x ∀y. M x y = N y
are all pattern unification problems. All these unification problems have most
general unifiers, respectively, [M ↦ λxλy.P y, N ↦ λy.f (P y)], [M ↦ λxλy.y],
[M ↦ λxλy.P], and [M ↦ λxλy.N y], where P is a new (existentially quantified)
variable. Notice that although the last two of these are examples of flex-flex
unification problems, they both have a most general unifier. The following
unification problems do not fall into this fragment:
∃M ∃N. λx. f (M x x) = N x        ∃M ∀x. f (M x) = M (f x).
Notice that all first-order unification problems are, in fact, pattern unifi-
cation problems, and that pattern unification problems are stable under the
raising technique mentioned earlier. The main result about pattern unification
is that—like first-order unification—unifiability is decidable and most
general unifiers exist for solvable problems. Also like first-order unification,
the types attributed to constructors are not needed for doing the unification.
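Checking the pattern restriction is straightforward. The sketch below is our own illustration under simplifying assumptions (untyped terms, and only λ-binders rather than both λ-binders and universal quantifiers):

```python
# Terms: variables/constants are strings, ('app', f, a) is application,
# ('lam', x, b) is lambda-abstraction.  Metavariables (the existentially
# quantified unification variables) are given as a set of names.

def spine(t):
    """Return (head, [args]) of a term's application spine."""
    args = []
    while isinstance(t, tuple) and t[0] == 'app':
        args.append(t[2])
        t = t[1]
    return t, args[::-1]

def is_pattern(t, metavars, bound=()):
    """True when every metavariable occurrence in t is applied to a list of
    distinct variables bound above it (here, lambda-bound variables)."""
    if isinstance(t, tuple) and t[0] == 'lam':
        return is_pattern(t[2], metavars, bound + (t[1],))
    head, args = spine(t)
    if head in metavars:
        return len(set(args)) == len(args) and all(a in bound for a in args)
    return all(is_pattern(a, metavars, bound) for a in args)

def lam(x, b):
    return ('lam', x, b)

def app(f, *args):
    t = f
    for a in args:
        t = ('app', t, a)
    return t

# lambda x y. f (M x y)   -- a pattern occurrence of M
ok = lam('x', lam('y', app('f', app('M', 'x', 'y'))))
# lambda x. f (M x x)     -- repeated argument: not a pattern
bad1 = lam('x', app('f', app('M', 'x', 'x')))
# lambda x. M (f x)       -- argument is not a bound variable: not a pattern
bad2 = lam('x', app('M', app('f', 'x')))
```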
5.5. Practical considerations
Earlier we mentioned that unification problems can be addressed using ei-
ther just β-conversion or βη-conversion. Although Huet (1975) considered both
unification modulo β- and βη-conversion separately, almost no implemented
system considers the pure β-conversion rules alone: term equality for STT is
uniformly treated as βη-convertibility.
Skolemization is a common technique for simplifying quantifier alternation
in many implemented higher-order theorem provers (cf. Section 7). On the
other hand, several other systems, particularly those based on the intuitionistic
fragment of ETT, do not use Skolemization: instead they either use raising, as
is done in Isabelle (Paulson, 1989, 1994), or they work directly with a represen-
tation of an unaltered quantifier prefix, as is done in the Teyjus implementation
(Nadathur and Linnell, 2005) of λProlog.
It is frequently the case that in computational logic systems that unify sim-
ply typed λ-terms, only pattern unification problems need to be solved. As a
result, some systems—such as the Teyjus implementation of λProlog and the
interactive theorem provers Minlog (Benl et al., 1998) and Abella (Gacek et al.,
2012)—only implement the pattern fragment since this makes their design and
implementation easier.
6. Challenges for the automation
While theorem provers for ETT, ExTT, and STT can borrow many tech-
niques from theorem provers for first-order logic, there are several challenges to
the direct implementation of such provers. We discuss some of these challenges
below.
6.1. Instantiation of predicate variables
During the search for proofs in quantificational logics, quantifiers need to be
instantiated (possibly more than once) with various terms. Choosing such terms
is a challenge partly because when a quantifier needs to be instantiated, the role
of that instantiation term in later proof steps is not usually known. To address
this gap between when a quantifier needs an instantiation term and when that
term’s structure is finally relevant to the completion of a proof, the techniques
of unification described in the previous section are used. When unification is
involved, quantifiers are instantiated not with terms but with variables which
represent a promise: before a proof is complete, those variables will be replaced
by terms. The variables that are introduced in this way are sometimes called
logical variables: these variables correspond to those marked using existential
quantification in the unification problems of Section 5.3.
In this way, one can delay the choice of which term to use to instantiate the
quantifier until the point where that term is actually used in the proof. As an
illustration of using unification in the search for a proof, consider attempting a
proof of the formula q (f a) from the conjunctive assumption
p a ∧ (∀x. p x ⊃ p (f x)) ∧ (∀y. p y ⊃ q y).
One way to prove this goal would be to assume, for example, that each univer-
sally quantified premise is used once with some currently unspecified term. In
this case, instantiate ∀x and ∀y with variables X and Y, respectively, and we
have an assumption of the form
p a ∧ (p X ⊃ p (f X)) ∧ (p Y ⊃ q Y).
We can then observe that the proof is complete if we chain together two appli-
cations of modus ponens: for that to work, we need to find substitution terms
for X and Y to solve the equations
p a = p X        p (f X) = p Y        q Y = q (f a).
Clearly, this unification problem is solvable when X and Y are replaced by a
and f a, respectively. Thus, if we were to repeat the steps of the proof but this
time instantiate the quantifiers ∀x and ∀y with a and (f a), respectively, the
chaining of the modus ponens steps would now lead to a proper proof.
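The unification used in this example is ordinary first-order unification. A minimal sketch (our own, not code from the chapter; the occurs check is omitted for brevity) suffices to solve the three equations above:

```python
# Terms: a logic variable is an uppercase string; other strings are
# constants; ('f', t1, ..., tn) applies the functor f to its arguments.
# A substitution is a dict mapping variable names to terms.

def is_var(t):
    return isinstance(t, str) and t[:1].isupper()

def walk(t, s):
    """Follow variable bindings in substitution s."""
    while is_var(t) and t in s:
        t = s[t]
    return t

def unify(t1, t2, s):
    """Extend substitution s to make t1 and t2 equal, or return None.
    (The occurs check is omitted for brevity.)"""
    t1, t2 = walk(t1, s), walk(t2, s)
    if t1 == t2:
        return s
    if is_var(t1):
        return {**s, t1: t2}
    if is_var(t2):
        return {**s, t2: t1}
    if isinstance(t1, tuple) and isinstance(t2, tuple) \
            and len(t1) == len(t2) and t1[0] == t2[0]:
        for a, b in zip(t1[1:], t2[1:]):
            s = unify(a, b, s)
            if s is None:
                return None
        return s
    return None

def resolve(t, s):
    """Fully apply substitution s to term t."""
    t = walk(t, s)
    if isinstance(t, tuple):
        return (t[0],) + tuple(resolve(a, s) for a in t[1:])
    return t

# The three equations from the text: p a = p X, p (f X) = p Y, q Y = q (f a)
s = {}
for lhs, rhs in [(('p', 'a'), ('p', 'X')),
                 (('p', ('f', 'X')), ('p', 'Y')),
                 (('q', 'Y'), ('q', ('f', 'a')))]:
    s = unify(lhs, rhs, s)
# The solution maps X to a and Y to f a.
```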
A key property of first-order quantificational logic is that the terms needed
for instantiating quantifiers can all be found using unification of atomic formu-
las. When predicate variables are present, however, the unification of atomic
formulas is no longer sufficient to generate all quantifier instantiations needed
for proofs. For example, the ETT theorem
∃p. (p x ⊃ (a x ∧ b x)) ∧ ((a x ∧ b x) ⊃ p x)
is proved by instantiating p with λw. a w ∧ b w. If that quantifier were, instead,
instantiated by the logic variable P to yield the formula
(P x ⊃ (a x ∧ b x)) ∧ ((a x ∧ b x) ⊃ P x),
no equalities between occurrences of atomic formulas will provide a unification
problem that has this unifier. Similarly, the theorem
∃q. (q a ⊃ q b) ⊃ p b ⊃ p a
is proved (in intuitionistic and classical logic) by instantiating q with the term
λw. p w ⊃ p a. Once again, however, if the quantifier ∃q were instantiated with a
logic variable Q, then no unification of the atomic formulas Q a, Q b, p a, and
p b would have yielded this substitution term for Q.
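Although unification cannot invent these predicate substitutions, one can verify by a small truth-table computation (our own check, not part of the chapter) that the stated instantiations do make both formulas true:

```python
from itertools import product

def implies(x, y):
    return (not x) or y

# First example: instantiate p with (lambda w. a w and b w); the body
# (p x => (a x and b x)) and ((a x and b x) => p x) must then hold for
# all truth values of a x and b x.
for ax, bx in product([False, True], repeat=2):
    px = ax and bx                      # p instantiated at w = x
    assert implies(px, ax and bx) and implies(ax and bx, px)

# Second example: instantiate q with (lambda w. p w => p a); the body
# (q a => q b) => p b => p a must then hold for all values of p a, p b.
for pa, pb in product([False, True], repeat=2):
    qa = implies(pa, pa)                # q a
    qb = implies(pb, pa)                # q b
    assert implies(implies(qa, qb), implies(pb, pa))
```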
Of course, it is not surprising that simple syntactic checks involving sub-
formulas are not sophisticated enough to compute substitutions for predicates.
Often the key insight into proving a mathematical theorem is the production of
the right set or relation: in ETT (ExTT or STT), these would be encoded as λ-
abstractions containing logical connectives. Similarly, induction can be encoded
and the invariants for inductive proofs would be encoded as similar terms and
used to instantiate predicate quantifiers. Nonetheless, a number of researchers
have described various schemes for inventing substitution terms for predicate
variables. We mention a few below.
Enumeration of substitutions. An early approach to the generation of predi-
cate substitutions was proposed by Huet (1972; 1973b), who essentially provided
a mechanism for guessing the top-level, logical structure of a substitution for
a predicate variable. Such guessing (called splittings in that paper) was in-
terleaved with resolution steps by a system of constraints. Thus, his system
suggested a candidate top-level connective for the body of a predicate substitu-
tion and then proceeded with the proof under that assumption.
A simple, prominent example illustrating the need for splittings is ∃P:o. P.
When using resolution, the formula is first negated and then normalized to the
clause ¬X, where X is a predicate variable. There is no resolution partner
available for this clause, hence the empty clause cannot be derived. However,
when guessing some top-level, logical structure for X (here the substitution
[¬Y/X] is suitable), ¬¬Y is derived, which normalizes into a new clause Y. Now,
resolution between the clauses ¬X and Y with substitution [Y/X] directly leads
to the empty clause.
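The derivation above can be replayed with a toy rendition (our own illustration, not Huet's system) in which all clauses are unit literals:

```python
# Literals: a string is an atom; ('not', t) is negation.  All clauses in
# the example are unit clauses, so a clause here is a single literal.

def subst(t, v, s):
    """Replace the predicate variable v by the term s."""
    if t == v:
        return s
    if isinstance(t, tuple):
        return ('not', subst(t[1], v, s))
    return t

def normalize(t):
    """Eliminate double negations."""
    if isinstance(t, tuple) and isinstance(t[1], tuple):
        return normalize(t[1][1])
    return t

clause = ('not', 'X')                # from negating  exists P:o. P

# Splitting guesses the substitution [not Y / X] ...
new_clause = normalize(subst(clause, 'X', ('not', 'Y')))
# ... which yields the clause Y; resolving Y against the original clause
# not X under the substitution [Y/X] gives complementary literals, i.e.,
# the empty clause.
complementary = normalize(subst(clause, 'X', new_clause)) == ('not', new_clause)
```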
Andrews’s primitive substitutions (Andrews, 1989) incorporate Huet’s
notion of splitting, and an alternative description of splitting can be found in
Dowek (1993).
Maximal set variables and set constraints. Bledsoe (1979) suggested a different
strategy for coming up with predicate substitutions: in some cases, one can tell
the maximal set that can solve a subgoal. Consider, for example, the formula
∃A. (∀x. A x ⊃ p x) ∧ C(A).
Clearly there are many possible instantiations for A that will satisfy the first
conjunct. For example, the empty set λw.⊥ is one of them, but it seems not
to be the best one. Rather, a more appropriate substitution for A might be
λw. p w ∧ B w, where B is a new variable that has the same type as A. The
extension of the latter expression can then range from the empty set (where B is
substituted by λw.⊥) to λw. p w (where B is substituted by λw.⊤). Felty (2000)
generalized and applied Bledsoe’s technique to the higher-order logic found in
the calculus of constructions. Moreover, Brown (2002) generalized Bledsoe’s
technique to STT. His solution, which employs reasoning with set constraints,
has been applied within the TPS theorem prover.
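Bledsoe's observation can be illustrated over a finite model (our own sketch), reading a predicate as the set of domain elements that satisfy it:

```python
from itertools import chain, combinations

# Read predicates over a finite domain as sets of elements.
domain = frozenset({0, 1, 2, 3})
p = frozenset({0, 2})

def A(B):
    """The candidate substitution  lambda w. p w and B w  as a set."""
    return p & B

def subsets(s):
    s = list(s)
    return (frozenset(c) for c in
            chain.from_iterable(combinations(s, r) for r in range(len(s) + 1)))

# B = lambda w. false gives the empty set; B = lambda w. true gives the
# maximal solution p itself; and every choice of B satisfies the first
# conjunct  forall x. A x => p x, i.e. A(B) is always a subset of p.
assert A(frozenset()) == frozenset()
assert A(domain) == p
assert all(A(B) <= p for B in subsets(domain))
```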
6.2. Induction invariants
Induction and, to a lesser extent, coinduction are important inference rules
in computer science and mathematics. Most forms of the induction rule require
showing that some set is, in fact, an invariant of the induction. Even if one
is only interested in first-order logic, the induction rule in this form requires
discovering the instantiation of the predicate variable that codes the induction
invariant. While Bledsoe (1979) provides some weak approaches to generating
such invariants, a range of techniques are routinely used to provide either invari-
ants explicitly or some evidence that an invariant exists. For an example of the
latter, the work on cyclic proofs (Sprenger and Dams, 2003) attempts to identify
cycles in the unfolding of a proof attempt as a guarantee that an invariant ex-
ists. Descente infinie (cf. also Wirth (2004)) and proof by consistency, some-
times also called inductionless induction (Comon, 2001), are also methods for
proving inductive theorems without explicitly needing to invent an invariant.
6.3. Equality, extensionality, and choice
There has been work on automating various axioms beyond those included
in ETT. As mentioned above, various work has focused on automation of in-
duction, which is based roughly on axioms 7 and 8. For many applications,
including mathematics, one certainly wants and needs to have extensionality
and maybe also choice (or description). However, the idea of treating such princi-
ples axiomatically, as proposed, e.g., in Huet (1973b) for extensionality, leads to
a significant increase of the search space, since these axioms (just like the induc-
tion axiom) introduce predicate variables and support cut-simulation (cf. Sec-
tion 4.2). Another challenge is that unification modulo Boolean extensionality
subsumes theorem proving: proving a proposition ϕ is the same as unifying ϕ
and ⊤ modulo Boolean extensionality. More information on these challenges is
provided by Benzmüller et al. (2009) and Benzmüller (2002).
Significant progress in the automation of ExTT in existing prover implemen-
tations has therefore been achieved by providing calculus-level support for
extensionality and also choice. Respective extensionality rules have been provided
for resolution (Benzmüller, 1999b), expansion and sequent calculi (Brown,
2004, 2005), and tableaux (Brown and Smolka, 2010). Similarly, choice rules
have been proposed for the various settings: sequent calculus (Mints, 1999),
tableaux (Backes and Brown, 2011), and resolution (Benzmüller and Sultana,
2013).
Analogously, (positive) Leibniz equations are toxic for proof automation,
since they also support cut-simulation. For this reason, the automation-
oriented tableaux and resolution approaches above support primitive equality and
provide respective rules. The use of Leibniz equations can hence be omitted in
the modeling of theories and conjectures in these approaches.
7. Automated theorem provers
7.1. Early systems
Probably the earliest project to mention is Peter Andrews’s NSF grant Proof
procedures for Type Theory (1965–67). The goal was to lift ideas from proposi-
tional and first-order logic to the higher-order case. Also J. A. Robinson (1969,
1970) argued for the construction of automated tools for higher-order logic. To-
gether with E. Cohen, Andrews started a first computer implementation based
on the inference rules of Andrews (1971) and the unification algorithm of Huet
(1975) in a subsequent project (1971–76). In 1977 this system proved Can-
tor’s theorem automatically in 259 seconds (Andrews and Cohen, 1977). After
1983, when D. Miller, F. Pfenning, and other students became involved, this
theorem prover was substantially revised. The revised system was then called TPS. The
TPS proof assistant (Miller et al., 1982; Andrews et al., 1996, 2000; Andrews
and Brown, 2006) was, in fact, not based on resolution but on matrix-style
theorem proving. Both λProlog (Nadathur and Miller, 1988) and the Isabelle
theorem prover (Paulson, 1989) were early systems that implemented sizable
fragments of the intuitionistic variants of ETT: they were tractable systems be-
cause they either removed or greatly restricted predicate quantification. Below
we survey other higher-order systems that attempted to deal with interactive
and automatic theorem proving in the presence of predicate quantification.
HOL. The ML based provers of the HOL family include HOL88, HOL98, and
HOL4 (Gordon and Melham, 1993). These systems are all based on the LCF
approach (Gordon et al., 1979), in which powerful proof tactics are iteratively
built up from a small kernel of basic proof rules. Other LCF-based provers for
higher-order logic are the minimalist system HOL Light (Harrison, 2009), which
provides powerful automation tactics and which has recently played a key role in
the verification of Kepler’s conjecture (Hales, 2013), and the ProofPower system
(Arthan, 2011), which provides special support for a set-theoretic specification
language.
Isabelle/HOL. Isabelle (Paulson, 1989) is a theorem prover with a core tactic
language built on a fragment of the intuitionistic variant of ETT. Built on this
core is the Isabelle/HOL (Nipkow et al., 2002) interactive theorem prover for
classical higher-order logic. Isabelle/HOL includes several powerful features
such as bridges to external theorem provers, sophisticated user interaction, and
the possibility to export specifications written in Isabelle/HOL as executable
code in various programming languages.
PVS. The prototype verification system PVS (Owre et al., 1992) combines a
higher-order specification language with an interactive theorem proving en-
vironment that integrates decision procedures, a model checker, and various
other utilities to improve user productivity in large formalization and verifi-
cation projects. Like Isabelle and the HOL provers, PVS also includes a rich
library of formalized theories.
IMPS. The higher-order interactive proof assistant IMPS (Farmer, 1993) pro-
vides good support for partial functions and undefined terms in STT (Farmer,
1990). Moreover, it supports human-oriented formal proofs which are never-
theless machine checked. Most importantly, IMPS organizes mathematics using
the “little theories” method in which reasoning is distributed over a network of
theories linked by theory morphisms (Farmer et al., 1992). It was the first theorem
proving system to employ this approach.
ΩMEGA. The higher-order proof assistant ΩMEGA (Benzmüller et al., 1997)
combines tactic based interactive theorem proving with automated proof plan-
ning. With support from an agent-based model, it integrates various external
reasoners: including first-order automated theorem provers, the higher-order
automated theorem provers LEO (Benzmüller and Kohlhase, 1998; Benzmüller,
1999a) and TPS, and computer algebra systems (Autexier et al., 2010). Proof
certificates from these external systems can be transformed and verified in
ΩMEGA.
λClam and IsaPlanner. λ-Clam (Richardson et al., 1998) is a higher-order vari-
ant of the CLAM proof planner (Bundy et al., 1990) built in λProlog. This
prover focuses on induction proofs based on the rippling technique. IsaPlanner
(Dixon and Fleuriot, 2003) is a related generic proof planner built on top of the
Isabelle system.
Deduction Modulo. In the deduction-modulo approach to theorem proving
(Dowek et al., 2003), a first-order presentation of (intensional) higher-order
logic can be exploited to automate higher-order reasoning (Dowek et al., 2001).
A recent implementation of the deduction modulo approach (still restricted to
first-order) has been presented by Burel (2011b).
Other early interactive proof assistants, for variants of constructive higher-
order logic, include Automath (Nederpelt et al., 1994), Nuprl (Constable et al.,
1986), LEGO (Pollack, 1994), Coq (Bertot and Casteran, 2004), and Agda (Co-
quand and Coquand, 1999). The logical frameworks Elf (Pfenning, 1994), Twelf
(Pfenning and Schürmann, 1999), and Beluga (Pientka and Dunfield, 2010) are
based on dependently typed higher-order logic. Related provers include the
general-purpose, interactive, type-free, equational higher-order theorem prover
Watson (Holmes and Alves-Foss, 2001) and the fully automated theorem prover
Otter-λ (Beeson, 2006) for λ-logic (a combination of λ-calculus and first-order
logic). Abella (Gacek et al., 2012) is a recently implemented interactive theorem
prover for an intuitionistic, predicative higher-order logic with inference rules
for induction and co-induction. ACL2 (Kaufmann and Moore, 1997) and KeY
(Beckert et al., 2007) are prominent first-order interactive proof assistants that
integrate induction.
7.2. The TPTP THF initiative
To foster the systematic development and improvement of higher-order au-
tomated theorem proving systems, Sutcliffe and Benzm¨uller (2010), supported
by several other members of the community, initiated the TPTP THF infras-
tructure. This project has introduced the THF syntax for higher-order logic,
has developed a library of benchmark and example problems, and provides
various support tools for the new THF0 language fragment. The THF0 lan-
guage supports ExTT (with choice) as also studied by Henkin (1950), that is,
it addresses the most commonly used and accepted aspects of Church’s type
theory.
Version 6.0.0 of the TPTP library contains more than 3000 problems in the
THF0 language.
The library also includes the entire problem library of Andrews’ TPS project,
which, among other things, contains formalizations of many theorems of his textbook
(Andrews, 2002). The first-order TPTP infrastructure (Sutcliffe, 2009) provides
a range of resources to support usage of the TPTP problem library. Many of
these resources are now immediately applicable to the higher-order setting, al-
though some have required changes to support the new features of THF. The
development of the THF0 language has been paralleled and significantly in-
fluenced by the development of the LEO-II prover (Benzmüller et al., 2008b).
Several other provers have quickly adopted this language, leading to fruitful
mutual comparisons and evaluations. Several implementation bugs in different
systems have been detected this way.
7.3. TPTP THF0 compliant higher-order theorem provers
We briefly describe the currently available, fully automated theorem provers
for ExTT (with choice). These systems all support the new THF0 language
and they can be deployed online (avoiding local installations) via Sutcliffe’s
SystemOnTPTP facility.9
TPS. The TPS prover can be used to prove theorems of ETT or ExTT au-
tomatically, interactively, or semi-automatically. When searching for a proof
automatically, TPS first searches for an expansion proof (Miller, 1987) or an
extensional expansion proof (Brown, 2004) of the theorem. Part of this process
involves searching for acceptable matings (Andrews, 1981). Using higher-order
unification, a pair of occurrences of subformulas (which are usually literals) is
mated appropriately on each vertical path through an expanded form of the
theorem to be proved. Skolemization and pre-unification are employed, and cal-
culus rules for extensionality reasoning are provided. The behavior of TPS is
controlled by sets of flags, also called modes. About fifty modes have been found
that collectively suffice for automatically proving virtually all the theorems that
TPS has proved automatically thus far. A simple scheduling mechanism is em-
ployed in TPS to sequentially run these modes for a limited amount of time.
The resulting fully automated system is called TPS (TPTP).
LEO-II. LEO-II (Benzmüller et al., 2008b), the successor of LEO, is an auto-
mated theorem prover for ExTT (with choice) which is based on extensional
higher-order resolution. More precisely, LEO-II employs a refinement of exten-
sional higher-order RUE resolution (Benzmüller, 1999b). LEO-II employs Skolem-
ization and (extensional) pre-unification, and calculus rules for extensionality
and choice are provided. LEO-II is designed to cooperate with specialist systems for fragments
9See http://www.cs.miami.edu/~tptp/cgi-bin/SystemOnTPTP
of higher-order logic. By default, LEO-II cooperates with the first-order prover
E (Schulz, 2002). LEO-II is often too weak to find a refutation amongst
the steadily growing set of clauses on its own. However, some of the clauses in
LEO-II’s search space attain a special status: they are first-order clauses modulo
the application of an appropriate transformation function. Therefore, LEO-II
regularly launches time limited calls with these clauses to a first-order theorem
prover, and when the first-order prover reports a refutation, LEO-II also termi-
nates. Communication between LEO-II and the cooperating first-order theorem
prover uses the TPTP language and standards. LEO-II outputs proofs in TPTP
TSTP syntax.
Isabelle/HOL. The Isabelle/HOL system was originally designed as an inter-
active prover. However, in order to ease user interaction, several automatic
proof tactics have been added over the years. By appropriately scheduling a sub-
set of these proof tactics, some of which are quite powerful, Isabelle/HOL has in
recent years also been turned into an automatic theorem prover that can be run
from a command shell like other provers. The latest releases of this automated
version of Isabelle/HOL provide native support for different TPTP syntax for-
mats, including THF0. The most powerful proof tactics that are scheduled by
Isabelle/HOL include the sledgehammer tool (Blanchette et al., 2013a), which
invokes a sequence of external first-order and higher-order theorem provers, the
model finder Nitpick (Blanchette and Nipkow, 2010), the equational reasoner
simp (Nipkow, 1989), the untyped tableau prover blast (Paulson, 1999), the
simplifier and classical reasoners auto, force, and fast (Paulson, 1994), and the
best-first search procedure best. The TPTP incarnation of Isabelle/HOL does
not yet output proof terms.
Satallax. The higher-order, automated theorem prover Satallax (Brown, 2012,
2013) comes with model finding capabilities. The system is based on a complete
ground tableau calculus for ExTT (with choice) (Backes and Brown, 2011).
An initial tableau branch is formed from the assumptions of a conjecture and
negation of its conclusion. From that point on, Satallax tries to determine
unsatisfiability or satisfiability of this branch. Satallax progressively generates
higher-order formulas and corresponding propositional clauses. These formulas
and propositional clauses correspond to instances of the tableau rules. Satallax
uses the SAT solver MiniSat as an engine to test the current set of propositional
clauses for unsatisfiability. If the clauses are unsatisfiable, the original branch is
unsatisfiable. Satallax employs restricted instantiation and pre-unification, and
it provides calculus rules for extensionality and choice. If there are no quantifiers
at function types, the generation of higher-order formulas and corresponding
clauses may terminate. In that case, if MiniSat reports the final set of clauses
as satisfiable, then the original set of higher-order formulas is satisfiable (by
a standard model in which all types are interpreted as finite sets). Satallax
outputs proofs in different formats, including Coq proof scripts and Coq proof
terms.
Nitpick and Refute. These systems are (counter-)model finders. The ability
of Isabelle to find (counter-)models using the Refute and Nitpick (Blanchette
and Nipkow, 2010) commands has also been integrated into automatic systems.
They provide the capability to find models for THF0 formulas, which confirm the
satisfiability of axiom sets or the countersatisfiability of non-theorems. This has been
particularly useful for exposing errors in some THF0 problem encodings, and
revealing bugs in the THF0 theorem provers. Nitpick employs Skolemization.
agsyHOL. The agsyHOL prover (Lindblad) is based on a generic lazy narrow-
ing proof search algorithm. Backtracking is employed and a comparatively small
search state is maintained. The prover outputs proof terms in sequent style
which can be verified in the Agda proof checker.
coqATP. The coqATP prover (Bertot and Casteran, 2004) implements the (non-
inductive) part of the calculus of constructions. The system outputs proof terms
which are accepted as proofs by Coq (after the addition of a few definitions). The
prover has axioms for functional extensionality, choice, and excluded middle.
Propositional extensionality is not supported yet. In addition to axioms, a
small library of basic lemmas is employed.
7.4. Recent applications of automated THF0 provers
Over the years, the proof assistants from Section 7.1 have been applied in a
wide range of applications, including mathematics and formal verification. Typ-
ically these applications combine user interaction and partial proof automation.
For further information we refer to the websites of these systems.
With respect to full proof automation, the TPS system has long been the
leading system, and it has been employed to build up the TPS library
of formalized and automated mathematical proofs. More recently, however, TPS
has been outperformed by several other THF0 theorem provers. Below we briefly
point to some selected recent applications of the leading systems.
Both Isabelle/HOL and Nitpick have been successfully employed to check
a formalization of a C++ memory model against various concurrent programs
written in C++ (such as a simple locking algorithm) (Blanchette et al., 2011).
Moreover, Nitpick has been employed in the development of algebraic formal
methods within Isabelle/HOL (Guttmann et al., 2011).
Isabelle/HOL, Satallax, and LEO-II performed well in recent experiments
related to the Flyspeck project (Hales, 2013), in which a formalized proof of the
Kepler conjecture is being developed (mainly) in HOL Light; cf. the experiments
reported by Kaliszyk and Urban (2012, Table 7).
Most recently, LEO-II, Satallax, and Nitpick were employed to achieve a
formalization, mechanization, and automation of Gödel’s ontological proof of
the existence of God (Benzmüller and Woltzenlogel Paleo, 2013). This work
employs a semantic embedding of quantified modal logic in THF0 (Benzmüller
and Paulson, 2013). Some previously unknown results were contributed by the
provers.
Using the semantic embeddings approach, a wide range of propositional and
quantified non-classical logics, including parts of their meta-theory and their
combinations, can be automated with THF0 reasoners (cf. Benzmüller (2013);
Benzmüller et al. (2012) and Benzmüller (2011)). Automation is thereby com-
petitive, as recent experiments for first-order modal logic show (Benzmüller and
Raths, 2013).
THF0 reasoners can also be fruitfully employed for reasoning in expres-
sive ontologies (Benzmüller and Pease, 2012). Furthermore, the heteroge-
neous toolset HETS (Mossakowski et al., 2007) employs THF0 to integrate
the automated higher-order provers Satallax, LEO-II, Nitpick, Refute, and Is-
abelle/HOL.
8. Conclusion
We have summarized the development of theorem provers for Church’s sim-
ple theory of types (and elementary type theory) in the 20th century. Given
that the model theory and proof theory for ETT and STT are mature, a sig-
nificant number of interactive and, most recently, automated theorem proving
systems have been built for both STT and ETT. Many applications of these
systems support Church’s original motivation for STT, namely that it could be
an elegant, powerful, and mechanized foundation for mathematics. In addition
to mathematics, various other application areas (including non-classical logics)
are currently being explored.
Acknowledgments. We thank Chad Brown for sharing notes that he has written
related to the material in this chapter. Moreover, we thank Bruno Woltzenlogel
Paleo, Julian Röder, and Max Wisniewski for proofreading the document.
The first author has been supported by the German Research Foundation under
Heisenberg grant BE2501/9-1 and the second author has been supported by the
ERC Advanced Grant ProofCert.
References
Andreoli, J.M., 1992. Logic programming with focusing proofs in linear logic.
J. of Logic and Computation 2, 297–347.
Andrews, P., Brown, C., 2006. TPS: A hybrid automatic-interactive system for
developing proofs. J. Applied Logic 4, 367–395.
Andrews, P., Cohen, E., 1977. Theorem proving in type theory, in: Proc. of
IJCAI-77, 5th International Joint Conference on Artificial Intelligence.
Andrews, P.B., 1965. A Transfinite Type Theory with Type Variables. Stud-
ies in Logic and the Foundations of Mathematics, North-Holland Publishing
Company.
Andrews, P.B., 1971. Resolution in type theory. Journal of Symbolic Logic 36,
414–432.
Andrews, P.B., 1972a. General models and extensionality. Journal of Symbolic
Logic 37, 395–397.
Andrews, P.B., 1972b. General models, descriptions, and choice in type theory.
Journal of Symbolic Logic 37, 385–394.
Andrews, P.B., 1973. Letter to Roger Hindley dated January 22, 1973.
Andrews, P.B., 1974. Provability in elementary type theory. Zeitschrift für
mathematische Logik und Grundlagen der Mathematik 20, 411–418.
Andrews, P.B., 1981. Theorem proving via general matings. J. ACM 28, 193–
214.
Andrews, P.B., 1989. On connections and higher order logic. J. of Autom.
Reasoning 5, 257–291.
Andrews, P.B., 2001. Classical type theory, in: Robinson, A., Voronkov, A.
(Eds.), Handbook of Automated Reasoning. Elsevier Science, Amsterdam.
volume 2. chapter 15, pp. 965–1007.
Andrews, P.B., 2002. An Introduction to Mathematical Logic and Type Theory:
To Truth Through Proof. Second ed., Kluwer Academic Publishers.
Andrews, P.B., 2009. Church’s type theory, in: Zalta, E.N. (Ed.), The Stanford
Encyclopedia of Philosophy. spring 2009 ed.. Stanford University.
Andrews, P.B., Bishop, M., Brown, C.E., 2000. TPS: A theorem proving system
for type theory, in: McAllester, D. (Ed.), Proceedings of the 17th Interna-
tional Conference on Automated Deduction, Springer, Pittsburgh, USA. pp.
164–169.
Andrews, P.B., Bishop, M., Issar, S., Nesmith, D., Pfenning, F., Xi, H., 1996.
TPS: A theorem proving system for classical type theory. J. Autom. Reasoning
16, 321–353.
Andrews, P.B., Longini-Cohen, E., Miller, D., Pfenning, F., 1984. Automating
higher order logics. Contemp. Math 29, 169–192.
Arthan, R., 2011. Proofpower website. http://www.lemma-one.com/
ProofPower/index/.
Autexier, S., Benzmüller, C., Dietrich, D., Siekmann, J., 2010. OMEGA:
Resource-adaptive processes in an automated reasoning system, in: Crocker,
M.W., Siekmann, J. (Eds.), Resource-Adaptive Cognitive Processes, Springer,
Cognitive Technologies. pp. 389–423.
Backes, J., Brown, C.E., 2011. Analytic tableaux for higher-order logic with
choice. J. Autom. Reasoning 47, 451–479.
Barendregt, H., Dekkers, W., Statman, R., 2013. Lambda Calculus with Types.
Perspectives in Logic, Cambridge University Press.
Beckert, B., Hähnle, R., Schmitt, P.H. (Eds.), 2007. Verification of Object-
Oriented Software: The KeY Approach. LNCS 4334, Springer-Verlag.
Beeson, M., 2006. Mathematical induction in Otter-lambda. J. Autom. Rea-
soning 36, 311–344.
Benl, H., Berger, U., Schwichtenberg, H., Seisenberger, M., Zuber, W., 1998.
Proof theory at work: Program development in the minlog system, in: Bibel,
W., Schmitt, P. (Eds.), Automated Deduction. Kluwer. volume II.
Benzmüller, C., 1999a. Equality and Extensionality in Automated Higher-Order
Theorem Proving. Ph.D. thesis. Saarland University. URL: http://www.ags.
uni-sb.de/~chris/papers/diss.ps.gz.
Benzmüller, C., 1999b. Extensional higher-order paramodulation and RUE-
resolution, in: Ganzinger, H. (Ed.), Proc. of CADE-16, Springer. pp. 399–
413. URL: http://christoph-benzmueller.de/papers/C5.pdf, doi:10.
1007/3-540-48660-7_39.
Benzmüller, C., 2002. Comparing approaches to resolution based
higher-order theorem proving. Synthese 133, 203–235. URL:
http://christoph-benzmueller.de/papers/J5.pdf, doi:10.1023/A:
1020840027781.
Benzmüller, C., 2011. Combining and automating classical and non-classical
logics in classical higher-order logic. Annals of Mathematics and Artificial In-
telligence 62, 103–128. URL: http://christoph-benzmueller.de/papers/
J25.pdf.
Benzmüller, C., 2013. Automating quantified conditional logics in HOL,
in: Rossi, F. (Ed.), Proc. of IJCAI-23, Beijing, China. URL: http:
//christoph-benzmueller.de/papers/C37.pdf.
Benzmüller, C., Brown, C., Kohlhase, M., 2004. Higher-order semantics and
extensionality. Journal of Symbolic Logic 69, 1027–1088.
Benzmüller, C., Brown, C., Kohlhase, M., 2008a. Cut elimination with xi-
functionality, in: Benzmüller, C., Brown, C., Siekmann, J., Statman, R.
(Eds.), Reasoning in Simple Type Theory: Festschrift in Honor of Pe-
ter B. Andrews on His 70th Birthday. College Publications. Studies in
Logic, Mathematical Logic and Foundations, pp. 84–100. URL: http:
//christoph-benzmueller.de/papers/B8.pdf.
Benzmüller, C., Brown, C., Kohlhase, M., 2009. Cut-simulation and im-
predicativity. Logical Methods in Computer Science 5, 1–21. URL: http:
//christoph-benzmueller.de/papers/J18.pdf, doi:10.2168/LMCS-5(1:
6)2009.
Benzmüller, C., Cheikhrouhou, L., Fehrer, D., Fiedler, A., Huang, X., Ker-
ber, M., Kohlhase, M., Konrad, K., Melis, E., Meier, A., Schaarschmidt,
W., Siekmann, J., Sorge, V., 1997. OMEGA: Towards a mathemati-
cal assistant, in: McCune, W. (Ed.), Proceedings of CADE-14, Springer.
pp. 252–255. URL: http://christoph-benzmueller.de/papers/C1.pdf,
doi:10.1007/3-540-63104-6_23.
Benzmüller, C., Gabbay, D., Genovese, V., Rispoli, D., 2012. Embedding and
automating conditional logics in classical higher-order logic. Ann. Math. Ar-
tif. Intell. 66, 257–271. URL: http://christoph-benzmueller.de/papers/
J26.pdf.
Benzmüller, C., Kohlhase, M., 1998. LEO – a higher-order theorem prover, in:
Kirchner, C., Kirchner, H. (Eds.), Proc. of CADE-15, Springer. pp. 139–
143. URL: http://christoph-benzmueller.de/papers/C3.pdf, doi:10.
1007/BFb0054256.
Benzmüller, C., Paulson, L., 2013. Quantified multimodal logics in simple type
theory. Logica Universalis (Special Issue on Multimodal Logics) 7, 7–20.
URL: http://christoph-benzmueller.de/papers/J23.pdf, doi:10.1007/
s11787-012-0052-y.
Benzmüller, C., Pease, A., 2012. Higher-order aspects and context in
SUMO. Journal of Web Semantics 12-13, 104–117. URL: http:
//christoph-benzmueller.de/papers/J27.pdf, doi:10.1016/j.websem.
2011.11.008.
Benzmüller, C., Raths, T., 2013. HOL based first-order modal logic provers, in:
McMillan, K., Middeldorp, A., Voronkov, A. (Eds.), Proceedings of LPAR-
19, Stellenbosch, South Africa. URL: http://christoph-benzmueller.de/
papers/C38.pdf.
Benzmüller, C., Sultana, N., 2013. LEO-II version 1.5, in: Blanchette, J.C.,
Urban, J. (Eds.), PxTP 2013, EasyChair EPiC Series 14, 2-12. pp. 2–10.
Benzmüller, C., Theiss, F., Paulson, L., Fietzke, A., 2008b. LEO-II
- a cooperative automatic theorem prover for higher-order logic (sys-
tem description), in: Proc. of IJCAR 2008, Springer. pp. 162–170.
URL: http://christoph-benzmueller.de/papers/C26.pdf, doi:10.1007/
978-3-540-71070-7_14.
Benzmüller, C., Woltzenlogel Paleo, B., 2013. Formalization, Mechaniza-
tion and Automation of Gödel's Proof of God's Existence. ArXiv e-prints
arXiv:1308.4526.
Bertot, Y., Casteran, P., 2004. Interactive Theorem Proving and Program De-
velopment - Coq’Art: The Calculus of Inductive Constructions. Texts in
Theoretical Computer Science, Springer.
Bishop, M., 1999. Mating Search Without Path Enumeration. Ph.D. thesis.
Carnegie Mellon University.
Blanchette, J.C., B¨ohme, S., Paulson, L.C., 2013a. Extending Sledgehammer
with SMT solvers. J. Autom. Reasoning 51, 109–128.
Blanchette, J.C., B¨ohme, S., Popescu, A., Smallbone, N., 2013b. Encoding
monomorphic and polymorphic types, in: Piterman, N., Smolka, S.A. (Eds.),
Proc. of TACAS-19, Springer. pp. 493–507.
Blanchette, J.C., Nipkow, T., 2010. Nitpick: A counterexample generator for
higher-order logic based on a relational model finder, in: Kaufmann, M.,
Paulson, L.C. (Eds.), Proc. of ITP 2010, Springer. pp. 131–146.
Blanchette, J.C., Weber, T., Batty, M., Owens, S., Sarkar, S., 2011. Nitpicking
C++ concurrency, in: Schneider-Kamp, P., Hanus, M. (Eds.), Proceedings of
the 13th International ACM SIGPLAN Conference on Principles and Practice
of Declarative Programming, July 20-22, 2011, Odense, Denmark, ACM. pp.
113–124.
Bledsoe, W.W., 1979. A maximal method for set variables in automatic theorem-
proving, in: Machine Intelligence 9. John Wiley & Sons, pp. 53–100.
Boolos, G., 1987. A curious inference. Journal of Philosophical Logic 16, 1–12.
Brown, C., 2012. Satallax: an automatic higher-order prover. J. Autom. Rea-
soning , 111–117.
Brown, C.E., 2002. Solving for set variables in higher-order theorem proving,
in: Voronkov, A. (Ed.), Proc. of CADE-18, Springer. pp. 408–422.
Brown, C.E., 2004. Set Comprehension in Church’s Type Theory. Ph.D. the-
sis. Department of Mathematical Sciences, Carnegie Mellon University. See
also Chad E. Brown, Automated Reasoning in Higher-Order Logic, College
Publications, 2007.
Brown, C.E., 2005. Reasoning in extensional type theory with equality, in:
Nieuwenhuis, R. (Ed.), Proc. of CADE-20, Springer. pp. 23–37.
Brown, C.E., 2013. Reducing higher-order theorem proving to a sequence of sat
problems. J. Autom. Reasoning 51, 57–77.
Brown, C.E., Smolka, G., 2010. Analytic tableaux for simple type theory and
its first-order fragment. Logical Methods in Computer Science 6.
Bundy, A., van Harmelen, F., Horn, C., Smaill, A., 1990. The Oyster-Clam
system, in: Stickel, M.E. (Ed.), 10th International Conference on Automated
Deduction, Kaiserslautern, FRG, July 24-27, 1990, Proceedings, Springer. pp.
647–648.
Burel, G., 2011a. Efficiently simulating higher-order arithmetic by a first-order
theory modulo. Logical Methods in Computer Science 7, 1–31.
Burel, G., 2011b. Experimenting with deduction modulo, in: Bjørner, N.,
Sofronie-Stokkermans, V. (Eds.), Proc. of CADE-23, Springer. pp. 162–176.
Church, A., 1932. A set of postulates for the foundation of logic. Annals of
Mathematics 33, 346–366.
Church, A., 1936. An unsolvable problem of elementary number theory. Amer-
ican Journal of Mathematics 58, 354–363.
Church, A., 1940. A formulation of the simple theory of types. Journal of
Symbolic Logic 5, 56–68.
Chwistek, L., 1948. The Limits of Science: Outline of Logic and of the Method-
ology of the Exact Sciences. London: Routledge and Kegan Paul.
Comon, H., 2001. Inductionless induction, in: Robinson, A., Voronkov, A.
(Eds.), Handbook of Automated Reasoning. Elsevier Science. volume I. chap-
ter 14, pp. 913–962.
Constable, R., Allen, S., Bromly, H., Cleaveland, W., Cremer, J., Harper, R.,
Howe, D., Knoblock, T., Mendler, N., Panangaden, P., Sasaki, J., Smith, S.,
1986. Implementing Mathematics with the Nuprl Proof Development System.
Prentice-Hall.
Coquand, C., Coquand, T., 1999. Structured type theory, in: Felty, A. (Ed.),
Proc. of LMF99: Workshop on Logical Frameworks and Meta-languages.
URL: http://www.site.uottawa.ca/~afelty/LFM99/.
Cousineau, D., Dowek, G., 2007. Embedding pure type systems in the lambda-
pi-calculus modulo, in: Rocca, S.R.D. (Ed.), Typed Lambda Calculi and
Applications, 8th International Conference, TLCA 2007, Paris, France, June
26-28, 2007, Proceedings, Springer. pp. 102–117.
Curry, H., 1942. The inconsistency of certain formal logics. Journal of Symbolic
Logic 7, 115–117.
Dixon, L., Fleuriot, J.D., 2003. IsaPlanner: A prototype proof planner in Is-
abelle, in: Baader, F. (Ed.), Automated Deduction - CADE-19, 19th Inter-
national Conference on Automated Deduction Miami Beach, FL, USA, July
28 - August 2, 2003, Proceedings, Springer. pp. 279–283.
Dowek, G., 1992. Third order matching is decidable, in: 7th Symp. on Logic
in Computer Science, IEEE Computer Society Press, Santa Cruz, California.
pp. 2–10.
Dowek, G., 1993. A complete proof synthesis method for the cube of type
systems. Journal of Logic and Computation 3, 287–315.
Dowek, G., 2001. Higher-order unification and matching, in: Robinson, A.,
Voronkov, A. (Eds.), Handbook of Automated Reasoning. Elsevier Science,
New York. volume II. chapter 16, pp. 1009–1062.
Dowek, G., 2008. Skolemization in simple type theory: the logical and the
theoretical points of view, in: Reasoning in Simple Type Theory: Festschrift
in Honor of Peter B. Andrews on His 70th Birthday. College Publications.
number 17 in Studies in Logic, pp. 244–255.
Dowek, G., Hardin, T., Kirchner, C., 2001. HOL-λσ an intentional first-order
expression of higher-order logic. Mathematical Structures in Computer Sci-
ence 11, 1–25.
Dowek, G., Hardin, T., Kirchner, C., 2003. Theorem proving modulo. J. Autom.
Reasoning 31, 33–72.
Enderton, H.B., 1972. A Mathematical Introduction to Logic. Academic Press.
Enderton, H.B., 2012. Second-order and higher-order logic, in: Zalta, E.N.
(Ed.), The Stanford Encyclopedia of Philosophy. fall 2012 ed.. Stanford Uni-
versity.
Farmer, W.M., 1990. A partial functions version of Church's simple theory of
types. J. Symb. Log. 55, 1269–1291.
Farmer, W.M., 1993. IMPS: An interactive mathematical proof system. J.
Autom. Reasoning 11, 213–248.
Farmer, W.M., 2008. The seven virtues of simple type theory. J. Applied Logic
6, 267–286.
Farmer, W.M., Guttman, J.D., Thayer, F.J., 1992. Little theories, in: Kapur,
D. (Ed.), Automated Deduction - CADE-11, 11th International Conference
on Automated Deduction, Saratoga Springs, NY, USA, June 15-18, 1992,
Proceedings, Springer. pp. 567–581.
Felty, A., 2000. The calculus of constructions as a framework for proof search
with set variable instantiation. Theoretical Computer Science 232, 187–229.
Frege, G., 1879. Begriffsschrift, eine der arithmetischen nachgebildete Formel-
sprache des reinen Denkens. Halle. Translated in van Heijenoort (1967).
Gacek, A., Miller, D., Nadathur, G., 2012. A two-level logic approach to
reasoning about computations. J. Autom. Reasoning 49, 241–273. URL:
http://arxiv.org/abs/0911.2993, doi:10.1007/s10817-011-9218-1.
Gentzen, G., 1969. Investigations into logical deduction, in: Szabo, M.E. (Ed.),
The Collected Papers of Gerhard Gentzen. North-Holland, Amsterdam, pp.
68–131. Translation of articles that appeared in 1934-35.
Girard, J.Y., 1971. Une extension de l'interprétation de Gödel à l'analyse, et son
application à l'élimination des coupures dans l'analyse et la théorie des types,
in: Fenstad, J.E. (Ed.), 2nd Scandinavian Logic Symposium. North-Holland,
Amsterdam, pp. 63–92.
Girard, J.Y., 1986. The system F of variable types: Fifteen years later. Theo-
retical Computer Science 45, 159–192.
Gödel, K., 1929. Über die Vollständigkeit des Logikkalküls. Ph.D. thesis. Uni-
versität Wien.
Gödel, K., 1930. Die Vollständigkeit der Axiome des logischen Funktio-
nenkalküls. Monatshefte für Mathematik und Physik 37, 349–360.
Gödel, K., 1931. Über formal unentscheidbare Sätze der Principia Mathematica
und verwandter Systeme I. Monatshefte für Mathematik und Physik 38, 173–
198. English version in van Heijenoort (1967).
Gödel, K., 1936. Über die Länge von Beweisen, in: Ergebnisse eines Mathema-
tischen Kolloquiums, pp. 23–24. English translation "On the length of proofs"
in Kurt Gödel: Collected Works, Volume 1, pages 396–399, Oxford University
Press, 1986.
Goldfarb, W., 1981. The undecidability of the second-order unification problem.
Theoretical Computer Science 13, 225–230.
Gordon, M., Melham, T., 1993. Introduction to HOL: A Theorem-Proving
Environment for Higher-Order Logic. Cambridge University Press.
Gordon, M.J.C., Milner, R., Wadsworth, C.P., 1979. Edinburgh LCF. volume 78
of LNCS. Springer.
Gould, W.E., 1966. A Matching Procedure for ω-Order Logic. Technical Report,
Scientific Report No. 4, AFCRL.
Guard, J.R., 1964. Automated logic for semi-automated mathematics. Scientific
Report No. 1, AFCRL 64-411.
Guttmann, W., Struth, G., Weber, T., 2011. Automating algebraic methods
in Isabelle, in: Qin, S., Qiu, Z. (Eds.), Proc. of ICFEM 2011, Springer. pp.
617–632.
Hales, T., 2013. Mathematics in the Age of the Turing Machine. ArXiv e-prints
arXiv:1302.2898.
Harrison, J., 2009. HOL Light: An overview, in: Berghofer, S., Nipkow, T.,
Urban, C., Wenzel, M. (Eds.), Proceedings of TPHOLs 2009, Springer. pp.
60–66.
van Heijenoort, J., 1967. From Frege to Gödel: A Source Book in Mathematics,
1879-1931. Source books in the history of the sciences series. 3rd printing,
1997 ed., Harvard Univ. Press, Cambridge, MA.
Henkin, L., 1950. Completeness in the theory of types. Journal of Symbolic
Logic 15, 81–91.
Henkin, L., 1963. A theory of propositional types. Fundamenta Mathematicae,
323–344.
Henschen, L.J., 1972. N-sorted logic for automatic theorem-proving in higher-
order logic, in: Proceedings of the ACM Annual Conference - Volume 1,
ACM, New York, NY, USA. pp. 71–81. URL: http://doi.acm.org/10.
1145/800193.805823, doi:10.1145/800193.805823.
Holmes, M.R., Alves-Foss, J., 2001. The Watson theorem prover. J. Autom.
Reasoning 26, 357–408.
Huet, G., 1973a. The undecidability of unification in third order logic. Infor-
mation and Control 22, 257–267.
Huet, G., 1975. A unification algorithm for typed λ-calculus. Theoretical Com-
puter Science 1, 27–57.
Huet, G., Lang, B., 1978. Proving and applying program transformations ex-
pressed with second-order patterns. Acta Informatica 11, 31–55.
Huet, G.P., 1972. Constrained Resolution: A Complete Method for Higher
Order Logic. Ph.D. thesis. Case Western Reserve University.
Huet, G.P., 1973b. A mechanization of type theory, in: Proceedings of the 3rd
International Joint Conference on Artificial Intelligence, pp. 139–146.
Hurd, J., 2003. First-order proof tactics in higher-order logic theorem provers,
in: Design and Application of Strategies/Tactics in Higher Order Logics,
number NASA/CP-2003-212448 in NASA Technical Reports, pp. 56–68.
Jensen, D.C., Pietrzykowski, T., 1976. Mechanizing omega-order type theory
through unification. Theor. Comput. Sci. 3, 123–171. doi:http://dx.doi.
org/10.1016/0304-3975(76)90021-9.
Kaliszyk, C., Urban, J., 2012. Learning-assisted automated reasoning with
flyspeck. CoRR abs/1211.7012.
Kaufmann, M., Moore, J.S., 1997. An industrial strength theorem prover for a
logic based on Common Lisp. IEEE Trans. Software Eng. 23, 203–213.
Kerber, M., 1991. How to prove higher order theorems in first order logic, in:
Mylopoulos, J., Reiter, R. (Eds.), Proc. of IJCAI-12, Morgan Kaufmann. pp.
137–142.
Kerber, M., 1994. On the translation of higher-order problems into first-order
logic, in: Proc. of ECAI, pp. 145–149.
Kleene, S., Rosser, J., 1935. The inconsistency of certain formal logics. Annals
of Mathematics 36, 630–636.
Kohlhase, M., 1994. A Mechanization of Sorted Higher-Order Logic Based on
the Resolution Principle. Ph.D. thesis. Saarland University.
Leivant, D., 1994. Higher-order logic, in: Gabbay, D.M., Hogger, C.J., Robin-
son, J.A. (Eds.), Handbook of Logic in Artificial Intelligence and Logic Pro-
gramming. Oxford University Press. volume 2, pp. 229–321.
Liang, C., Miller, D., 2009. Focusing and polarization in linear, intuitionistic,
and classical logics. Theoretical Computer Science 410, 4747–4768. doi:10.
1016/j.tcs.2009.07.041.
Liang, C., Nadathur, G., Qi, X., 2005. Choices in representing and reduction
strategies for lambda terms in intensional contexts. Journal of Automated
Reasoning 33, 89–132.
Lindblad, F. agsyHOL website. https://github.com/frelindb/agsyHOL.
Lucchesi, C.L., 1972. The Undecidability of Unification for Third Order Lan-
guages. Technical Report Report CSRR 2059. Dept of Applied Analysis and
Computer Science, University of Waterloo.
Martin-Löf, P., 1982. Constructive mathematics and computer programming,
in: Sixth International Congress for Logic, Methodology, and Philosophy of
Science, North-Holland, Amsterdam. pp. 153–175.
McDowell, R., Miller, D., 2002. Reasoning with higher-order abstract syntax in
a logical framework. ACM Trans. Comput. Log. 3, 80–136.
Meng, J., Paulson, L.C., 2008. Translating higher-order clauses to first-order
clauses. J. Autom. Reasoning 40, 35–60.
Miller, D., 1983. Proofs in Higher-Order Logic. Ph.D. thesis. Carnegie-Mellon
University.
Miller, D., 1987. A compact representation of proofs. Studia Logica 46, 347–370.
Miller, D., 1991. A logic programming language with lambda-abstraction, func-
tion variables, and simple unification. Journal of Logic and Computation 4,
497–536.
Miller, D., 1992. Unification under a mixed prefix. Journal of Symbolic Com-
putation 14, 321–358.
Miller, D., 2011. A proposal for broad spectrum proof certificates, in: Jouan-
naud, J.P., Shao, Z. (Eds.), CPP: First International Conference on Certified
Programs and Proofs, pp. 54–69.
Miller, D., Nadathur, G., 2012. Programming with Higher-Order Logic. Cam-
bridge University Press. doi:10.1017/CBO9781139021326.
Miller, D.A., Cohen, E.L., Andrews, P.B., 1982. A look at TPS, in: Loveland,
D.W. (Ed.), Sixth Conference on Automated Deduction, Springer, New York.
pp. 50–69.
Mints, G., 1999. Cut-elimination for simple type theory with an axiom of choice.
J. Symb. Log. 64, 479–485.
Mossakowski, T., Maeder, C., Lüttich, K., 2007. The heterogeneous tool set,
Hets, in: Proceedings of TACAS 2007, Springer. pp. 519–522.
Muskens, R., 2007. Intensional models for the theory of types. J. Symb. Log.
72, 98–118.
Nadathur, G., Linnell, N., 2005. Practical higher-order pattern unification with
on-the-fly raising, in: ICLP 2005: 21st International Logic Programming
Conference, Springer, Sitges, Spain. pp. 371–386.
Nadathur, G., Miller, D., 1988. An Overview of λProlog, in: Fifth
International Logic Programming Conference, MIT Press, Seattle. pp.
810–827. URL: http://www.lix.polytechnique.fr/Labo/Dale.Miller/
papers/iclp88.pdf.
Nederpelt, R.P., Geuvers, J.H., Vrijer, R.C.D. (Eds.), 1994. Selected Papers on
Automath. volume 133 of Studies in Logic and The Foundations of Mathe-
matics. North Holland.
Nipkow, T., 1989. Equational reasoning in Isabelle. Sci. Comput. Program. 12,
123–149.
Nipkow, T., 1993. Functional unification of higher-order patterns, in: Proceed-
ings of the Eighth Annual Symposium on Logic in Computer Science (LICS
’93), Montreal, Canada, June 19-23, 1993, IEEE Computer Society. pp. 64–74.
Nipkow, T., Paulson, L., Wenzel, M., 2002. Isabelle/HOL: A Proof Assistant
for Higher-Order Logic. Number 2283 in LNCS, Springer.
Owre, S., Rushby, J., Shankar, N., 1992. PVS: A Prototype Verification Sys-
tem, in: Kapur, D. (Ed.), Proceedings of the 11th International Conference on
Automated Deduction, Springer. pp. 748–752.
Padovani, V., 2000. Decidability of fourth-order matching. Mathematical Struc-
tures in Computer Science 10, 361–372.
Parikh, R.J., 1973. Some results on the length of proofs. Transactions of the
American Mathematical Society 177, 29–36.
Paulson, L.C., 1989. The foundation of a generic theorem prover. J. Autom.
Reasoning 5, 363–397.
Paulson, L.C., 1994. Isabelle - A Generic Theorem Prover (with a contribution
by T. Nipkow). volume 828 of LNCS. Springer.
Paulson, L.C., 1999. A generic tableau prover and its integration with Isabelle.
Journal of Universal Computer Science 5, 51–60.
Pfenning, F., 1987. Proof Transformations in Higher-Order Logic. Ph.D. thesis.
Carnegie Mellon University. 156 pp.
Pfenning, F., 1994. Elf: A meta-language for deductive systems (system de-
scription), in: Bundy, A. (Ed.), Automated Deduction - CADE-12, 12th Inter-
national Conference on Automated Deduction, Nancy, France, June 26 - July
1, 1994, Proceedings, Springer. pp. 811–815.
Pfenning, F., Schürmann, C., 1999. System description: Twelf - a meta-logical
framework for deductive systems, in: Ganzinger, H. (Ed.), Automated De-
duction - CADE-16, 16th International Conference on Automated Deduction,
Trento, Italy, July 7-10, 1999, Proceedings, Springer. pp. 202–206.
Pientka, B., Dunfield, J., 2010. Beluga: A framework for programming and
reasoning with deductive systems (system description), in: Giesl, J., Hähnle,
R. (Eds.), Automated Reasoning, 5th International Joint Conference, IJCAR
2010, Edinburgh, UK, July 16-19, 2010. Proceedings, Springer. pp. 15–21.
Pietrzykowski, T., 1973. A complete mechanization of second-order type theory.
J. ACM 20, 333–364. doi:http://doi.acm.org/10.1145/321752.321764.
Pietrzykowski, T., Jensen, D.C., 1972. A complete mechanization of ω-order
type theory, in: ACM ’72: Proceedings of the ACM annual conference, ACM
Press, New York, NY, USA. pp. 82–92. doi:http://doi.acm.org/10.1145/
800193.805824.
Pollack, R., 1994. The Theory of LEGO. Ph.D. thesis. University of Edinburgh.
Prawitz, D., 1968. Hauptsatz for higher order logic. Journal of Symbolic Logic
33, 452–457.
Quine, W.V.O., 1940. Mathematical Logic. Harvard University Press, Boston,
MA.
Ramsey, F.P., 1926. The foundations of mathematics, in: Proceedings of the
London Mathematical Society, pp. 338–384.
Reynolds, J.C., 1974. Towards a theory of type structure, in: Colloque sur la
Programmation, Paris, France, Springer, New York. pp. 408–425.
Richardson, J., Smaill, A., Green, I., 1998. System description: Proof planning
in higher-order logic with lambda-Clam, in: Kirchner, C., Kirchner, H. (Eds.),
Automated Deduction - CADE-15, 15th International Conference on Auto-
mated Deduction, Lindau, Germany, July 5-10, 1998, Proceedings, Springer.
pp. 129–133.
Robinson, J.A., 1969. Mechanizing higher-order logic, in: Machine Intelligence
4. Edinburgh University Press, pp. 151–170.
Robinson, J.A., 1970. A note on mechanizing higher order logic, in: Machine
Intelligence 5. Edinburgh University Press, pp. 121–135.
Russell, B., 1902. Letter to Frege. Translated in van Heijenoort (1967).
Russell, B., 1903. The principles of mathematics. Cambridge University Press,
Cambridge, England.
Russell, B., 1908. Mathematical logic as based on the theory of types. American
Journal of Mathematics 30, 222–262.
Schulz, S., 2002. E – a brainiac theorem prover. AI Communications 15, 111–
126.
Schütte, K., 1960. Semantical and syntactical properties of simple type theory.
Journal of Symbolic Logic 25, 305–326.
Shapiro, S., 1985. Second-order languages and mathematical practice. Jour-
nal of Symbolic Logic 50, 714–742. URL: http://www.jstor.org/stable/
2274326.
Smullyan, R.M., 1963. A unifying principle for quantification theory. Proc. Nat.
Acad. Sciences 49, 828–832.
Snyder, W., Gallier, J.H., 1989. Higher order unification revisited: Complete
sets of transformations. Journal of Symbolic Computation 8, 101–140.
Spenger, C., Dams, M., 2003. On the structure of inductive reasoning: Circular
and tree-shaped proofs in the µ-calculus, in: Gordon, A. (Ed.), FOSSACS’03,
Springer. pp. 425–440.
Stirling, C., 2009. Decidability of higher-order matching. Logical Methods in
Computer Science 5, 1–52.
Sutcliffe, G., 2009. The TPTP problem library and associated infrastructure.
Journal of Automated Reasoning 43, 337–362.
Sutcliffe, G., Benzmüller, C., 2010. Automated reasoning in higher-order logic
using the TPTP THF infrastructure. Journal of Formalized Reasoning 3,
1–27. URL: http://christoph-benzmueller.de/papers/J22.pdf.
Tait, W.W., 1966. A nonconstructive proof of Gentzen’s Hauptsatz for second
order predicate logic. Bulletin of the American Mathematical Society 72,
980–983.
Takahashi, M., 1967. A proof of cut-elimination theorem in simple type theory.
Journal of the Mathematical Society of Japan 19, 399–410.
Takeuti, G., 1953. On a generalized logic calculus. Japanese Journal of Mathe-
matics 23, 39–96. Errata: ibid, vol. 24 (1954), 149–156.
Takeuti, G., 1960. An example on the fundamental conjecture of GLC. Journal
of the Mathematical Society of Japan 12, 238–242.
Takeuti, G., 1975. Proof Theory. volume 81 of Studies in Logic and the
Foundations of Mathematics. Elsevier. URL: http://www.sciencedirect.
com/science/article/pii/S0049237X08706037, doi:http://dx.doi.org/
10.1016/S0049-237X(08)70603-7.
Westerståhl, D., 2011. Generalized quantifiers, in: Zalta, E.N. (Ed.), The Stan-
ford Encyclopedia of Philosophy. summer 2011 ed.
Whitehead, A.N., Russell, B., 1910, 1912, 1913. Principia Mathematica, 3
vols. Cambridge: Cambridge University Press. Second edition, 1925 (Vol.
1), 1927 (Vols 2, 3). Abridged as Principia Mathematica to *56, Cambridge:
Cambridge University Press, 1962.
Wirth, C.P., 2004. Descente infinie + deduction. Logic Journal of the IGPL 12,
1–96.

Supplementary resource (1)

... The work presented in this article constitutes an important addition to the pluralist LogiKEy approach and methodology [13,11]. LogiKEy's unifying formal framework is fundamentally based on SSEs of 'object' logics (and their combinations) in HOL, enabling the provision of powerful tool support [12]: off-the-shelf theorem provers and model finders for HOL (as provided in Isabelle/HOL) are assisting the LogiKEy knowledge engineer to flexibly experiment with underlying logics and their combinations, with general and domain knowledge, and with concrete use cases-all at the same time. Continuous improvements of these off-the-shelf provers, without further ado, leverage the reasoning performance in LogiKEy. ...
... e.g. [10,12] and the references therein. ...
Preprint
Full-text available
A shallow semantical embedding for public announcement logic with relativized common knowledge is presented. This embedding enables the first-time automation of this logic with off-the-shelf theorem provers for classical higher-order logic. It is demonstrated (i) how meta-theoretical studies can be automated this way, and (ii) how non-trivial reasoning in the target logic (public announcement logic), required e.g. to obtain a convincing encoding and automation of the wise men puzzle, can be realized. Key to the presented semantical embedding is that evaluation domains are modeled explicitly and treated as an additional parameter in the encodings of the constituents of the embedded target logic; in previous related works, e.g. on the embedding of normal modal logics, evaluation domains were implicitly shared between meta-logic and target logic. The work presented in this article constitutes an important addition to the pluralist LogiKEy knowledge engineering methodology, which enables experimentation with logics and their combinations, with general and domain knowledge, and with concrete use cases -- all at the same time.
... In keeping with standard superposition, we consider only formulas in conjunctive normal form, without explicit quantifiers or Boolean type. We use Henkin semantics [15,31,35], as opposed to the standard semantics that is commonly considered the foundation of the HOL systems [33]. Both semantics are compatible with the notion of provability employed by the HOL systems. ...
... Other approaches are based on analytic tableaux [8,46,47,55], connections [2], sequents [50], and satisfiability modulo theories (SMT) [9]. Andrews [3] and Benzmüller and Miller [15] provide excellent surveys of higher-order automation. ...
Article
Full-text available
We designed a superposition calculus for a clausal fragment of extensional polymorphic higher-order logic that includes anonymous functions but excludes Booleans. The inference rules work on $$\beta \eta $$ β η -equivalence classes of $$\lambda $$ λ -terms and rely on higher-order unification to achieve refutational completeness. We implemented the calculus in the Zipperposition prover and evaluated it on TPTP and Isabelle benchmarks. The results suggest that superposition is a suitable basis for higher-order reasoning.
... To keep this article sufficiently self-contained we briefly introduce a classical higherorder logic, termed HOL; more detailed information on HOL and its automation can be found in the literature (P. B. Andrews 1972a,b; Benzmüller and P. Andrews 2019; Benzmüller et al. 2004;Benzmüller and Miller 2014). ...
Preprint
Full-text available
The logico-pluralist LOGIKEY knowledge engineering methodology and framework is applied to the modelling of a theory of legal balancing in which legal knowledge (cases and laws) is encoded by utilising context-dependent value preferences. The theory obtained is then used to formalise, automatically evaluate, and reconstruct illustrative property law cases (involving appropriation of wild animals) within the Isabelle/HOL proof assistant system, illustrating how LOGIKEY can harness interactive and automated theorem proving technology to provide a testbed for the development and formal verification of legal domain-specific languages and theories. Modelling value-oriented legal reasoning in that framework, we establish novel bridges between latest research in knowledge representation and reasoning in non-classical logics, automated theorem proving, and applications in legal reasoning.
... Native higher-order reasoning was pioneered by Robinson [39], Andrews [1], and Huet [24]. Andrews [2] and Benzmüller and Miller [12] provide excellent surveys. TPS, by Andrews et al. [3], was based on expansion proofs and lets users specify proof outlines. ...
Article
Full-text available
Decades of work have gone into developing efficient proof calculi, data structures, algorithms, and heuristics for first-order automatic theorem proving. Higher-order provers lag behind in terms of efficiency. Instead of developing a new higher-order prover from the ground up, we propose to start with the state-of-the-art superposition prover E and gradually enrich it with higher-order features. We explain how to extend the prover’s data structures, algorithms, and heuristics to $$\lambda $$ λ -free higher-order logic, a formalism that supports partial application and applied variables. Our extension outperforms the traditional encoding and appears promising as a stepping stone toward full higher-order logic.
Chapter
The last decade has seen a resurgence of interest in the use of higher-order logics in metaphysics. Characteristic of this trend is the use of higher-order languages to formulate metaphysical views and arguments. We call such uses of higher-order logic in metaphysics “higher-order metaphysics”. Often, higher-order quantifiers are used to formalize talk of propositions, properties and relations. This is the first volume of papers on this field, comprising 17 new essays by many of the leading contributors. The articles in this volume introduce and motivate higher-order metaphysics, discuss different choices of higher-order languages and logics, apply higher-order logic to a number of central metaphysical topics, discuss the history of higher-order logic in metaphysics, and debate the arguments for and against using higher-order logic in metaphysics.
Article
An approach for encoding abstract dialectical frameworks and their semantics into classical higher-order logic is presented. Important properties and semantic relationships are formally encoded and proven using the proof assistant Isabelle/HOL. This approach allows for the computer-assisted analysis of abstract dialectical frameworks using automated and interactive reasoning tools within a uniform logic environment. Exemplary applications include the formal analysis and verification of meta-theoretical properties, and the generation of interpretations and extensions under specific semantic constraints.
Chapter
This paper describes an evaluation of Automated Theorem Proving (ATP) systems on problems taken from the QMLTP library of first-order modal logic problems. Principally, the problems are translated to higher-order logic in the TPTP language using an embedding approach, and solved using higher-order logic ATP systems. Additionally, the results from native modal logic ATP systems are considered, and compared with those from the embedding approach. The findings are that the embedding process is reliable and successful, that the choice of backend ATP system can significantly impact the performance of the embedding approach, that native modal logic ATP systems outperform the embedding approach, and that the embedding approach can cope with a wider range of modal logics than the native modal systems considered. Keywords: non-classical logics, quantified modal logics, higher-order logic, automated theorem proving.
Thesis
Safe programming as well as most proof systems rely on typing. The more expressive a type system is, the more its types can be used to encode invariants, which are then verified mechanically through type-checking procedures. Dependent types extend simple types by allowing types to depend on values; for instance, they allow one to define the type of lists of a given length. Predicate subtyping is another extension of simple type theory, in which types can be defined by predicates. A predicate subtype, usually written {x: A | P(x)}, is inhabited by the elements t of type A for which P(t) is true. This extension provides an extremely rich and intuitive type system, which is at the heart of the proof assistant PVS, at the cost of making type checking undecidable. This work is dedicated to the encoding of predicate subtyping in Dedukti, a logical framework with computation rules. We begin with the encoding of explicit predicate subtyping, in which terms of type {x: A | P(x)} and terms of type A are syntactically different. We show that any derivable judgement of predicate subtyping can be encoded into a derivable judgement of the logical framework. Predicate subtyping is, however, often used implicitly, with no syntactic difference between terms of type A and terms of type {x: A | P(x)}. We enrich our logical framework with a term refiner that can add these syntactic markers; this refiner can be used to turn judgements typed with implicit predicate subtyping into explicit judgements. The proof assistant PVS uses predicate subtyping extensively, and we show how its standard library can be exported to Dedukti. Because PVS stores only proof traces rather than complete proof terms, the penultimate section sketches a procedure to generate complete proof terms from these traces. The last section presents the architecture of a repository dedicated to the exchange of formal proofs, whose goal is to categorise and store proofs encoded in Dedukti so as to promote interoperability.
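The shape of a predicate subtype {x: A | P(x)} can be illustrated with a small dynamic sketch (hypothetical names, not PVS or Dedukti syntax): membership in the subtype is membership in the base type plus satisfaction of the predicate. PVS discharges such conditions statically through generated proof obligations, which is where the undecidability mentioned above enters; here they are simply tested at runtime.

```python
# Hypothetical sketch: a predicate subtype {x: A | P(x)} modelled as a
# runtime-checked refinement of a base type.  PVS checks these conditions
# statically via proof obligations; this sketch only tests membership.

def refine(base, pred):
    """Return a membership test for the subtype {x: base | pred(x)}."""
    def check(x):
        return isinstance(x, base) and pred(x)
    return check

# {n: int | n >= 0}: the "natural number" subtype of int
Nat = refine(int, lambda n: n >= 0)

# {n: int | n != 0}: division by a NonZero value can never fail --
# exactly the kind of invariant predicate subtyping lets types encode.
NonZero = refine(int, lambda n: n != 0)
```

The point of the static discipline is that a term of type `NonZero` may be passed to a division without any runtime check; the price, as the abstract notes, is that deciding such memberships in general requires arbitrary theorem proving.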
Article
A shallow semantical embedding for public announcement logic (PAL) with relativized common knowledge is presented. This embedding enables the first-time automation of this logic with off-the-shelf theorem provers for classical higher-order logic. It is demonstrated (i) how meta-theoretical studies can be automated this way and (ii) how non-trivial reasoning in the target logic (PAL), required for instance to obtain a convincing encoding and automation of the wise men puzzle, can be realized. Key to the presented semantical embedding is that evaluation domains are modelled explicitly and treated as an additional parameter in the encodings of the constituents of the embedded target logic; in previous related works, e.g. on the embedding of normal modal logics, evaluation domains were implicitly shared between meta-logic and target logic. The work presented in this article constitutes an important addition to the pluralist LogiKEy knowledge engineering methodology, which enables experimentation with logics and their combinations, with general and domain knowledge, and with concrete use cases—all at the same time.
Chapter
A shallow semantical embedding of a dyadic deontic logic by Carmo and Jones in classical higher-order logic is presented. The embedding is proven sound and complete, that is, faithful. This result provides the theoretical foundation for the implementation and automation of dyadic deontic logic within off-the-shelf higher-order theorem provers and proof assistants. To demonstrate the practical relevance of our contribution, the embedding has been encoded in the Isabelle/HOL proof assistant. As a result a sound and complete (interactive and automated) theorem prover for the dyadic deontic logic of Carmo and Jones has been obtained. Experiments have been conducted which illustrate how the exploration and assessment of meta-theoretical properties of the embedded logic can be supported with automated reasoning tools integrated with Isabelle/HOL.
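The idea of a shallow semantical embedding can be conveyed with a toy possible-worlds sketch (illustrative only; the names, worlds, and accessibility relation below are assumptions, not the paper's Isabelle/HOL encoding of Carmo and Jones's logic): a modal formula becomes a function from worlds to booleans, and the modal operators quantify over an accessibility relation in the meta-logic.

```python
# Illustrative sketch of a shallow possible-worlds embedding: formulas are
# world-indexed predicates; box/diamond quantify over accessibility R.
# WORLDS and R are assumed toy data, not any particular paper's model.

WORLDS = {"w1", "w2"}
R = {("w1", "w2"), ("w2", "w2")}        # an assumed accessibility relation

def mnot(phi):      return lambda w: not phi(w)
def mand(phi, psi): return lambda w: phi(w) and psi(w)
def box(phi):       return lambda w: all(phi(v) for (u, v) in R if u == w)
def dia(phi):       return lambda w: any(phi(v) for (u, v) in R if u == w)

def valid(phi):
    """Validity = truth at every world."""
    return all(phi(w) for w in WORLDS)

p = lambda w: w == "w2"                 # an atomic proposition, true only at w2
```

Because every connective is just a higher-order function, an off-the-shelf higher-order prover can reason about the embedded logic directly — which is the practical payoff the abstract describes.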
Thesis
Expansion trees are defined as generalizations of Herbrand instances for formulas in a nonextensional form of higher-order logic based on Church's simple theory of types. Such expansion trees can be defined with or without the use of Skolem functions. These trees store the substitution terms, and either the critical variables or the Skolem terms, used to instantiate quantifiers in the original formula and those resulting from instantiations. An expansion tree is called an expansion tree proof (ET-proof) if it encodes a tautology and, in the form not using Skolem functions, the 'imbedding' relation among the critical variables is acyclic. The relative completeness result for expansion tree proofs not using Skolem functions, i.e. that if A is provable in higher-order logic then A has such an expansion tree proof, is based on Andrews' formulation of Takahashi's proof of the cut-elimination theorem for higher-order logic. If the occurrences of Skolem functions in instantiation terms are restricted appropriately, the use of Skolem functions in place of critical variables is equivalent to the requirement that the imbedding relation is acyclic. This fact not only resolves the open question of what constitutes a sound definition of skolemization in higher-order logic but also provides a direct, syntactic proof of its correctness. Since subtrees of expansion trees are also expansion trees (or their duals), and since expansion trees store substitution terms and critical variables explicitly, ET-proofs can be directly converted into sequential and natural deduction proofs. A naive translation, however, often produces proofs that contain many redundancies and use implicational lines in an awkward fashion. An improved translation process is presented: it produces only focused proofs, in which much of the redundancy has been eliminated and backchaining on implicational lines is selected automatically whenever applicable. The information necessary to construct focused proofs is provided by a certain connection scheme, called a mating, of the boolean atoms within the tautology encoded by an ET-proof.
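The core mechanism — an expansion tree records the substitution terms that instantiate each quantifier, reducing validity to a propositional tautology check on the resulting "deep formula" — can be sketched on a tiny first-order example (names below are illustrative, not the thesis's machinery):

```python
# Illustrative sketch: for the valid formula  P(c) -> ∃x. P(x),  an
# expansion tree would record the substitution term c for the existential.
# The resulting deep formula  P(c) -> P(c)  is checked to be a
# propositional tautology by brute-force truth tables over its atoms.

from itertools import product

def is_tautology(formula, atoms):
    """Evaluate `formula` (a function of an atom->bool map) at all rows."""
    return all(formula(dict(zip(atoms, vals)))
               for vals in product([True, False], repeat=len(atoms)))

# Expanding ∃x. P(x) with the substitution term c leaves one atom, "P(c)":
right_expansion = lambda v: (not v["P(c)"]) or v["P(c)"]

# Expanding with an unrelated term d instead does NOT yield a tautology:
wrong_expansion = lambda v: (not v["P(c)"]) or v["P(d)"]
```

Choosing the right substitution terms is exactly the hard part that the stored expansion terms make explicit; the tautology check itself is then purely propositional.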
Technical Report
This report defines and describes six formal systems of logic, S₁ through S₆, for which proof procedures or partial proof procedures are readily contrived. The systems S₁ through S₄ are considered in order to simplify the descriptions of S₅ and S₆. System S₁ is a fragment of the classical propositional calculus whose theorems are those tautologies which can be shown tautologous by assuming them to take on the value falsehood and arriving at an inconsistent assignment to the variables without using any 'branching' rules. This system suggests an efficient means of handling the propositional connectives in the later systems. System S₂ is the completion of S₁; S₂ has 'branching' rules which correspond to treating certain propositional variables by cases. This differs from Gentzen's treatment by Sequenzen in that Gentzen's 'branching' rules consider the value of the antecedent or consequent of a formula by cases. System S₃ is a fragment of the first-order predicate-function calculus. In S₃, formulas are proved by contradiction: the quantifiers of the denial are stripped by putting the denial in miniscope form and replacing them by Skolem functors, reminiscent of the Herbrand technique. However, S₃ uses a process called matching to consider only reasonable Herbrand disjuncts.
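The S₁ idea — assume the candidate formula takes the value falsehood, propagate the values this forces, and conclude tautologyhood if some variable is forced both true and false without any case split — can be sketched as follows (a toy reconstruction under the stated assumptions, not the report's actual procedure; connectives whose assumed value forces nothing are omitted, since handling them is exactly what the 'branching' rules of S₂ add):

```python
# Toy sketch of the S1 idea: propagate the values forced by assuming the
# formula false; a contradictory forced assignment, reached with no case
# splits, shows the formula is a tautology.  Formulas are nested tuples:
# ("var", "p"), ("not", f), ("imp", f, g), ("or", f, g), ("and", f, g).

def propagate(formula, value, env):
    """Force `formula` to `value`; return False on a contradiction."""
    op = formula[0]
    if op == "var":
        name = formula[1]
        if name in env and env[name] != value:
            return False                        # inconsistent assignment
        env[name] = value
        return True
    if op == "not":
        return propagate(formula[1], not value, env)
    if op == "imp" and value is False:          # A -> B false forces A=T, B=F
        return (propagate(formula[1], True, env)
                and propagate(formula[2], False, env))
    if op == "or" and value is False:           # A or B false forces A=F, B=F
        return (propagate(formula[1], False, env)
                and propagate(formula[2], False, env))
    if op == "and" and value is True:           # A and B true forces both true
        return (propagate(formula[1], True, env)
                and propagate(formula[2], True, env))
    raise ValueError("would require a branching rule (outside this fragment)")

def s1_tautology(formula):
    """Tautologous iff assuming falsehood leads to a contradiction."""
    return not propagate(formula, False, {})
```

For example, assuming `p -> p` false forces p to be both true and false, so it is recognized as a tautology with no branching; `p -> q` yields the consistent refuting assignment p=True, q=False.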