
AFFINE LOGIC FOR CONSTRUCTIVE MATHEMATICS

Published online by Cambridge University Press:  21 July 2022

MICHAEL SHULMAN*
Affiliation:
Department of Mathematics, University of San Diego, San Diego, CA 92110, USA. E-mail: shulman@sandiego.edu

Abstract

We show that numerous distinctive concepts of constructive mathematics arise automatically from an “antithesis” translation of affine logic into intuitionistic logic via a Chu/Dialectica construction. This includes apartness relations, complemented subsets, anti-subgroups and anti-ideals, strict and non-strict order pairs, cut-valued metrics, and apartness spaces. We also explain the constructive bifurcation of some classical concepts using the choice between multiplicative and additive affine connectives. Affine logic and the antithesis construction thus systematically “constructivize” classical definitions, handling the resulting bookkeeping automatically.

Type: Articles
Copyright: © The Author(s), 2022. Published by Cambridge University Press on behalf of The Association for Symbolic Logic

1 Introduction

One of the explicit motivations of Girard’s linear logic [Reference Girard19] was to recover an involutory “classical” negation while retaining “constructive content”:

…the linear negation …is a constructive and involutive negation; by the way, linear logic works in a classical framework, while being more constructive than intuitionistic logic. [Reference Girard19, p. 3]

One might therefore expect that over the past three decades some practicing constructive mathematicians would have adopted linear logic instead of intuitionistic logic;Footnote 1 but this does not seem to be the case. One might conjecture many reasons for this. However, I will instead argue that linear logic has nevertheless been present implicitly in constructive mathematics, going back all the way to Brouwer.

Specifically, I will show that there are aspects of constructive mathematical practice that are better explained by linear logic than by intuitionistic logic. The non-involutory intuitionistic negation often leads constructive mathematicians to study both a classical concept and its formal De Morgan dual, such as equality and apartness, subgroups and antisubgroups, topological spaces and apartness spaces, and so on. We will show that such “dual pairs of propositions” can be regarded as single propositions in a model of linear logic, or more specifically affine logic, which we call the antithesis model. Notions such as apartness relations then arise by writing a classical definition in affine logic and interpreting it in this model.

The antithesis model is a special case of a Chu or Dialectica construction [Reference Chu13, Reference de Paiva41, Reference Shulman46] (the two constructions coincide in this special case), applied to the algebra of intuitionistic propositions. It is well-known that Chu and Dialectica constructions yield models of linear and affine logic (see for instance [Reference Barr7, Reference Oliva38, Reference de Paiva39]); the novelty here is in how the logic of this particular special case relates to constructive mathematics.

Since it constructs a model of affine logic from any model of intuitionistic logic in a functorial way, the antithesis model can also be used as a purely syntactic translation, transforming any definition, theorem, or proof in affine logic into one in intuitionistic logic; we call this the antithesis translation. This is analogous to other translations such as the Gödel–Gentzen double-negation translation, which constructs a model of classical logic from a model of intuitionistic logic, and therefore transforms classical theorems into intuitionistic ones; or the Girard translation, which constructs a model of intuitionistic logic from a model of linear logic, and therefore transforms intuitionistic theorems into linear ones.

Importantly, however, unlike the Gödel–Gentzen and Girard translations, the antithesis translation is not conservative. Indeed, to a linear logician, the antithesis model looks quite degenerate, particularly in the behavior of its exponentials. Thus, we should not view the antithesis translation as an “explanation” or “embedding” of affine logic into intuitionistic logic. Rather, we view it as giving a way to treat affine logic as a “high-level” or “domain-specific” language that can be “compiled” into intuitionistic logic. (Because the antithesis translation is a one-sided inverse of the Girard translation, we can in fact view affine logic as a strict extension of intuitionistic logic: the high-level language includes an “escape to assembler.”) In other words, the antithesis translation is a tool primarily for the intuitionistic mathematician, not the linear one.

This tool has several possible uses. Firstly, it formalizes a technique for “constructivizing” classical definitions: write them in affine logic and apply the antithesis translation. This method often yields a better result than the usual one of simply regarding the classical connectives as having their intuitionistic meanings; the latter frequently requires manual “tweaking” to become intuitionistically sensible.

We also obtain a uniform explanation for some instances of the fact that “constructivizing” is multi-valued. Namely, in affine logic the connectives “and” and “or” bifurcate into “additive” and “multiplicative” versions. Thus, many classical definitions can be written in affine logic in more than one way, by making different choices about whether to interpret the classical connectives as additive or multiplicative affine ones. Under the antithesis translation, this then leads to different intuitionistic versions of a classical definition, many of which occur naturally in examples and have been already written down “manually” by constructive mathematicians. (There are, of course, also other reasons for the constructive multifurcation of concepts.)

Roughly speaking, the additive disjunction “P or Q” corresponds, under the antithesis translation, to the intuitionistic disjunction, while the multiplicative disjunction “P par Q” corresponds to the intuitionistic pattern “if not P, then Q; and if not Q, then P” that is often used constructively when the intuitionistic disjunction is too strong. For instance, in intuitionistic logic the rational numbers are a field in the strong “geometric” sense that every element is either zero or invertible. The real numbers are not a field in this sense, but they are a field in the weaker “Heyting” sense that every nonzeroFootnote 2 element is invertible and every noninvertible element is zero. This condition is the image under the antithesis translation of the affine statement that every element is either zero par invertible. Similarly, for real numbers $x\le y$ is not equivalent to “ $x=y$ or $x<y$ ,” but it is equivalent to the antithesis translation of “ $x=y$ par $x<y$ .” In Sections 9 and 10 we will see that a systematic use of par can solve a few tricky problems in intuitionistic constructive mathematics, such as defining a notion of “metric space” that includes the Hausdorff metric, or a union axiom for a “closure space” that is not unreasonably strong.

Secondly, we can also apply the antithesis translation to proofs. Many classical proofs are also affinely valid with little change; surprisingly often, the lack of contraction is not a problem when definitions are formulated appropriately. Hence, the antithesis translation turns such proofs into intuitionistic proofs of theorems involving apartness relations, antisubgroups, and so on: the process of “turning everything around” to deal with such concepts can be automated. This tends to work for classical proofs that may use proof by contradiction (or equivalently the law of double negation) as long as they avoid the law of excluded middle. In intuitionistic logic, the laws of double negation and excluded middle are equivalent, but linear and affine logic disentangle (some versions of) them.

Having an automatic way to produce intuitionistic definitions and proofs is more than just a convenience: it can prevent or correct mistakes. Working explicitly with apartness relations and their ilk is tedious and error-prone: it’s easy to omit one of the contrapositive conditions, or forget to check that a function is strongly continuous or that a subset is strongly extensional. Moreover, it’s not always obvious exactly what the axioms on an apartness structure should be, but the antithesis translation always seems to give the right answer (or at least a right answer).

A third possible use of the antithesis translation is more speculative. Rather than viewing affine logic and the antithesis translation as tools for doing intuitionistic constructive mathematics, one might imagine instead a constructive mathematics (in the informal sense of “mathematics with constructive content”) that uses only affine logic. The antithesis interpretation would then be a guide to the correct way to formulate concepts in affine constructive mathematics. It is not yet clear how feasible this idea is,Footnote 3 but we will make some remarks about it in Section 11.

In this paper, we will mainly focus on the first use: translating definitions. We include a few proofs, but for the most part we leave the development of “affine constructive mathematics” for future work.

1.1 Outline

In Section 2 we describe our viewpoint on affine logic informally, analogously to the BHK interpretation of intuitionistic logic; no prior familiarity with affine or linear logic is required. Then in Sections 3 and 4 we formalize this interpretation both semantically, as a Chu or Dialectica construction, and syntactically, as a translation between propositional, first-order, and higher-order logics. Many introductions to linear logic present it as a logic of “resources” or “games”; we view it as a logic of mathematics, like intuitionistic logic and classical logic, which is designed to be “constructive” in a different way than intuitionistic logic.

The rest of the paper consists of “case studies,” showing that rewriting classical definitions directly in affine logic and passing across this antithesis translation yields well-known notions in intuitionistic constructive mathematics. In Sections 5 and 6 we treat sets and functions, then algebra in Section 7, order relations in Section 8, real numbers in Section 9, and topology in Section 10. Finally, in Section 11 we speculate a bit about how one might motivate and explain an “affine constructive mathematics” on purely philosophical grounds.

2 A meaning explanation

Intuitionistic logic is often explained informally (e.g., in [Reference Troelstra and van Dalen49]) by the so-called Brouwer–Heyting–Kolmogorov (BHK) interpretation, which explains the meaning of the logical connectives and quantifiers “pragmatically” in terms of what counts as a proof of them. For instance, a few of the rules are:

  • A proof of $P\to Q$ is a method converting any proof of P into a proof of Q.

  • A proof of $P\lor Q$ is either a proof of P or a proof of Q.

  • A proof of $\neg P$ is a method converting any proof of P into a proof of an absurdity.

This leads to the rules of intuitionistic logic; for instance, we cannot prove $P\lor \neg P$ in general since we cannot decide whether to give a proof of P or a proof of $\neg P$ .

Practicing constructive mathematicians, however, have found that it is often not sufficient to know what counts as a proof of a statement: it is often just as important, if not more so, to know what counts as a refutation of a statement. For instance, while it is of course essential to know that two real numbers are equal if they agree to any desired degree of approximation, it is also essential to know that they are unequal if there is some finite degree of approximation at which they disagree. If real numbers are defined using Cauchy sequences $x,y:\mathbb {N}\to \mathbb {Q}$ with specified rate of convergence $|x_n-x_m|< {\textstyle \frac {1}{n}}+{\textstyle \frac {1}{m}}$ , then we want to separately define

$$\begin{align*} (x=y) &:\equiv \forall n.\, |x_n-y_n|\le \textstyle\frac{2}{n}, & (x\neq y) &:\equiv \exists n.\, |x_n-y_n|> \textstyle\frac{2}{n}. \end{align*}$$
In classical logic, a refutation of P means a proof of $\neg P$ , and these two definitions are each other’s negations. But intuitionistic negation is not involutive, and this “ $x\neq y$ ” is not the logical negation of $x=y$ . Thus, when constructive mathematics is done in intuitionistic logic—as it usually is—we must define inequality as a new apartness relation with which the set of reals is equipped. In Bishop’s words:

It is natural to want to replace this negativistic definition [the logical negation of equality] by something more affirmative…Brouwer himself does just this for the real number system, introducing an affirmative and stronger relation of inequality…Experience shows that it is not necessary to define inequality in terms of negation. For those cases in which an inequality relation is needed, it is better to introduce it affirmatively…[Reference Bishop and Bridges10, p. 10]

Similar things happen all throughout constructive mathematics. In addition to knowing when an element is in a subgroup, we need to know when an element is not in a subgroup; thus we introduce antisubgroups (and similarly anti-ideals, etc.). In addition to knowing when a point is in the interior of a set, we need to know when it is in the exterior of a set; thus we introduce apartness spaces [Reference Bridges and Vîţă11].

To repeat, the problem is that the BHK interpretation and resulting intuitionistic logic privilege proofs over refutations. In the words of Patterson [Reference Patterson42]:

Once we take on the Brouwerian view that proofs should be constructions, both negation and “falsity” disappear because absurdity is not the same thing as demonstratively false. This is because a construction leading to a contradiction does not mean that we can provide a counterexample.…

In intuitionistic logic we have taken “true” to be primitive as well as “absurdity”.…Thus, a “proof leading to absurdity” is a derived notion of falsity and the only one afforded to us in intuitionistic logic.

Negative information can be just as constructive as positive information.…The correct way to use negative information in a constructive setting would be to do the “opposite” or “backward” construction in some way. [Reference Patterson42, pp. 8–9]

This suggests a BHK-like explanation of logical connectives in terms of both what counts as a proof and what counts as a refutation. We now explore what such an explanation might look like. The only requirement we impose is that no formula should be both provable and refutable (but see Remarks 3.9 and 3.13).

We start with the following explanations of conjunction and disjunction, which we denote $\sqcap $ and $\sqcup $ rather than $\land $ and $\lor $ as a warning that they will not behave quite like the usual intuitionistic or classical connectives.

  • A proof of $P\sqcap Q$ is a proof of P together with a proof of Q.

  • A refutation of $P\sqcap Q$ is either a refutation of P or a refutation of Q.

  • A proof of $P\sqcup Q$ is either a proof of P or a proof of Q.

  • A refutation of $P\sqcup Q$ is a refutation of P together with a refutation of Q.

These “proof” clauses are the usual BHK ones, while the “refutation” clauses are natural-seeming De Morgan duals. The most natural clauses for negation are:

  • A proof of $\smash {{P}^{\perp }}$ is a refutation of P.

  • A refutation of $\smash {{P}^{\perp }}$ is a proof of P.

This negation is involutive, $\smash {{P}^{\perp \perp }} \equiv P$ , with strict De Morgan duality: $\smash {{(P\sqcap Q)}^{\perp }} \equiv \smash {{P}^{\perp }} \sqcup \smash {{Q}^{\perp }}$ and $\smash {{(P\sqcup Q)}^{\perp }} \equiv \smash {{P}^{\perp }} \sqcap \smash {{Q}^{\perp }}$ . ( $\equiv $ denotes inter-derivability.)

A little thought suggests that one natural explanation of implication (which we indulge in some foreshadowing by writing as ) is:

  • A proof of is a method converting any proof of P into a proof of Q, together with a method converting any refutation of Q into a refutation of P.

  • A refutation of is a proof of P together with a refutation of Q.

Remark 2.1. This does not mean that when we prove an implication we must also prove its contrapositive explicitly. The “proofs” in these explanations, like those in the BHK interpretation, are not the “proofs” that a mathematician writes in a paper (or even formalizes on a computer). Rather they are “verifications,” “fully normalized proofs,” or “data that must be extractable from a proof.” Not every intuitionistic proof (in the ordinary sense) of $P\lor Q$ begins by deciding whether to prove P or Q, but intuitionistic logic satisfies the “disjunction property” that from any proof of $P\lor Q$ in the empty context we can extract either a proof of P or a proof of Q. Similarly, any proof of in the empty context must contain enough information to transform refutations of Q into refutations of P as well as proofs of P into proofs of Q.

Building contraposition into the definition of implication makes it unsurprising that we get . So it might seem that we are going to fall into classical logic, but this is not the case. For instance, classically we have $\neg (P \to Q) \equiv P \land \neg Q$ ; but despite the apparent presence of this law in the “refutation” clause for , we do not have . Instead we have , where $\boxtimes $ is a different kind of conjunction:

  • A proof of $P\boxtimes Q$ is a proof of P together with a proof of Q.

  • A refutation of $P\boxtimes Q$ is a method converting any proof of P into a refutation of Q, together with a method converting any proof of Q into a refutation of P.

Note that $P\sqcap Q$ and $P\boxtimes Q$ have the same proofs, but different refutations. Both refutation clauses are based on the idea that P and Q cannot both be true, but to refute $P\sqcap Q$ we must specify which of them fails to be true, whereas to refute $P\boxtimes Q$ we simply have to show that if one of them is true then the other cannot be.

This suggests that $P\boxtimes Q$ is stronger than $P\sqcap Q$ , and in fact we can justify on the basis of our informal explanations. Since $P\boxtimes Q$ and $P\sqcap Q$ have the same proofs, it suffices to transform any refutation of $P\sqcap Q$ into a refutation of $P\boxtimes Q$ . The former is either a refutation of P or of Q; without loss of generality assume the latter. Then we can certainly produce a refutation of Q that doesn’t even need to use a proof of P. On the other hand, given a refutation of Q it is impossible that we could also have a proof of Q; so by ex contradictione quodlibet from any proof of Q we can vacuously produce a refutation of P.
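
Anticipating the pair notation of Section 3, in which a proposition is modeled by a pair $({P}^+,{P}^-)$ of intuitionistic propositions (its proofs and its refutations), the two conjunctions come out as

$$\begin{align*} P\sqcap Q = ({P}^+ \land {Q}^+,\ {P}^- \lor {Q}^-), \qquad P\boxtimes Q = ({P}^+ \land {Q}^+,\ ({P}^+ \to {Q}^-) \land ({Q}^+ \to {P}^-)), \end{align*}$$

and the entailment just described is the evident intuitionistic map from ${P}^- \lor {Q}^-$ into $({P}^+ \to {Q}^-) \land ({Q}^+ \to {P}^-)$ .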

It follows that the De Morgan dual of $\boxtimes $ is a different kind of disjunction that is weaker than $\sqcup $ . (For a discussion of notation, see Notation 2.2.) Its explanation is:

  • A proof of is a method converting any refutation of P into a proof of Q, together with a method converting any refutation of Q into a proof of P.

  • A refutation of is a refutation of P together with a refutation of Q.

Thus has the same refutations as $P\sqcup Q$ , but more proofs: while $P\sqcup Q$ supports proof by cases, supports only the disjunctive syllogism. As noted in Section 1, encapsulates a common constructive pattern for weakening definitions when the intuitionistic “or” is too strong: rather than asserting that one of two conditions holds, we assert that if either one of two conditions fails then the other must hold. We have , a version of the classical law $(P \to Q) \equiv (\neg P \lor Q)$ .
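
In the same pair notation, the De Morgan dual $\smash {{(\smash {{P}^{\perp }} \boxtimes \smash {{Q}^{\perp }})}^{\perp }}$ of $\boxtimes $ works out to

$$\begin{align*} \smash{{(\smash{{P}^{\perp}} \boxtimes \smash{{Q}^{\perp}})}^{\perp}} = \big(({P}^- \to {Q}^+) \land ({Q}^- \to {P}^+),\ {P}^- \land {Q}^-\big), \end{align*}$$

whose proof component is exactly the intuitionistic pattern “if either disjunct fails, then the other holds” from Section 1.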

A reader familiar with linear logic may recognize $\boxtimes $ and as its multiplicatives, while $\sqcap $ and $\sqcup $ are its additives,Footnote 4 with as its linear implication. Indeed, this explanation is similar to the “game semantics” of linear logic, in which a “proposition” is regarded as a game or interaction between a “prover” and a “refuter.”

Actually, our explanation justifies not fully general linear logic but affine logic, because in the nullary case (“true” and “false”) the distinctions collapse:

  • There is exactly one proof of $\top $ .

  • There is no refutation of $\top $ .

  • There is no proof of $\bot $ .

  • There is exactly one refutation of $\bot $ .

These are units for both additive and multiplicative connectives: $P \sqcap \top \equiv P \boxtimes \top \equiv P$ and . The most nontrivial part of this is the refutations of $P\boxtimes \top $ (or dually the proofs of ), which by definition consist of a method transforming any proof of $\top $ into a refutation of P, together with a method transforming any proof of P into a refutation of $\top $ . The former is essentially just a refutation of P; but given this, there can be no proof of P, so the latter method is vacuous.

The quantifiers are essentially additive; we write them as ${\textstyle \bigsqcup }$ and ${\textstyle \bigsqcap }$ instead of $\exists /\forall $ .

  • A proof of ${\textstyle \bigsqcup } x. P(x)$ is a value a together with a proof of $P(a)$ .

  • A refutation of ${\textstyle \bigsqcup } x. P(x)$ consists of a refutation of $P(a)$ for an arbitrary a.

  • A proof of ${\textstyle \bigsqcap } x. P(x)$ consists of a proof of $P(a)$ for an arbitrary a.

  • A refutation of ${\textstyle \bigsqcap } x. P(x)$ is a value a together with a refutation of $P(a)$ .

The most novel of these clauses is the one for refutations of ${\textstyle \bigsqcap } x. P(x)$ : just as a constructive proof of an existence statement should supply a witness, we stipulate that a constructive disproof of a universal statement should supply a counterexample. This yields De Morgan dualities

$$\begin{align*} \smash{{({\textstyle\bigsqcup} x.\, P(x))}^{\perp}} \equiv {\textstyle\bigsqcap} x.\, \smash{{P(x)}^{\perp}} \qquad\text{and}\qquad \smash{{({\textstyle\bigsqcap} x.\, P(x))}^{\perp}} \equiv {\textstyle\bigsqcup} x.\, \smash{{P(x)}^{\perp}} \end{align*}$$

and also “Frobenius” laws involving the multiplicative connectives:

$$\begin{align*} P \boxtimes {\textstyle\bigsqcup} x.\, Q(x) \equiv {\textstyle\bigsqcup} x.\, (P \boxtimes Q(x)) \end{align*}$$

(together with its De Morgan dual involving the multiplicative disjunction and ${\textstyle \bigsqcap }$ ).
But $P \sqcap {\textstyle \bigsqcup } x.\, Q(x) \equiv {\textstyle \bigsqcup } x.\, (P \sqcap Q(x))$ fails: a refutation of ${\textstyle \bigsqcup } x. (P \sqcap Q(x))$ consists of, for every a, either a refutation of P or a refutation of $Q(a)$ , while a refutation of $P \sqcap {\textstyle \bigsqcup } x.Q(x)$ must decide at the outset whether to refute P or to refute all $Q(a)$ ’s.

This explanation of the connectives and quantifiers solves the problem mentioned above with equality and inequality of real numbers. If we define

$$\begin{align*} (x=y) :\equiv {\textstyle\bigsqcap} n.\, |x_n-y_n|\le \textstyle\frac{2}{n}, \end{align*}$$
then we find that (assuming that $\smash {{(p\le q)}^{\perp }} \equiv (p> q)$ for $p,q\in \mathbb {Q}$ )

$$\begin{align*}\smash{{(x=y)}^{\perp}} \equiv {\textstyle\bigsqcup} n. |x_n-y_n|> \textstyle{\textstyle\frac{2}{n}}. \end{align*}$$

Thus, the correct notions of equality and inequality for real numbers are each other’s negations, relieving us of the need for a separate “apartness relation.”

The names linear and affine logic refer to the fact that $\boxtimes $ and are not idempotent: $P\boxtimes P {\not\equiv } P$ and . (More precisely, what fails are and .) A proof of consists of a method (well, technically two methods) for converting any refutation of P into a proof of P. Since P cannot be both provable and refutable, this is equivalently a method showing that P cannot be refuted—which is, of course, different from saying that it can be proven. Note that linear logic always satisfies the “multiplicative law of excluded middle” and the “multiplicative law of non-contradiction” $\smash {{(P\boxtimes \smash {{P}^{\perp }})}^{\perp }}$ (in fact, they are essentially the same statement). We call a proposition decidable if it satisfies the additive law of excluded middle $P\sqcup \smash {{P}^{\perp }}$ , or equivalently the additive law of non-contradiction $\smash {{(P\sqcap \smash {{P}^{\perp }})}^{\perp }}$ . According to the above informal explanations, decidability means that we have either a proof of P or a refutation of P.
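
Concretely, in the pair notation of Section 3 we have, up to equivalence,

$$\begin{align*} P\boxtimes P = \big({P}^+ \land {P}^+,\ {P}^+ \to {P}^-\big), \qquad P \sqcup \smash{{P}^{\perp}} = \big({P}^+ \lor {P}^-,\ {P}^- \land {P}^+\big); \end{align*}$$

the entailment from P to $P\boxtimes P$ would require producing a refutation ${P}^-$ from a mere method ${P}^+ \to {P}^-$ , which intuitionistic logic does not supply, while decidability is indeed the intuitionistic disjunction ${P}^+ \lor {P}^-$ .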

It may help to understand $\boxtimes $ if we write out the meaning of , which the reader can verify is equivalent to :

  • A proof of consists of methods for:

    • converting any proofs of P and Q into a proof of R,

    • converting any proof of P and refutation of R into a refutation of Q, and

    • converting any proof of P and refutation of Q into a refutation of R.

  • A refutation of consists of a proof of P, a proof of Q, and a refutation of R.

More generally, a proof of consists of “all possible direct or by-contrapositive proofs” that contradict one of the hypotheses. By contrast:

  • A proof of consists of:

    • a method converting any proofs of P and Q into a proof of R, and

    • a method converting any refutation of R into either a refutation of P or a refutation of Q.

  • A refutation of consists of a proof of P, a proof of Q, and a refutation of R.

That is, when proving the (stronger) statement , the by-contrapositive direction must use R to determine which of P or Q fails, whereas when proving we are allowed to assume one of P and Q and contradict the other.

We define , with the following meaning:

  • A proof of consists of methods for converting:

    • any proof of P into a proof of Q, and vice versa, plus

    • any refutation of P into a refutation of Q, and vice versa.

  • A refutation of consists of either:

    • a proof of P and a refutation of Q, or

    • a refutation of P and a proof of Q.

Often in linear logic one defines to be instead. However, for us is preferable, due in part to its more informative disjunctive notion of refutation; see also Examples 6.7 and 6.14 and Section 8.

Finally, following Girard [Reference Girard19] we introduce two unary connectives and called exponential modalities, with the following meanings:

  • A proof of is a proof of P.

  • A refutation of is a method converting any proof of P into an absurdity.

  • A refutation of is a refutation of P.

  • A proof of is a method converting any refutation of P into an absurdity.

These exponentials deal with the potential objection that not every constructive proposition has a “strong dual.” For instance, not every set has an apartness relation. But there is always the Heyting negation , and the propositions are those whose refutations are the “tautological” ones of this form. We will call a proposition in affine logic P affirmative if . For instance, a set in the antithesis model whose affine equality is affirmative will correspond to a set in intuitionistic logic equipped with the denial inequality, .

It is also common to encounter propositions that are the Heyting negation of their strong dual. For instance, while real numbers do not satisfy , they do satisfy $(x= y) \equiv \neg (x\neq y)$ (the inequality is tight). In the antithesis model these are the propositions with , which we call refutative.
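
In the antithesis model of Section 3, these two classes have a simple description: P is affirmative just when its refutations are the Heyting negation of its proofs, and refutative just when its proofs are the Heyting negation of its refutations:

$$\begin{align*} P \text{ affirmative:}\quad {P}^- \equiv \neg\, {P}^+, \qquad\qquad P \text{ refutative:}\quad {P}^+ \equiv \neg\, {P}^-. \end{align*}$$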

We can also understand by considering . Since Q cannot be both provable and refutable, if we can transform proofs of P into proofs of Q, then any refutation of Q already entails the impossibility of a proof of P. Thus, in proving the contrapositive direction is subsumed by the forwards direction, giving:

  • A proof of is a method converting any proof of P into a proof of Q.

  • A refutation of is a proof of P together with a refutation of Q.

Unlike , we only have . Thus is “usable multiple times” (since ) but “not contraposable.”

Note the proofs of are just the ordinary BHK interpretation of $P\to Q$ . This foreshadows the Girard translation of intuitionistic logic into linear logic.

Notation 2.2. Since we will be passing back and forth between intuitionistic and affine logic frequently, to minimize confusion I have tried not to duplicate any notations between the two contexts. As a mnemonic, our notations for affine connectives generally involve perpendicular lines; thus we have Footnote 5 in place of the intuitionistic $\land ,\lor ,\forall ,\exists ,\mathbf {1},\mathbf {0}$ . We carry this principle over to non-logical symbols as well, writing , and so on in place of the intuitionistic $\le ,<,\in $ .

The main exceptions are the affine implication , the exponentials , and equality. The symbols are associated strongly with linear logic, and sufficiently visually distinctive to need no mnemonic. And the intuitionistic $=$ can’t be made any more perpendicular, so we instead write $\circeq $ for affine equality to evoke .

In the intuitionistic context, we will always use “slashed” symbols such as $\neq ,\notin ,{\not\le }$ , and so on to denote strong “affirmative” negations, rather than the weak logical negations such as $\neg (x= y)$ . In the affine context, the corresponding slashed symbols will refer to the involutive affine negation: .

3 The antithesis translation for propositional logic

Like the BHK interpretation of intuitionistic logic, the explanation of the affine connectives and quantifiers in Section 2 is informal, and nonspecific about what constitutes a “method.” However, the relationship between the two interpretations can be made precise, in the form of a “translation” of affine logic into intuitionistic logic. This is analogous to the Gödel–Gentzen double-negation translation of classical logic into intuitionistic logic and the Girard translations of intuitionistic logic into linear logic, and like them it has both a semantic and a syntactic side.

Consider the Gödel–Gentzen translation, restricted to propositional logic for simplicity. On the semantic side, this constructs a Boolean algebra from a Heyting algebra. More generally, let $\mathbf {H}$ be any bicartesian closed category, meaning a cartesian closed category with finite coproducts; a Heyting algebra is the special case of a bicartesian closed poset. We regard the objects of $\mathbf {H}$ as intuitionistic propositions, and hence use logical notations for its structure: $\land $ for cartesian products, $\lor $ for coproducts, $\mathbf {1}$ and $\mathbf {0}$ for terminal and initial object, and $\to $ for exponentials.

Remark 3.1. Constructive logics must always deal with the question of proof relevance: whether we interpret a proposition to belong to a poset, such as a Heyting algebra (the proof-irrelevant version), or a more general category (the proof-relevant version). Of course, a proof-relevant interpretation retains more information, including the algorithms implicitly defined by a constructive proof. But the natural proof-irrelevance of some models can be important, such as when defining the Dedekind real numbers in a topos. Similarly, some axioms cannot be stated consistently in the naturally proof-relevant logic of dependent type theory, such as Brouwer’s continuity principle [Reference Escardó and Xu17], or the combination of excluded middle and univalence [52]. (A referee has pointed out that this doesn’t necessarily mandate full proof-irrelevance either; one might be able to use an intermediate modality instead.) Fortunately, as we will see in this section and the next, the antithesis translation is insensitive to this question: it works just as well for categories as for posets.Footnote 6

Returning to the Gödel–Gentzen translation, any bicartesian closed category $\mathbf {H}$ is distributive [Reference Carboni, Lack and Walters12], so its initial object is strict, meaning any morphism with codomain $\mathbf {0}$ is an isomorphism. In particular, $\mathbf {0}$ is subterminal, and thus so is the Heyting negation $\neg P = (P \to \mathbf {0})$ of any object. Thus the full subcategory $\mathbf {H}_{\neg \neg } \subseteq \mathbf {H}$ of objects of the form $\neg P$ consists of subterminal objects, and hence is a preorder. It is closed in $\mathbf {H}$ under $\land $ and $\to $ , and contains $\mathbf {1}$ and $\mathbf {0}$ ; it is not closed under $\lor $ but it does have binary joins defined in $\mathbf {H}$ by $\neg \neg (P\lor Q)$ . With these operations it (or more precisely its skeleton) is a Boolean algebra. Moreover, this construction defines a left adjoint, in a suitable sense, to the forgetful functor from Boolean algebras to bicartesian closed categories. This is the semantic side of the (propositional) Gödel–Gentzen translation: it makes a model of intuitionistic logic into a model of classical logic.
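
For instance, the law of excluded middle holds in this subcategory because, writing $\vee _{\neg \neg }$ for the join $\neg \neg (P\lor Q)$ just described,

$$\begin{align*} \neg P \vee_{\neg\neg} \neg\neg P \;=\; \neg\neg(\neg P \lor \neg\neg P) \;\cong\; \neg(\neg\neg P \land \neg\neg\neg P) \;\cong\; \neg(\neg\neg P \land \neg P) \;\cong\; \mathbf{1}. \end{align*}$$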

The syntactic side goes in reverse, translating any formula in classical logic into a formula in intuitionistic logic. It can be obtained automatically from the semantic side, by considering the free bicartesian closed category $\mathbb {H}[\Sigma ]$ generated by some signature $\Sigma $ , whose objects and morphisms are formulas and proofs in intuitionistic logic, and its resulting Boolean algebra $\mathbb {H}[\Sigma ]_{\neg \neg }$ . Then if $\mathbb {B}[\Sigma ]$ is the free Boolean algebra generated by the same signature, whose elements and inequalities are formulas and entailments in classical logic, its universality means there is a unique Boolean algebra homomorphism $(-)^{\mathrm {N}} : \mathbb {B}[\Sigma ] \to \mathbb {H}[\Sigma ]_{\neg \neg }$ .

This is the syntactic side of the Gödel–Gentzen translation, which maps formulas and entailments in classical logic into formulas and proofs in intuitionistic logic. Its usual explicit definition can be read off from the Boolean algebra structure of $\mathbb {H}[\Sigma ]_{\neg \neg }$ above, e.g., $(P\land Q)^{\mathrm {N}} = P^{\mathrm {N}} \land Q^{\mathrm {N}}$ and $(P\lor Q)^{\mathrm {N}} = \neg \neg (P^{\mathrm {N}} \lor Q^{\mathrm {N}})$ . In particular, deriving these formulas semantically in this way means that the translation is automatically sound, i.e., maps proofs to proofs.
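
For instance, the remaining propositional clauses, read off in the same way, are

$$\begin{align*} p^{\mathrm{N}} = \neg\neg\, p \ \text{for atomic } p, \qquad (P\to Q)^{\mathrm{N}} = P^{\mathrm{N}} \to Q^{\mathrm{N}}, \qquad \mathbf{1}^{\mathrm{N}} = \mathbf{1}, \qquad \mathbf{0}^{\mathrm{N}} = \mathbf{0}. \end{align*}$$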

The situation with the Girard translation is similar. We recall the relevant category-theoretic models of (propositional) linear logic.

Definition 3.2. A $\ast $ -autonomous category [Reference Barr6] is a closed symmetric monoidal category equipped with an object $\bot $ such that for any P the double-dualization map from P to is an isomorphism. If L has finite products, a Seely comonad [Reference Melliès34, Reference Seely44] on it is a comonad such that the Kleisli category has finite products and the forgetful functor is strong symmetric monoidal (where is regarded as cartesian monoidal).

In a $\ast $ -autonomous category we write and , and if it has a Seely comonad we write . Since $\smash {{(-)}^{\perp }}$ is a self-duality, if a $\ast $ -autonomous category has products then it also has coproducts; as in Section 2 we write its products as $\sqcap $ and its coproducts as $\sqcup $ .

If we identify the objects of with objects of L as usual, with and the forgetful functor acting on objects by , then the cartesian product in must take objects P and Q to $P\sqcap Q$ ; hence we have in L.Footnote 7 It follows that is a cartesian closed category, with exponential .

This is the semantic side of the Girard translation. Its syntactic side can be deduced as before, as a map from the free cartesian closed category $\mathbb {H}'[\Sigma ]$ on some signature to the cartesian closed category underlying the free $\ast $ -autonomous category with finite products and Seely comonad on the same signature. This yields a map from formulas and proofs in intuitionistic logicFootnote 8 to those in linear logic, whose syntactic rules can be read off of the cartesian closed structure of , e.g., $(P\land Q)^{\mathrm {G}} = P^{\mathrm {G}} \sqcap Q^{\mathrm {G}}$ and .

The semantic side of the antithesis translation, therefore, will be a construction of a model of affine logic from a model of intuitionistic logic. By a model of affine logic we mean a $\ast $ -autonomous category with finite products and Seely comonad that is semicartesian, meaning that its monoidal unit is also its terminal object (and hence the dualizing object $\bot $ is also the initial object).

The semantic antithesis translation is actually an instance of both the Chu construction [Reference Chu13, Reference Chu14] and the Dialectica construction in the form of [Reference de Paiva41]. We will not describe these constructions in general, but only the specific case of interest to us, in which they coincide. On the side of subsets rather than predicates, a similar notion was already introduced by [Reference Bishop and Bridges10, Chapter 3, Section 2] under the name complemented subset; see Theorem 6.11.

Definition 3.3. For a bicartesian closed category $\mathbf {H}$ , let $\mathbf {H}_{\pm }$ be the full subcategory of $\mathbf {H}\times \mathbf {H}^{\mathrm {op}}$ determined by the pairs $P=({P}^+, {P}^-)$ such that ${P}^+ \land {P}^-$ is initial (equivalently, such that there is a morphism ${P}^+ \land {P}^-\to \mathbf {0}$ ; such a morphism is unique when it exists because $\mathbf {0}$ is subterminal).

Thus, a morphism $f : P\to Q$ in $\mathbf {H}_{\pm }$ consists of maps ${f}^+ : {P}^+ \to {Q}^+$ and ${f}^- : {Q}^- \to {P}^-$ in $\mathbf {H}$ . In general, Chu and Dialectica constructions impose additional constraints on such morphisms—the difference between the two being in the constraints—but since $\mathbf {0}$ is subterminal, those constraints are vacuous for us.

Lemma 3.4. $\mathbf {H}_{\pm }$ has finite products and coproducts.

Proof They are inherited from $\mathbf {H}\times \mathbf {H}^{\mathrm {op}}$ , with the following definitions:

$$\begin{alignat*}{2} \top &= (\mathbf{1},\mathbf{0}), & \qquad P \sqcap Q &= ({P}^+ \land {Q}^+, {P}^- \lor {Q}^-), \\ \bot &= (\mathbf{0},\mathbf{1}), & P \sqcup Q &= ({P}^+ \lor {Q}^+, {P}^- \land {Q}^-). \end{alignat*} $$

Note that we use Notation 2.2 for these.
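
These pairs are indeed objects of $\mathbf {H}_{\pm }$ : for example, using the distributivity of $\mathbf {H}$ noted earlier,

$$\begin{align*} ({P}^+ \land {Q}^+) \land ({P}^- \lor {Q}^-) \;\cong\; ({P}^+ \land {Q}^+ \land {P}^-) \lor ({P}^+ \land {Q}^+ \land {Q}^-) \;\to\; \mathbf{0}, \end{align*}$$

and dually for $P \sqcup Q$ .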

Lemma 3.5. $\mathbf {H}_{\pm }$ is a semicartesian $\ast $ -autonomous category.

Proof The monoidal structure is defined by

$$\begin{align*} P \boxtimes Q = \big({P}^+ \land {Q}^+,\ ({P}^+ \to {Q}^-) \land ({Q}^+ \to {P}^-)\big). \end{align*}$$
A general Chu construction requires a pullback rather than a product in the refutations of $P\boxtimes Q$ , but subterminality of $\mathbf {0}$ makes that unnecessary here. We leave associativity and symmetry to the reader (or standard references on the Chu and Dialectica constructions). For the unit, since $\top = (\mathbf {1},\mathbf {0})$ we have

$$ \begin{align*} P \boxtimes \top &= ({P}^+ \land \mathbf{1}, ({P}^+ \to \mathbf{0}) \land (\mathbf{1} \to {P}^-))\\ &\cong ({P}^+, ({P}^+ \to \mathbf{0}) \land {P}^-)\\ &\cong ({P}^+, {P}^-). \end{align*} $$

Here the final isomorphism is because ${P}^+ \to \mathbf {0}$ is subterminal and there is a morphism ${P}^- \to ({P}^+ \to \mathbf {0})$ . The closed structure is defined by the pair

$$\begin{align*} \big(({P}^+ \to {Q}^+) \land ({Q}^- \to {P}^-),\ \ {P}^+ \land {Q}^-\big). \end{align*}$$
We leave the verification of adjointness to the reader. Now since $\bot = (\mathbf {0},\mathbf {1})$ , we have

$$\begin{align*} \smash{{P}^{\perp}} = \big(({P}^+ \to \mathbf{0}) \land (\mathbf{1} \to {P}^-),\ {P}^+ \land \mathbf{1}\big) \cong ({P}^-, {P}^+), \end{align*}$$

from which it follows immediately that $\smash {{P}^{\perp \perp }} \cong P$ .

Lemma 3.6. $\mathbf {H}_{\pm }$ has a Seely comonad. Moreover:

  • The Seely comonad is idempotent.

  • Its Kleisli category is equivalent to H, and in particular has coproducts.

  • The right adjoint is strong monoidal; thus in addition to the Seely conditions we have and .

Proof The forgetful map ${(-)}^+ : \mathbf {H}_{\pm } \to \mathbf {H}$ has a fully faithful left adjoint sending P to $(P,\neg P)$ , where $\neg P = (P \to \mathbf {0})$ is the Heyting negation. The induced comonad and monad are

Since the left adjoint is fully faithful, this comonad and monad are idempotent, and their Kleisli (and also Eilenberg–Moore) adjunctions coincide with the adjunction we started with. The definition of $\boxtimes $ makes it clear that the right adjoint ${(-)}^+$ is strong symmetric monoidal, while for the left adjoint F we have

$$ \begin{align*} F P \boxtimes F Q &= ( P \land Q, ( P \to \neg Q) \land ( Q \to \neg P))\\ &\cong ( P \land Q, ( P \to Q \to \mathbf{0}) \land ( Q \to P \to \mathbf{0}))\\ &\cong ( P \land Q, (P \land Q \to \mathbf{0}) \land (P \land Q \to \mathbf{0}))\\ &\cong ( P \land Q, \neg (P \land Q))\\ &= F(P\land Q) \end{align*} $$

using that since $P \land Q \to \mathbf {0}$ is subterminal, it is its own cartesian square.

Lemmas 3.4–3.6 define the semantic antithesis translation, which in fact is a right adjoint to a suitable forgetful functor. As before, we obtain the syntactic antithesis translation as the unique morphism $(-)^{\pm } : \mathbb {A}[\Sigma ] \to \mathbb {H}[\Sigma ]_{\pm }$ , where $\mathbb {A}[\Sigma ]$ is the free semicartesian $\ast $ -autonomous category with products and a Seely comonad on the signature $\Sigma $ . This is a translation that maps formulas and proofs in affine logic to pairs of formulas and proofs in intuitionistic logic. It is given by explicit formulas that can be read off of the structure of $\mathbf {H}_{\pm }$ , and which are shown in Figure 1. As with the Gödel–Gentzen and Girard translations, the semantic derivation of these formulas automatically ensures soundness: any proof in affine logic automatically translates to a pair of proofs in intuitionistic logic.

Figure 1 The syntactic antithesis translation for propositional logic.

Of course, all the definitions in Figure 1 match our informal explanations of the connectives in Section 2. Thus any rigorous version of the BHK interpretation, yielding a bicartesian closed category that models propositional intuitionistic logic, can be enhanced to a model of propositional affine logic matching our meaning explanation.

Moreover, since the Kleisli category of $\mathbf {H}_{\pm }$ recovers $\mathbf {H}$ again, the Girard translation undoes the antithesis translation:

In other words, we can regard affine logic as an extension of intuitionistic logic.

A very important point, however, is that unlike the Gödel–Gentzen and Girard translation, the antithesis translation is not conservative in the logical sense. That is, there are statements in affine logic that always hold under the antithesis translation (i.e., in categories of the form $\mathbf {H}_{\pm }$ ), but are not provable in general affine logic. Some such statements include:

To a linear logician, this makes the logic look quite degenerate: much of the potential richness of the exponentials is invisible to the antithesis translation. Therefore, unlike the Gödel–Gentzen and Girard translations, we should not view the antithesis translation as a way to study affine logic by “embedding” it into intuitionistic logic. Instead, we view it as a way to use affine logic as a tool for stating and proving definitions and theorems in intuitionistic logic. (But we will return to this in Section 11.)

Remark 3.7. As noted above, the antithesis translation is sound for proofs. We will not make very extensive use of proofs in affine logic, but it is worth briefly summarizing the relevant rules (see, e.g., [Reference Girard19] for more detail).

Informally, affine logic looks like classical logic except that each hypothesis may only be used at most once (except for those with a on them). Put differently, the hypotheses of a theorem are implicitly combined with $\boxtimes $ , and since $P {\not\equiv } P\boxtimes P$ they cannot be “duplicated.” If we have $P\boxtimes Q$ we can use both P and Q (at most once each), whereas if we have $P\sqcap Q$ we can choose to use P or to use Q, but not both. Similarly, can only be instantiated at one value of x. And dually, to prove $P\boxtimes Q$ we prove P and Q with each hypothesis used only in one sub-proof, while to prove $P\sqcap Q$ we can use each hypothesis in both sub-proofs (once in each).

A hypothesis of $P\sqcup Q$ can be case-split, while a hypothesis of is used by disjunctive syllogism (e.g., proving $\smash {{P}^{\perp }}$ to conclude Q). To prove $P\sqcup Q$ we prove P or prove Q, while to prove we can assume $\smash {{P}^{\perp }}$ to prove Q or vice versa. Implication behaves as classically, including contraposition; proof by contradiction is universally valid. (Intuitionistically, proof by contradiction implies excluded middle $P\lor \neg P$ since $\neg (P\lor \neg P) \equiv (\neg P\land P)$ is a contradiction; but affinely $\smash {{(P\sqcup \smash {{P}^{\perp }})}^{\perp }} \equiv (\smash {{P}^{\perp }} \sqcap P)$ is no contradiction since we can’t use both $\smash {{P}^{\perp }}$ and P.)

Remark 3.8. Notations such as $\sqcap /\sqcup $ and are fine for writing logical formulas explicitly, but for talking about mathematics it is useful to also represent each connective by an English word. Girard suggested to pronounce $\sqcap $ as “with,” $\sqcup $ as “plus,” $\boxtimes $ as “tensor,” and as “par”; but most of these words have other meanings in mathematics and everyday English, leading to potential confusion.

When it is understood that the ambient logic is linear or affine (so that there is no danger of confusion with $\land $ and $\lor $ ), I prefer to pronounce $\boxtimes $ as simply “and,” since this conjunction implicitly combines multiple hypotheses, is left adjoint to implication, and is very often used where both intuitionistic and classical mathematics use $\land $ . Similarly, I prefer to pronounce $\sqcup $ as simply “or,” since this is the disjunction that supports proof by cases, is almost alwaysFootnote 9 what an intuitionistic constructive mathematician means by “or,” and about half the time is what a classical mathematician means by “or” as well. The other half of the time the classical mathematician means , for which Girard’s word “par” is at least unlikely to lead to confusion; but two less awkward-sounding possibilities are “unless” and “or else,” since is equivalent to both and . Pronouncing $\sqcap $ is trickier, but Noah Snyder has suggested “exclusive and” (“xand” for short)—there is no formal relationship to the “exclusive or,” but the word “exclusive” conveys the intuition of “exactly one of the two,” which is how a hypothesis of $P\sqcap Q$ can be used in a linear proof: as P or as Q, but not both.

Remark 3.9. One might argue that $\mathbf {H}_{\pm }$ is too large, as it contains propositions like $(\mathbf {0},\mathbf {0})$ which are very far from being either provable or refutable. We cannot constructively expect every proposition to be either provable or refutable, but we might try some weaker restriction like $\neg (\neg {P}^+ \land \neg {P}^-)$ . However, while propositions satisfying $\neg (\neg {P}^+ \land \neg {P}^-)$ are closed under finitary connectives, their closure under quantifiers is equivalent to the non-constructive law of “double-negation shift” $(\forall x. \neg \neg P(x)) \to (\neg \neg \forall x. P(x))$ . For a dramatic counterexample, let $\mathbf {H} = \mathcal {O}(\mathbb {R})$ be the open-set lattice of the real numbers, with $x:\mathbb {R}$ and and ; then $\neg (\neg {P(x)}^+ \land \neg {P(x)}^-)$ for all x, but .

Remark 3.10. In fact, already Vickers [Reference Vickers53] explicitly suggested considering separately for each proposition its affirmations and refutations:

Given an assertion, we can therefore ask:

  • Under what circumstances could it be affirmed?

  • Under what circumstances could it be refuted? [Reference Vickers53, p. 6]

It is thus natural to imagine propositions that can never be affirmed (i.e., proven) and also never refuted. The antithesis construction can thus be viewed as an intensional theory of the proofs and refutations of propositions without regard to “truth.” Ignoring truth is constructively sensible since we can never directly observe it (we can only affirm or refute propositions), and intensionality is sensible since two propositions that happen to have the same extension (truth circumstances) might have different affirmations or refutations depending on how they are phrased.

Vickers defines a proposition to be affirmative if it is true exactly when it can be affirmed (i.e., proven), and refutative if it is false exactly when it can be refuted. Our definition is a bit stronger, and more intensional: roughly speaking, we call a proposition affirmative if we know, by virtue of its definition, that whenever it is true it can be affirmed, so that we can refute by showing that it cannot be affirmed. Similarly, we call a proposition refutative if we know that whenever it is false it can be refuted, so that we can affirm it by showing that it cannot be refuted.

If intuitionistic logic is the logic of affirmative propositions, and co-intuitionistic logic [Reference Shramko45, Reference Trafford48] is the logic of refutative propositions, then we can view affine logic as a logic of propositions that are subject to either affirmation or refutation.

Remark 3.11. Even if $\mathbf {H}$ is a Boolean algebra, $\mathbf {H}_{\pm }$ is larger than $\mathbf {H}$ . For instance, $\{\mathbf {0},\mathbf {1}\}_{\pm } = \{ (\mathbf {0},\mathbf {1}) \le (\mathbf {0},\mathbf {0}) \le (\mathbf {1},\mathbf {0}) \}$ coincides with three-valued Łukasiewicz logic, where $(\mathbf {0},\mathbf {0})$ is called “unknown” or “undefined.”
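
For example, writing $\mathbf {u}$ for the middle element $(\mathbf {0},\mathbf {0})$ , the multiplicative conjunction of $\{\mathbf {0},\mathbf {1}\}_{\pm }$ is computed from Lemma 3.5 to be the Łukasiewicz “strong conjunction”:

$$\begin{array}{c|ccc} \boxtimes & \bot & \mathbf{u} & \top \\ \hline \bot & \bot & \bot & \bot \\ \mathbf{u} & \bot & \bot & \mathbf{u} \\ \top & \bot & \mathbf{u} & \top \end{array}$$

In particular $\mathbf {u} \boxtimes \mathbf {u} = \bot $ , whereas $\mathbf {u} \sqcap \mathbf {u} = \mathbf {u}$ .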

Remark 3.12. Dan Licata has pointed out that the antithesis translation has certain parallels with the natural deduction of [Reference Lovas and Crary30] for classical (linear) logic that uses two judgments $P \;\textit{true}$ and $P\;\textit{false}$ .

Remark 3.13. There are other ways to add “constructive negation” to intuitionistic logic. We have already noted that the antithesis construction is both a Chu construction and a Dialectica construction, and both of these constructions have more general versions that also model linear logic. For instance, it is shown in [Reference Patterson42] that the “constructible falsity” logic of [Reference Nelson36] is modeled by the Chu construction $\mathrm {Chu}(\mathbf {H},\mathbf {1})$ . Compared to $\mathbf {H}_{\pm }$ (which is $\mathrm {Chu}(\mathbf {H},\mathbf {0})$ ), this drops even the requirement $\neg ({P}^+ \land {P}^-)$ , allowing propositions like $(\mathbf {1},\mathbf {1})$ that are both provable and refutable. The lattice $\mathrm {Chu}(\mathbf {H},\mathbf {1})$ is $\ast $ -autonomous but not semicartesian, so the units of $\boxtimes $ and no longer coincide with those of $\sqcap $ and $\sqcup $ . Instead we have the MIX rule [Reference Cockett and Seely15], i.e., the units of $\boxtimes $ and coincide with each other; indeed they are precisely $(\mathbf {1},\mathbf {1})$ .

4 The antithesis translation for predicate logic

To do substantial mathematics we require not just propositional logic, but at least first-order logic, and often higher-order logic or even dependent types. While attempting not to get bogged down by detail, in this section we describe antithesis translations for these richer theories. I encourage a reader who is not already an aficionado of categorical semantics to skim this section on a first reading.

4.1 First-order logic

This corresponds semantically to the following notion, due essentially to Lawvere [Reference Lawvere29].

Definition 4.1.1. Let $\mathcal {K}$ be a 2-category with a forgetful functor $U:\mathcal {K} \to \mathcal {C}\mathit {at}$ . A $\mathcal {K}$ -valued hyperdoctrine consists of:

  • A category T with finite products.

  • A pseudofunctor $\mathcal {P} : \mathbf {T}^{\mathrm {op}} \to \mathcal {K}$ .

  • For any product projection $\pi :A\times B \to A$ in T, the functor

    $$\begin{align*}U\pi^* : U\mathcal{P}(A) \to U\mathcal{P}(A\times B) \end{align*}$$
    has both a left adjoint $\Sigma _B$ and a right adjoint $\Pi _B$ .
  • The Beck–Chevalley condition holds, meaning that for any $f:A'\to A$ in T the induced maps are isomorphisms:

We think of the objects of T as representing types and its morphisms as terms, with the objects of $\mathcal {P}(A)$ being predicates on A. The morphism $f^* : \mathcal {P}(A) \to \mathcal {P}(A')$ of $\mathcal {K}$ induced by $f:A'\to A$ represents substitution into a predicate, while the adjoints $\Sigma _B$ and $\Pi _B$ act like existential and universal quantification. Note that $\Sigma _B$ and $\Pi _B$ are not in general morphisms of $\mathcal {K}$ .

If $\mathcal {K}=\mathcal {I}\mathit {nt}$ is the 2-category of bicartesian closed categories, with functors preserving finite products, coproducts, and exponentials, and natural isomorphisms between them, we speak of an intuitionistic hyperdoctrine, and write $\Sigma _B = \exists _B$ and $\Pi _B = \forall _B$ . Similarly, if $\mathcal {K}=\mathcal {A}\mathit {ff}$ is the 2-category of semicartesian $\ast $ -autonomous categories with finite products and a Seely comonad, with functors that preserve all this structure up to isomorphism, and natural isomorphisms between them, we speak of an affine hyperdoctrine, and write $\Sigma _B = {\textstyle \bigsqcup }_B$ and $\Pi _B = {\textstyle \bigsqcap }_B$ .

Example 4.1.2. If H is a complete and cocomplete cartesian closed category, then there is an intuitionistic hyperdoctrine with $\mathbf {T} = \mathbf {Set}$ and $\mathcal {P}(A) = \mathbf {H}^A$ . The adjoints $\Pi _B$ and $\Sigma _B$ are given by products and coproducts. Similarly, if L is a complete and cocomplete semicartesian $\ast $ -autonomous category with a Seely comonad, then $\mathbf {T} = \mathbf {Set}$ and $\mathcal {P}(A) = \mathbf {L}^A$ defines an affine hyperdoctrine.

Examples 4.1.3. Suppose T is a category with finite limits. Then there is a pseudofunctor $\mathcal {P} : \mathbf {T}^{\mathrm {op}} \to \mathcal {C}\mathit {at}$ sending A to the poset of subobjects of A, which is an intuitionistic hyperdoctrine if and only if T is a Heyting category. There is also such a pseudofunctor sending A to the slice category $\mathbf {T}/A$ , which is an intuitionistic hyperdoctrine if and only if T is locally cartesian closed with finite coproducts.

More generally, any full comprehension category having $\Sigma $ -types, $\Pi $ -types (with function extensionality), and finite sum types is an intuitionistic hyperdoctrine. If it also has propositional truncations, in the sense of [52], then its “h-propositions” (types with at most one element) also form an intuitionistic hyperdoctrine. If it has universe objects closed under the relevant type formers, the elements of any particular universe also form an intuitionistic hyperdoctrine.

Remark 4.1.4. Even in an affine hyperdoctrine, the base category T is still cartesian monoidal. We could allow T to be semicartesian monoidal, as then it would still have “projections” whose adjoints would supply quantifiers; something similar appears in first-order Bunched Implication [Reference O’Hearn and Pym37]. But since the antithesis translation leaves the base category T unchanged, we have no need for this generality, although we will mention it again in Remark 6.18.

We now extend the antithesis translation to hyperdoctrines.

Lemma 4.1.5. The semantic antithesis translation from Section 3 defines a 2-functor

$$\begin{align*}(-)_{\pm} : \mathcal{I}\mathit{nt} \to \mathcal{A}\mathit{ff}. \end{align*}$$

Proof Immediate. Note that since $\mathbf {H}_{\pm }$ , like its substrate $\mathbf {H}\times \mathbf {H}^{\mathrm {op}}$ , is partly covariant and partly contravariant, it can only be 2-functorial on natural isomorphisms; this is why we defined $\mathcal {I}\mathit {nt}$ and $\mathcal {A}\mathit {ff}$ to contain only these.

Theorem 4.1.6. If $\mathcal {P} : \mathbf {T}^{\mathrm {op}} \to \mathcal {I}\mathit {nt}$ is an intuitionistic hyperdoctrine, the composite

$$\begin{align*} \mathbf{T}^{\mathrm{op}} \xrightarrow{\ \mathcal{P}\ } \mathcal{I}\mathit{nt} \xrightarrow{\ (-)_{\pm}\ } \mathcal{A}\mathit{ff} \end{align*}$$

is an affine hyperdoctrine $\mathcal {P}_{\pm }$ .

Proof It remains to show that if $\pi ^* : \mathcal {P}(A) \to \mathcal {P}(A\times B)$ is a bicartesian closed functor with left and right adjoints $\exists _B$ and $\forall _B$ , then $(\pi ^*)_{\pm } : \mathcal {P}(A)_{\pm } \to \mathcal {P}(A\times B)_{\pm }$ also has left and right adjoints satisfying the Beck–Chevalley condition. We define these by the expected formulas:

$$\begin{align*} {\textstyle\bigsqcup}_B(P) = (\exists_B {P}^+,\ \forall_B {P}^-), \qquad {\textstyle\bigsqcap}_B(P) = (\forall_B {P}^+,\ \exists_B {P}^-). \end{align*}$$
We leave it to the reader to verify that this works.
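
For instance, for the left adjoint the required bijection is assembled directly from the intuitionistic adjunctions $\exists _B \dashv \pi ^* \dashv \forall _B$ :

$$\begin{align*} \mathcal{P}(A)_{\pm}\big({\textstyle\bigsqcup}_B P,\, Q\big) &\cong \mathcal{P}(A)\big(\exists_B {P}^+,\, {Q}^+\big) \times \mathcal{P}(A)\big({Q}^-,\, \forall_B {P}^-\big)\\ &\cong \mathcal{P}(A\times B)\big({P}^+,\, \pi^*{Q}^+\big) \times \mathcal{P}(A\times B)\big(\pi^*{Q}^-,\, {P}^-\big)\\ &\cong \mathcal{P}(A\times B)_{\pm}\big(P,\, (\pi^*)_{\pm} Q\big), \end{align*}$$

naturally in P and Q; the case of ${\textstyle \bigsqcap }_B$ is dual, and the Beck–Chevalley condition follows componentwise from the intuitionistic one.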

This defines the semantic antithesis interpretation for first-order logic.

Moving now to syntax, we consider a formal system of first-order logic with types, each containing terms or elements $t:A$ that may involve variables belonging to other types, and a class of propositions that may also involve variables belonging to types. We assume finite product types $A\times B$ , whose elements are ordered pairs, and a unit type $1$ that has one element. Propositions are related by entailments

$$\begin{align*}P, Q \vdash_{x:A, y:B} R, \end{align*}$$

where $P,Q,R$ are propositions involving only the variables x (of type A) and y (of type B). In intuitionistic first-order logic, we equip the propositions with the usual intuitionistic logical operations:

$$\begin{align*}\land, \lor, \mathbf{1}, \mathbf{0}, \to, \neg, \forall, \exists \end{align*}$$

and the usual intuitionistic rules of deduction. Similarly, in affine first-order logic, we equip the propositions with the affine logical operations:

together with the usual affine rules of deduction (that is, the rules of [Reference Girard19] for linear logic, plus weakening).

Notation 4.1.7. For clarity, we sometimes annotate a quantified variable by the type to which it belongs, e.g., $\exists x^A. P(x)$ if $x:A$ .

By standard arguments (see, e.g., [Reference Jacobs25]), the syntax of either kind of first-order logic, starting from some signature of base types, terms, and propositions, presents a free hyperdoctrine of the appropriate sort. (We gloss over coherence questions here, which can be resolved as in [Reference Hofmann22, Reference Lumsdaine and Warren31], and are automatic in the proof-irrelevant case when the categories $\mathcal {P}(A)$ are posets.) Thus, as in the propositional case, by applying the semantic antithesis translation to the syntactic intuitionistic hyperdoctrine, we obtain a syntactic antithesis translation of affine first-order logic into intuitionistic first-order logic. This translation leaves the types unchanged, acts on the propositional connectives as in Figure 1, and acts on the quantifiers as shown in Figure 2. As before, the derivation of this translation from the semantic version means that it is automatically sound for proofs (though not complete).

Figure 2 The syntactic antithesis translation for first-order logic.
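To make the shape of this translation concrete, the following is a minimal Lean 4 sketch of antithesis propositions as pairs of an affirmation and a refutation that cannot both hold, together with plausible clauses for one additive connective, one multiplicative connective, linear negation, and the two quantifiers. The structure `Anti` and all field and definition names are ours, and the clauses follow the standard Chu-construction formulas rather than being copied from Figures 1 and 2.

```lean
/-- A pair of an affirmation and a refutation that cannot both hold
    (a hypothetical rendering of a proposition in the antithesis model). -/
structure Anti where
  pos : Prop
  neg : Prop
  compat : pos → neg → False

/-- Linear negation: swap affirmation and refutation. -/
def Anti.lneg (P : Anti) : Anti where
  pos := P.neg
  neg := P.pos
  compat := fun hn hp => P.compat hp hn

/-- Additive conjunction: affirm both, refute either. -/
def Anti.inf (P Q : Anti) : Anti where
  pos := P.pos ∧ Q.pos
  neg := P.neg ∨ Q.neg
  compat := fun ⟨hp, hq⟩ h => h.elim (P.compat hp) (Q.compat hq)

/-- Multiplicative conjunction: affirm both; refute one given the other's affirmation. -/
def Anti.tensor (P Q : Anti) : Anti where
  pos := P.pos ∧ Q.pos
  neg := (P.pos → Q.neg) ∧ (Q.pos → P.neg)
  compat := fun ⟨hp, hq⟩ ⟨f, _⟩ => Q.compat hq (f hp)

/-- Existential quantifier: affirm at some instance, refute at every instance. -/
def Anti.ex {A : Type} (P : A → Anti) : Anti where
  pos := ∃ a, (P a).pos
  neg := ∀ a, (P a).neg
  compat := fun ⟨a, hp⟩ hn => (P a).compat hp (hn a)

/-- Universal quantifier: affirm at every instance, refute at some instance. -/
def Anti.all {A : Type} (P : A → Anti) : Anti where
  pos := ∀ a, (P a).pos
  neg := ∃ a, (P a).neg
  compat := fun hp ⟨a, hn⟩ => (P a).compat (hp a) hn
```

Note that the additive and multiplicative conjunctions agree on affirmations and differ only in refutations; this is the bifurcation exploited repeatedly below (e.g., in Examples 6.5 and 6.6).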

We now consider various kinds of additional structure that can be added to a hyperdoctrine, and the corresponding operations in syntax.

4.2 Comprehension

Let $\mathcal {C}\mathit {at}_t$ be the 2-category of categories with a terminal object.Footnote 10 We denote such terminal objects generically by $1$ .

Definition 4.2.1 [Reference Lawvere28].

Suppose $U:\mathcal {K} \to \mathcal {C}\mathit {at}$ factors through $\mathcal {C}\mathit {at}_t$ . A $\mathcal {K}$ -valued hyperdoctrine $\mathcal {P} : \mathbf {T}^{\mathrm {op}}\to \mathcal {K}$ has comprehension if for all $A\in \mathbf {T}$ and $P\in \mathcal {P}(A)$ , the following functor is representable:

$$ \begin{align*} (\mathbf{T}/A)^{\mathrm{op}} &\to \mathbf{Set},\\ (f:B\to A) &\mapsto \mathcal{P}(B)(1,f^*(P)). \end{align*} $$

We denote a representing object by $i_P : \{P\} \to A$ : it can be thought of as the subtype of A consisting of those elements that satisfy P.

Example 4.2.2. The intuitionistic hyperdoctrine $\mathcal {P}(A) = \mathbf {H}^A$ has comprehension, with $\{P\}$ the set of pairs $(a,p)$ where $a\in A$ and $p\in \mathbf {H}(\mathbf {1},P_a)$ . The same is true for the affine hyperdoctrine $\mathcal {P}(A) = \mathbf {L}^A$ .

Example 4.2.3. Examples 4.1.3 all have comprehension. The comprehension of a subobject or object of a slice category is itself, while a comprehension category includes as data a comprehension operation.

Proposition 4.2.4. If $\mathcal {P}$ is an intuitionistic hyperdoctrine with comprehension, then $\mathcal {P}_{\pm }$ is an affine hyperdoctrine with comprehension.

Proof Since $\top = (\mathbf {1},\mathbf {0})$ in $\mathcal {P}(A)_{\pm }$ , a morphism $\top \to P$ in $\mathcal {P}(A)_{\pm }$ consists of morphisms $\mathbf {1} \to {P}^+$ and ${P}^-\to \mathbf {0}$ . But the latter is unique if it exists, which it does if there is a morphism $\mathbf {1} \to {P}^+$ since ${P}^+\land {P}^- \to \mathbf {0}$ . Thus, we can define $\{({P}^+,{P}^-)\} = \{{P}^+\}$ .
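Concretely, in the propositions-as-types reading where an antithesis predicate on a type is a pair of ordinary predicates, this comprehension can be sketched in Lean 4 as follows (names ours): only the first component enters the subtype.

```lean
/-- Comprehension in the antithesis model, sketched: a predicate is a pair of an
    affirmation and a refutation, and comprehension selects the elements whose
    affirmation holds, ignoring the refutation entirely. -/
def antiCompr {A : Type} (P : A → Prop × Prop) : Type :=
  { a : A // (P a).1 }

/-- Example: the refutation `n ≠ 0` plays no role in forming the subtype. -/
example : antiCompr (fun n : Nat => (n = 0, n ≠ 0)) := ⟨0, rfl⟩
```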

Note that comprehension in the antithesis model discards all information about refutations; hence in particular . More generally, we have:

Proposition 4.2.5. For $P\in \mathcal {P}(A)$ in any affine hyperdoctrine with comprehension, there is a morphism from $\top $ to in $\mathcal {P}(\{P\})$ .

Proof By definition, there is a morphism from $\top $ to $i_P^*(P)$ in $\mathcal {P}(\{P\})$ . Now we apply the functor and use the fact that .

Since is not in general idempotent (though it is in the antithesis model), this does not imply . But it does make P arbitrarily duplicable over $\{P\}$ , i.e., we have over $\{P\}$ . Thus, we have to be careful to avoid comprehension whenever we want to retain “refutational” information. This leads in particular to a wider gap between “subsets of A” and “sets that inject into A.” We will return to this point in Remark 4.7.1 and Section 5.

Nevertheless, we cannot really do mathematics without comprehension. Fortunately, it is a fairly harmless assumption: even if we start from a hyperdoctrine without comprehension, we can add comprehensions “freely,” replacing the types by “formal comprehensions” or “pre-sets” (types with an “existence predicate”).

Proposition 4.2.6. For any intuitionistic hyperdoctrine $\mathcal {P} : \mathbf {T}^{\mathrm {op}} \to \mathcal {I}\mathit {nt}$ , there is an intuitionistic hyperdoctrine with comprehension $\mathcal {P}^{\{\}} : (\mathbf {T}^{\{\}})^{\mathrm {op}} \to \mathcal {I}\mathit {nt}$ in which:

  • The objects of $\mathbf {T}^{\{\}}$ are pairs $(A,P)$ with $A\in \mathbf {T}$ and $P \in \mathcal {P}(A)$ .

  • The morphisms $(A,P) \to (B,Q)$ are pairs of $f:A\to B$ and $g:P \to f^*Q$ .

  • The objects of $\mathcal {P}^{\{\}}(A,P)$ are those of $\mathcal {P}(A)$ .

  • The morphisms $Q\to R$ in $\mathcal {P}^{\{\}}(A,P)$ are morphisms $P\land Q \to R$ in $\mathcal {P}(A)$ .

Proof The product in $\mathbf {T}^{\{\}}$ is , and the terminal object is $(1,\mathbf {1})$ . It is straightforward to show that $\mathcal {P}^{\{\}}(A,P)$ is bicartesian closed and that $\mathcal {P}^{\{\}}$ is a functor. The quantifiers are and . The comprehension of $Q \in \mathcal {P}^{\{\}}(A,P) = \mathcal {P}(A)$ is $(A, P\land Q)$ .
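For orientation, here is a hypothetical Lean 4 sketch of this construction specialized to the propositions-as-types hyperdoctrine over `Type` (all names ours): objects are types equipped with an "existence" predicate, morphisms preserve it, and comprehension just conjoins a new predicate onto the old one.

```lean
/-- An object of the free-comprehension base category: a type with a chosen
    predicate, playing the role of the pair (A, P) in Proposition 4.2.6. -/
structure PreSet where
  carrier : Type
  pred : carrier → Prop

/-- A morphism (A, P) → (B, Q): a map on carriers that sends P into Q. -/
structure PreSetHom (X Y : PreSet) where
  map  : X.carrier → Y.carrier
  resp : ∀ a, X.pred a → Y.pred (map a)

/-- Comprehension of a further predicate Q over (A, P) is (A, P ∧ Q). -/
def PreSet.compr (X : PreSet) (Q : X.carrier → Prop) : PreSet where
  carrier := X.carrier
  pred := fun a => X.pred a ∧ Q a
```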

Remark 4.2.7. The construction of Proposition 4.2.6 appears in many places with many names. Categorically, $\mathbf {T}^{\{\}}$ is the “Grothendieck construction” of the composite ; for the Calculus of Constructions it is the “first-order deliverables” of [Reference McKinna33]. The general construction has a universal property of adding comprehensions “freely”; see, e.g., [Reference Trotta51] for the posetal version.

Proposition 4.2.8. For any affine hyperdoctrine $\mathcal {P} : \mathbf {T}^{\mathrm {op}} \to \mathcal {A}\mathit {ff}$ , there is an affine hyperdoctrine with comprehension $\mathcal {P}^{\{\}} : (\mathbf {T}^{\{\}})^{\mathrm {op}} \to \mathcal {A}\mathit {ff}$ in which:

  • The objects of $\mathbf {T}^{\{\}}$ are pairs $(A,P)$ with $A\in \mathbf {T}$ and $P \in \mathcal {P}(A)$ .

  • The morphisms $(A,P) \to (B,Q)$ are pairs of $f:A\to B$ and .

  • The objects of $\mathcal {P}^{\{\}}(A,P)$ are those of $\mathcal {P}(A)$ .

  • The morphisms $Q\to R$ in $\mathcal {P}^{\{\}}(A,P)$ are morphisms in $\mathcal {P}(A)$ .

Proof The product in $\mathbf {T}^{\{\}}$ is , and the terminal object is $(1,\top )$ . Note that . The same operations as in $\mathcal {P}(A)$ lift to make $\mathcal {P}^{\{\}}(A,P)$ semicartesian $\ast $ -autonomous with products and a Seely comonad. The quantifiers are and .

Syntactically, comprehension corresponds to an operation taking a proposition P in the context of a variable $x:A$ to a type $\{ x:A | P(x) \}$ . In the intuitionistic case, rules for this operation can be found in [Reference Jacobs25, Section 4.6]; note that P cannot contain any variables other than x. (It is possible to formulate a more general kind of comprehension without this restriction, at the expense of introducing dependent types.) The affine case is essentially identical, but due to Proposition 4.2.5 we will emphasize the essentially affirmative nature of affine comprehension by writing it as

4.3 Leibniz–Lawvere equality

This operation will not be very useful for us, but we sketch it briefly to explain why.

Definition 4.3.1 [Reference Lawvere28], [Reference Jacobs25, Section 3.4].

A $\mathcal {K}$ -valued hyperdoctrine $\mathcal {P} : \mathbf {T}^{\mathrm {op}}\to \mathcal {K}$ has Leibniz–Lawvere equality if for any diagonal $\triangle _A : A\to A\times A$ and object B, the functor $(1_B\times \triangle _A)^*$ has a partial left adjoint defined at the terminal object and satisfying the Beck–Chevalley condition.

We denote the value of this left adjoint by $\mathsf {eq}_A \in \mathcal {P}(B\times A\times A)$ .

Proposition 4.3.2. If $\mathcal {P}$ is an intuitionistic hyperdoctrine with Leibniz–Lawvere equality, then $\mathcal {P}_{\pm }$ also has Leibniz–Lawvere equality, given by $(\mathsf {eq}_A, \neg \mathsf {eq}_A)$ .

Note that $(\mathsf {eq}_A, \neg \mathsf {eq}_A)$ is always affirmative. In fact, more generally we have:

Proposition 4.3.3. In any affine hyperdoctrine with Leibniz–Lawvere equality, the predicate $\mathsf {eq}_A$ is affirmative, i.e., we have a map in $\mathcal {P}(B\times A\times A)$ .

Proof By the universal property of $\mathsf {eq}_A$ , such a morphism is determined by a map in $\mathcal {P}(B\times A)$ . But , so it suffices to give a morphism $\top \to \triangle ^* \mathsf {eq}_A$ , and this is just the unit of the partial adjunction.

See [Reference Grišin21] for a more syntactic argument. Unlike the analogous Proposition 4.2.5 for comprehension, this result makes Leibniz–Lawvere equality unsuitable for us. Indeed, equality was our primary example in Section 1 of a non-affirmative proposition (with nontrivial refutations). Thus, instead of using Leibniz–Lawvere equality, we will follow [Reference Bishop9, Reference Hyland, Johnstone and Pitts24] in equipping types with equality relations (see Sections 5 and 6).

4.4 Higher-order structures

Many higher-order structures are properties of the base category T that don’t affect the hyperdoctrine over it; these are automatically preserved by the antithesis construction. For instance, we can ask that T be cartesian closed; this corresponds syntactically to enhancing the base type theory of our first-order logic to a simply typed $\lambda $ -calculus, with operation types $B^A$ whose canonical elements are abstractions $\lambda x.t$ , satisfying $\beta $ and $\eta $ conversion rules.Footnote 11 (These are usually called function types, but for us “functions” will be defined to be operations that respect a given equality relation; see Sections 5 and 6.) We observe:

Proposition 4.4.1. If T is cartesian closed, then so is the $\mathbf {T}^{\{\}}$ defined in Propositions 4.2.6 and 4.2.8.

Proof The exponentials in the two cases are

where $\mathsf {ev} : B^A \times A \to B$ is the evaluation in T.

Similarly, we can ask that T be equipped with a comprehension category or category with families, unrelatedly to the hyperdoctrine. This corresponds syntactically (again, modulo coherence issues that can be addressed as in [Reference Hofmann22, Reference Lumsdaine and Warren31]) to enhancing the base type theory with dependent types, possibly with any desired type formers such as $\Sigma $ -types, $\Pi $ -types, identity types, etc. (which are, at least a priori, unrelated to the hyperdoctrine and its quantifiers). Put differently, this results in a logic-enriched dependent type theory in the sense of [Reference Aczel and Gambino3] in which the logic is that of the hyperdoctrine. Since this structure is likewise undisturbed by the antithesis construction on the hyperdoctrine, we have an antithesis translation from affine-logic-enriched type theory into intuitionistic-logic-enriched type theory. We leave it to the reader to extend Proposition 4.4.1 to such cases.

Example 4.4.2. In particular, as in Examples 4.1.3, we can regard a comprehension category with $\Sigma $ - and $\Pi $ -types as itself an intuitionistic hyperdoctrine with comprehension. That is, any type theory admits an intuitionistic-logic enrichment given by propositions-as-types. Applying the antithesis construction, we obtain a translation from affine-logic-enriched dependent type theory into ordinary intuitionistic dependent type theory. Similarly, we can apply the antithesis construction to the hyperdoctrine of h-propositions, or the elements of some fixed universe.

Remark 4.4.3. This way of applying the antithesis construction to dependent type theory acts only on the “top level” of type dependency, and we will not attempt to extend it further in this paper (although see Remark 6.18). In particular, there are by now many different approaches to “linear dependent type theory,” and it is unclear which, if any, of them would be appropriate for such an extended translation. The lack of a definite answer to this question is one obstacle to a native “affine constructive mathematics,” since dependent type theory has definite advantages over higher-order logic as a foundational system for all of mathematics. But we can still use affine logic, by way of the antithesis translation, to say useful things about the top-level logic of intuitionistic dependent type theory.

4.5 Generic predicates

This is the primary higher-order structure that does interact with a hyperdoctrine.

Definition 4.5.1. A generic predicate Footnote 12 in a hyperdoctrine $\mathcal {P} : \mathbf {T}^{\mathrm {op}}\to \mathcal {K}$ is an object $\Omega \in \mathbf {T}$ with an element such that for any $A\in \mathbf {T}$ and $P\in \mathcal {P}(A)$ , there exists a (not necessarily unique) $f:A\to \Omega $ and isomorphism .

Example 4.5.2. If H is a small complete Heyting algebra, the intuitionistic hyperdoctrine $\mathcal {P}(A) = \mathbf {H}^A$ has a generic predicate with $\Omega $ the underlying set of H and the identity function. A similar argument applies to the affine hyperdoctrine $\mathcal {P}(A) = \mathbf {L}^A$ , if $\mathbb {L}$ is a small $\ast $ -autonomous complete lattice with a Seely comonad.

Example 4.5.3. The subobject classifier of an elementary topos is a generic predicate for the hyperdoctrine of subobjects. More generally, a preorder-valued intuitionistic hyperdoctrine with cartesian closed base and a generic predicate is a tripos [Reference Hyland, Johnstone and Pitts24].

Example 4.5.4. A comprehension category, regarded as an intuitionistic hyperdoctrine, does not generally have a generic predicate. However, its restricted hyperdoctrine of elements of some universe does have one, namely the universe.

Proposition 4.5.5. If $\Omega $ is a generic predicate for $\mathcal {P}$ , then $(\Omega ,1)$ is a generic predicate for the $\mathcal {P}^{\{\}}$ defined in Propositions 4.2.6 and 4.2.8, where $1$ is the terminal object of $\mathcal {P}(\Omega )$ .

Proposition 4.5.6. If $\mathcal {P}$ is an intuitionistic hyperdoctrine with comprehension and a generic predicate, then $\mathcal {P}_{\pm }$ also has a generic predicate.

Proof Over $\Omega \times \Omega $ we have two canonical predicates and . Let $\Omega _{\pm }$ be the comprehension of . Then to give a morphism $f:A \to \Omega _{\pm }$ is the same as to give two morphisms ${f}^+ : A\to \Omega $ and ${f}^- : A \to \Omega $ , corresponding to predicates and over A, such that is initial in $\mathcal {P}(A)$ . Thus, $\Omega _{\pm }$ is a generic predicate for $\mathcal {P}_{\pm }$ .

Example 4.5.7. In the hyperdoctrine of subobjects in a topos, $\Omega _{\pm }$ is the subobject of $\Omega \times \Omega $ consisting internally of pairs of incompatible propositions.

Syntactically, a generic predicate corresponds to having an (impredicative) type $\Omega $ of all propositions. The usual way of presenting a higher-order type theory of this sort is to define the propositions to be the terms of type $\Omega $ , or at least bijective to them. In a hyperdoctrine with a generic predicate, we have only an essentially surjective function $\mathbf {T}(A,\Omega ) \to \mathcal {P}(A)$ , but as in [Reference Hyland, Johnstone and Pitts24] we can use this to replace $\mathcal {P}(A)$ by an equivalent category whose set of objects is precisely $\mathbf {T}(A,\Omega )$ .

Thus, with Proposition 4.5.6 we can translate affine higher-order logic into intuitionistic higher-order logic, where both have comprehensions and a type of propositions. The syntactic expression of the type of propositions derived from Proposition 4.5.6 is

Note that unlike the syntactic antithesis translations for propositional and first-order logic in Figures 1 and 2, this is a definition of a type, not a proposition or predicate. The antithesis translation does not modify the collection of types or most of the operations on them, but it does change the type of propositions.

4.6 Infinity

Finally, as a starting point for concrete mathematics, we require at least a type of natural numbers that permits definitions by recursion and proofs by induction. The intuitionistic version of this is straightforward.

Definition 4.6.1. In an intuitionistic hyperdoctrine, a natural numbers type is an object $N\in \mathbf {T}$ together with morphisms $o:1\to N$ and $s:N\to N$ such that:

  1. (i) For any objects $A,B\in \mathbf {T}$ with $f:A\to B$ and $g:A\times N\times B\to B$ , there exists a morphism $h:A\times N\to B$ making the following diagrams commute:

  2. (ii) For any predicate $P\in \mathcal {P}(A\times N)$ , the following entailment holds:

    (4.6.2) $$ \begin{align} P_a(0)\,\land\, \forall k. (P_a(k) \to P_a(k+1)) \;\vdash_{a:A}\; \forall n. P_a(n). \end{align} $$

Syntactically, the diagrams in (i) say that $h(a,0) = f(a)$ and $h(a,n+1) = g(a,n,h(a,n))$ , as we expect for an operation defined recursively. We have expressed the induction rule (4.6.2) in syntax already; the reader is free to re-express it in more semantic language. Note that h in (i) is not required to be unique; thus N is only a “weak natural numbers object” in T. Such uniqueness is irrelevant for us, as, with a defined equality on the codomain (see Sections 5 and 6), operations defined by recursion will always be unique as functions.
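For readers who prefer code, the recursion scheme in (i) is just the familiar primitive-recursion combinator; a Lean 4 sketch (the name `natRec` is ours, and `Nat.rec` would serve equally well):

```lean
/-- Primitive recursion with a parameter, matching h(a,0) = f(a) and
    h(a,n+1) = g(a,n,h(a,n)) from Definition 4.6.1(i). -/
def natRec {A B : Type} (f : A → B) (g : A → Nat → B → B) : A → Nat → B
  | a, 0     => f a
  | a, n + 1 => g a n (natRec f g a n)
```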

The affine version of induction is somewhat less obvious, but the following will be appropriate for us.

Definition 4.6.3. In an affine hyperdoctrine, a natural numbers type is $(N,o,s)$ satisfying (i) of Definition 4.6.1 and such that for any predicate $P\in \mathcal {P}(A\times N)$ , the following entailment holds:

(4.6.4)

Note that the induction step in (4.6.4) is marked with the exponential modality. This is natural if we think of the modality as marking a hypothesis that can be used more than once, as the induction step must certainly be “used” n times in order to conclude $P(n)$ . But it is also mandated by the antithesis translation.

Lemma 4.6.5. If an intuitionistic hyperdoctrine $\mathcal {P}$ contains a natural numbers type, so does its antithesis translation $\mathcal {P}_{\pm }$ .

Proof The antithesis translation of (4.6.4) consists of the following two entailments:

$$ \begin{align*} {P}^+(0)\,\land\, \forall k. (({P}^+(k) \to {P}^+(k+1)) \land ({P}^-(k+1) \to {P}^-(k))) \;&\vdash_{P:\Omega^{\mathbb{N}}}\; \forall n. {P}^+(n), \\ \exists n. {P}^-(n)\,\land\, \forall k. (({P}^+(k) \to {P}^+(k+1)) \land ({P}^-(k+1) \to {P}^-(k))) \;&\vdash_{P:\Omega^{\mathbb{N}}}\; {P}^-(0). \end{align*} $$

Both can be proven easily from (4.6.2).
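For instance, the second entailment follows from the downward half of the step hypothesis by the following descent lemma, checked here in Lean 4 under a propositions-as-types reading (the name `descend` is ours):

```lean
/-- If a predicate descends along successor, any witness propagates down to 0;
    applying this to P⁻ and the hypothesis ∃ n, P⁻(n) gives P⁻(0). -/
theorem descend {P : Nat → Prop} (step : ∀ k, P (k + 1) → P k) :
    ∀ n, P n → P 0 := by
  intro n
  induction n with
  | zero => exact id
  | succ k ih => exact fun h => ih (step k h)
```

The first entailment is ordinary induction on ${P}^+$ , using only the upward half of the step hypothesis.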

On the other hand, if we drop the modality in (4.6.4), then its antithesis translation would also include a third entailment

(4.6.6) $$ \begin{align} {P}^+(0)\,\land\, \exists n. {P}^-(n) \;\vdash_{P:\Omega^{\mathbb{N}}} \exists k. ({P}^+(k) \land {P}^-(k+1)), \end{align} $$

which is equivalent to excluded middle. Specifically, let $P(0) = (\top ,\bot )$ and $P(n) = (\bot ,\top )$ for $n\ge 2$ , while $P(1) = (Q,\neg Q)$ for some arbitrary statement Q; then by (4.6.6) we have either $\neg Q$ (if $k=0$ ) or Q (if $k=1$ ). Thus, we are forced to formulate affine induction as in (4.6.4).

Proposition 4.6.7. If $\mathcal {P}$ has a natural numbers type, so does the $\mathcal {P}^{\{\}}$ defined in Propositions 4.2.6 and 4.2.8.

4.7 Conclusions

In the rest of the paper, we will apply the antithesis translation to recover well-known intuitionistic definitions from naturally defined affine ones. On both intuitionistic and affine sides we will use higher-order logic with comprehension, operation types, a generic predicate, and a natural numbers type. We have seen that this combination is preserved by the antithesis translation, and that nearly all naturally occurring models of intuitionistic logic satisfy it. (One exception is that a tripos need not have comprehension, but we can add comprehensions to it as in Proposition 4.2.6.) If necessary to disambiguate between affine and intuitionistic notions, we will use the annotations $\mathfrak {A}$ for affine and $\mathfrak {I}$ for intuitionistic; e.g., “ $\mathfrak {A}$ -predicate” and “ $\mathfrak {I}$ -predicate,” or $\Omega ^{\mathfrak {A}}$ and $\Omega ^{\mathfrak {I}}$ .

Remark 4.7.1. We will frequently be discussing structured types such as groups, rings, posets, topological spaces, and even sets (types with an equality predicate). Since ordinary first-order and higher-order logic do not allow quantification over types (i.e., “for all types A” internally to the logic), theorems relating to structured types are technically metatheorems. In particular, the axioms of such structures are assumed entailments $P\vdash Q$ , or equivalently , and hence imply . In other words, axioms are affirmative.

Another take on this is possible if we use a base theory with dependent types and type universes. In this case, we can quantify over all small types (those belonging to some universe $\mathcal {U}$ ), and so it would be possible to assume non-affirmative axioms about a structured small type. However, if we also have comprehension, we can define types of small structured types (e.g., the type of small groups), and in this case by Propositions 4.2.4 and 4.2.5 the axioms will again be affirmative, or at least arbitrarily duplicable.

5 Intuitionistic sets and functions

In most of the rest of the paper, we will first state definitions in affine logic and then translate them into intuitionistic logic. But for sets and equality, we begin with the intuitionistic context to fix conventions.

As mentioned after Proposition 4.3.3, we follow Bishop’s dictum:

The totality of all mathematical objects constructed in accordance with certain requirements is called a set. The requirements of the construction, which vary with the set under consideration, determine the set.… Each set will be endowed with a binary relation $=$ of equality. This relation is a matter of convention, except that it must be an equivalence relation… [Reference Bishop and Bridges10, Section 2.1]

Thus a “Bishop set” has two ingredients: the “requirements,” which we regard as the specification of a type, and the equality, which is an equivalence relation.

Definition 5.1. A set is a type A with a predicate $=$ on $A\times A$ such that

$$ \begin{align*} \begin{array}{rll} &\vdash_{x: A}& x= x\\ x= y &\vdash_{x,y: A}& y= x\\ (x= y) \land (y= z) &\vdash_{x,y,z: A}& x= z. \end{array} \end{align*} $$

Remark 5.2. Suppose we start with a hyperdoctrine without comprehension, such as a tripos, apply Proposition 4.2.6 to obtain comprehension, and then interpret Definition 5.1. In terms of the original hyperdoctrine, the resulting notion of “set” is essentially a partial equivalence relation. It is common in tripos theory and realizability to work directly with partial equivalence relations. Instead, we divorce existence from equality, incorporating the former into a comprehension operation on types. This matches Bishop’s two-stage conception better, as well as common mathematical practice (the construction of subsets is distinct from quotient sets), and generalizes better to the affine context.

Example 5.3. If our base theory has Leibniz–Lawvere equality types $\mathsf {eq}_A$ , then every type A has a “minimal” structure of a set, with equality $\mathsf {eq}_A$ .
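In type-theoretic terms a set in this sense is just a setoid; a minimal Lean 4 sketch of Definition 5.1 and of the “minimal” structure of Example 5.3 (names ours; Lean's own `Setoid` packages the same data):

```lean
/-- A Bishop-style set: a type of "constructions" together with a chosen
    equivalence relation serving as its equality. -/
structure BSet where
  carrier : Type
  eq      : carrier → carrier → Prop
  refl    : ∀ x, eq x x
  symm    : ∀ {x y}, eq x y → eq y x
  trans   : ∀ {x y z}, eq x y → eq y z → eq x z

/-- The "minimal" set structure on a type, using its Leibniz equality. -/
def minimalBSet (A : Type) : BSet where
  carrier := A
  eq      := Eq
  refl    := fun _ => rfl
  symm    := Eq.symm
  trans   := Eq.trans
```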

Notation 5.4. If A is a set and P is a predicate on its underlying type, we implicitly give the comprehension $\{ x: A | P(x) \}$ the same equality predicate as A, making it again a set.

Example 5.5. If A and B are sets, their cartesian product set is the product type $A\times B$ with equality defined by $((a_1,b_1)= (a_2,b_2)) \equiv (a_1= a_2) \land (b_1= b_2)$ .

Example 5.6. The type of propositions $\Omega $ is a set with equality defined by $(P= Q) \equiv (P\leftrightarrow Q)$ .

Definition 5.7. A relation on a set A is a predicate P on its underlying type such that

$$\begin{align*}(x= y) \land P(x) \vdash_{x,y: A} P(y). \end{align*}$$

A relation is also called a subset of A, with $x\in P$ meaning $P(x)$ . We overload notation by writing P as $\{ x: A | P(x) \}$ , though a subset is not itself a set.

The relations on a given set are closed under all the logical operations. Put differently, the subsets of a set form a sub-Heyting-algebra of the predicates on its underlying type. We write $U\cap V$ , $U\cup V$ , $U\subseteq V$ , and so on for the resulting operations and relations on subsets.

Definition 5.8. A function between two sets is an operation $f:B^A$ such that

$$\begin{align*}\begin{array}{rll} &\vdash_{x: A} & f(x) \in B\\ (x_1= x_2) & \vdash_{x_1,x_2: A}& (f(x_1)= f(x_2)). \end{array} \end{align*}$$

The function set $A\to B$ is defined by the equality $(f= g) \equiv \forall x^A. (f(x)= g(x))$ .

Note the notation: the operation type is $B^A$ , and the function set is $A\to B$ .

Example 5.9. We can regard a predicate on A as an operation $P:\Omega ^A$ , and we have $(x= y) \land P(x) \vdash _{x,y: A} P(y)$ if and only if $ (x= y) \vdash _{x,y: A} (P(x) \leftrightarrow P(y))$ . Thus, a relation on A is the same as a function from A to the set $\Omega $ (Example 5.6), so we can define the power set of A as $\mathscr {P} A = (A\to \Omega )$ . Its induced equality relation is $(P= Q) \equiv \forall x^A. (P(x) \leftrightarrow Q(x))$ .

One final remark concerns the following alternative definition of “function.”

Definition 5.10. For sets $A,B$ , an anafunction Footnote 13 is a relation F on $A\times B$ that is total and functional, i.e., such that

$$\begin{align*}\begin{array}{rll} &\vdash_{x: A}& \exists y^B. F(x,y)\\ F(x,y_1) \land F(x,y_2) &\vdash_{x: A, y_1:B,y_2: B} & (y_1= y_2). \end{array}\end{align*}$$

If $f:A\to B$ is a function, then $(f(x)= y)$ is an anafunction; the principle of function comprehension (a.k.a. unique choice) says that every anafunction is of this form. Function comprehension is not provable in first-order logic, higher-order logic, or logic-enriched type theory, and indeed fails in many triposes. Nevertheless, constructivists of Bishop’s school often assume it implicitly (one can argue for it by positing a closer relationship between “operations” and the existential quantifier than is implied by first-order or higher-order logic).

In the absence of function comprehension, it is often preferable to use anafunctions rather than functions. For instance, this is how one builds the topos represented by a tripos (such as a realizability topos), and in particular how one recovers the correct internal logic of a topos from its tripos of subobjects.

6 Affine sets and functions

We now switch to the affine context, for this section and the rest of the paper, except when discussing the antithesis translation. In the definition of $\mathfrak {A}$ -sets we find our first additive/multiplicative bifurcation.

Definition 6.1. A set is a type with a predicate $\circeq $ on $A\times A$ such that

$$\begin{alignat*}{2} &\;\vdash_{x: A}\;&\;& x\circeq x\\ x\circeq y &\;\vdash_{x,y: A}&\;& y\circeq x\\ (x\circeq y) \boxtimes (y\circeq z) &\;\vdash_{x,y,z: A}&\;& x\circeq z.\\ \end{alignat*} $$

A set is strong if it satisfies the stronger transitivity axiom

$$\begin{alignat*}{2} (x\circeq y) \sqcap (y\circeq z) &\;\vdash_{x,y,z: A}&\;& x\circeq z. \end{alignat*} $$

Example 6.2. As in the intuitionistic case, if our first-order affine logic has Leibniz–Lawvere equality types $\mathsf {eq}_A$ (Definition 4.3.1), then every type A has a “minimal” structure of an $\mathfrak {A}$ -set, with equality $\mathsf {eq}_A$ . This is less useful than in the intuitionistic case (Example 5.3), however, since by Proposition 4.3.3 any such $\mathfrak {A}$ -set has affirmative equality, while we are often interested in $\mathfrak {A}$ -sets with non-affirmative equality.

Notation 6.3. Recall that if P is an $\mathfrak {A}$ -predicate on an $\mathfrak {A}$ -type A, we write for the comprehension type. If A is given as a set, we implicitly give the same equality predicate.

Under the antithesis translation, an $\mathfrak {A}$ -set is an $\mathfrak {I}$ -type with two binary predicates $(=,\neq )$ such that

$$\begin{align*}\begin{array}{rll} &\vdash_{x,y: A}& \neg ((x= y)\land (x\neq y))\\ &\vdash_{x: A}& x= x\\ x= y &\vdash_{x,y: A} & y= x\\ x\neq y &\vdash_{x,y: A} & y\neq x\\ (x= y) \land (y= z) &\vdash_{x,y,z: A}& x= z\\ (x\neq z) \land (y= z) &\vdash_{x,y,z: A}& x\neq y\\ (x\neq z) \land (x= y) &\vdash_{x,y,z: A}& y\neq z. \end{array} \end{align*}$$

The axioms involving only $= $ say that $(A,= )$ is an $\mathfrak {I}$ -set, and the last two axioms say that $\neq $ is an $\mathfrak {I}$ -relation (Definition 5.7) on $A\times A$ . Given this, the first axiom is equivalent to $\vdash _{x: A}\neg (x\neq x)$ . Thus we have:

Theorem 6.4. Under the antithesis translation:

  1. (i) An $\mathfrak {A}$ -set is an $\mathfrak {I}$ -set equipped with an inequality relation: a relation $\neq $ such that $\neg (x\neq x)$ and $(x\neq y) \to (y\neq x)$ (i.e., it is irreflexive and symmetric).

  2. (ii) It is strong if and only if $\neq $ is an apartness, i.e., $(x\neq z) \to (x\neq y) \lor (y\neq z)$ .

  3. (iii) Its equality is affirmative if and only if $\neq $ is denial: $(x\neq y) \equiv \neg (x= y)$ .

  4. (iv) Its equality is refutative if and only if $\neq $ is tight: $\neg (x\neq y) \equiv (x= y)$ .
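The data in Theorem 6.4 can be recorded as a Lean 4 structure; the following sketch (all names ours) bundles an $\mathfrak {I}$ -set with an inequality relation and states the optional properties from (ii)–(iv) as predicates on it.

```lean
/-- The antithesis reading of an 𝔄-set: an equivalence relation `eq` together
    with an irreflexive, symmetric, `eq`-extensional inequality `neq`. -/
structure ASet where
  carrier : Type
  eq  : carrier → carrier → Prop
  neq : carrier → carrier → Prop
  eq_refl    : ∀ x, eq x x
  eq_symm    : ∀ {x y}, eq x y → eq y x
  eq_trans   : ∀ {x y z}, eq x y → eq y z → eq x z
  neq_irrefl : ∀ x, ¬ neq x x
  neq_symm   : ∀ {x y}, neq x y → neq y x
  neq_ext    : ∀ {x y z}, neq x z → eq y z → neq x y

/-- `neq` is an apartness: the strong case, Theorem 6.4(ii). -/
def ASet.Strong (A : ASet) : Prop :=
  ∀ x y z, A.neq x z → A.neq x y ∨ A.neq y z

/-- `neq` is denial inequality: affirmative equality, Theorem 6.4(iii). -/
def ASet.Affirmative (A : ASet) : Prop :=
  ∀ x y, A.neq x y ↔ ¬ A.eq x y

/-- `neq` is tight: refutative equality, Theorem 6.4(iv). -/
def ASet.Refutative (A : ASet) : Prop :=
  ∀ x y, ¬ A.neq x y ↔ A.eq x y
```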

Example 6.5. If A and B are sets, their cartesian product set is the cartesian product type $A\times B$ with equality $((a_1,b_1)\circeq (a_2,b_2)) \equiv (a_1\circeq a_2) \sqcap (b_1\circeq b_2)$ .

Under the antithesis translation, this yields the cartesian product of $\mathfrak {I}$ -sets with the disjunctive product inequality (or product apartness): $((a_1,b_1)\neq (a_2,b_2)) \equiv (a_1\neq a_2) \lor (b_1\neq b_2)$ .

Example 6.6. The tensor product set $A\boxtimes B$ has the same underlying type, but with the equalities combined multiplicatively: $((a_1,b_1)\circeq (a_2,b_2)) \equiv (a_1\circeq a_2) \boxtimes (b_1\circeq b_2)$ .

In the antithesis translation, this yields the weaker inequality $((a_1,b_1)\neq (a_2,b_2)) \equiv ((a_1= a_2) \to (b_1\neq b_2)) \land ((b_1= b_2) \to (a_1\neq a_2))$ .

If A and B have affirmative equality, so does $A\boxtimes B$ , but $A\times B$ need not. If A and B have strong or refutative equality, so does $A\times B$ , but $A\boxtimes B$ need not.

Example 6.7. The type $\Omega $ is a set with

(6.8)

In the antithesis translation, this yields

$$ \begin{align*} (P= Q) &\equiv ({P}^+ \leftrightarrow {Q}^+) \land ({P}^- \leftrightarrow {Q}^-),\\ (P\neq Q) &\equiv ({P}^+ \land {Q}^-) \lor ({P}^- \land {Q}^+). \end{align*} $$

We could also use $\boxtimes $ in (6.8), but using $\sqcap $ yields a more useful $\neq $ and has better formal properties (see Example 6.14 and Section 8). In neither case is the equality strong, nor is it affirmative nor refutative even if P and Q are both one or the other.

Remark 6.9. The notion of “strong set” is quite natural under the antithesis translation, since apartness relations are well-studied in intuitionistic constructive mathematics. However, to a reader familiar with linear logic (and particularly with linear proof theory), the $\sqcap $ -transitivity of a strong set may seem unreasonably strong. An assumption of $(x\circeq y)\sqcap (y\circeq z)$ means that we can choose to use either $x\circeq y$ or $y\circeq z$ but not both, so how could we ever hope to prove $x\circeq z$ ?

In fact, however, there are many sets that can be proven to be strong inside affine logic. The key is that we don’t have to start by deciding which of $x\circeq y$ and $y\circeq z$ to use: we can decompose x, y, and z and use the definition of $\circeq $ to make case distinctions, and then make different choices of $x\circeq y$ and $y\circeq z$ in different cases.

A paradigmatic example is the natural numbers $\mathbb {N}$ , for which we define equality recursively in the usual way:

We prove $(x\circeq y)\sqcap (y\circeq z) \vdash _{x,y,z:\mathbb {N}} (x\circeq z)$ by induction on $x,y,z$ . The case when x and z are both $0$ is trivial. If x is $0$ but z is a successor, then either y is $0$ , in which case we can use $y\circeq z$ to get a contradiction, or y is a successor, in which case we can use $x\circeq y$ to get a contradiction. The case when x is a successor and z is $0$ is symmetric. Finally, if x is $x'+1$ and z is $z'+1$ , then if y is $0$ we can use either $x\circeq y$ or $y\circeq z$ to get a contradiction, while if y is a successor $y'+1$ then our goal reduces to the inductive hypothesis $(x'\circeq y')\sqcap (y'\circeq z') \vdash _{x',y',z':\mathbb {N}} (x'\circeq z')$ .
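Under the antithesis translation, the statement just proved becomes cotransitivity of the inequality on $\mathbb {N}$ (which, equality on $\mathbb {N}$ being decidable, is just denial inequality). This can also be checked in Lean 4 directly from decidability, a sketch that uses case analysis on $x=y$ rather than reproducing the structural induction above:

```lean
/-- Cotransitivity of ≠ on Nat: the antithesis reading of ⊓-transitivity of
    equality for the strong set ℕ, obtained here by deciding x = y. -/
theorem natCotrans (x y z : Nat) (h : x ≠ z) : x ≠ y ∨ y ≠ z :=
  if hxy : x = y then
    Or.inr (fun hyz => h (hxy.trans hyz))
  else
    Or.inl hxy
```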

We now move on to discuss $\mathfrak {A}$ -relations and subsets.

Definition 6.10. A relation on a set A is a predicate P such that

$$\begin{align*}(x\circeq y) \boxtimes P(x) \vdash_{x,y: A} P(y). \end{align*}$$

A relation is strong if

$$\begin{align*}(x\circeq y) \sqcap P(x) \vdash_{x,y: A} P(y). \end{align*}$$

We also refer to a relation as a subset, writing instead of $P(x)$ , and for P itself. (Unlike in the intuitionistic case, we distinguish this notationally from a comprehension , since the latter discards refutational information.)

Theorem 6.11. Let U be an $\mathfrak {A}$ -subset of an $\mathfrak {A}$ -set A. In the antithesis translation:

  1. (i) U is a complemented subset as in [Reference Bishop and Bridges10, Chapter 3, Definition 2.2]: a pair of $\mathfrak {I}$ -subsets of A such that

  2. (ii) It is strong if and only if is strongly extensional (also called $\neq $ -open):

Proof The subset condition

becomes

The first two say that U and

are $\mathfrak {I}$ -subsets, and the last is the “strong disjointness” condition in (i). The “strong extensionality” condition in (ii) is exactly the contrapositive information arising from the strong subset condition.

Definition 6.12. A function between two sets is an operation $f:B^A$ such that

$$\begin{align*}\begin{array}{rll} (x_1\circeq x_2) & \vdash_{x_1,x_2: A}& (f(x_1)\circeq f(x_2)). \end{array}\end{align*}$$

The function set is defined by

Theorem 6.13. In the antithesis translation, an $\mathfrak {A}$ -function $f:A\to B$ is an $\mathfrak {I}$ -function that is strongly extensional, i.e., $(f(x_1)\neq f(x_2)) \vdash _{x_1,x_2: A} (x_1\neq x_2)$ . The inequality on $A\to B$ is $ (f\neq g) \equiv \exists x^A. (f(x)\neq g(x))$ .
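Continuing the `ASet` sketch introduced after Theorem 6.4, the antithesis reading of a function of $\mathfrak {A}$ -sets can be packaged as follows (names ours):

```lean
/-- An 𝔄-function, read intuitionistically: an operation that preserves the
    chosen equality and is strongly extensional (reflects inequality). -/
structure AFun (A B : ASet) where
  toFun      : A.carrier → B.carrier
  resp_eq    : ∀ {x y}, A.eq x y → B.eq (toFun x) (toFun y)
  strong_ext : ∀ {x y}, B.neq (toFun x) (toFun y) → A.neq x y
```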

Example 6.14. We have $(x\circeq y)\boxtimes P(x) \vdash P(y)$ iff

, and symmetry of $\circeq $ then implies

, i.e.,

. Therefore, relations on A are the same as functions from A to the set $\Omega $ from Example 6.7. (Note that this requires the $\sqcap $ in (6.8).) Thus we can define the power set of A to be

. Its induced equality is

In the antithesis translation, we have

Example 6.15. In the antithesis translation, an $\mathfrak {A}$ -function $f:A\times B\to C$ must be strongly extensional for the disjunctive product inequality, $(f(x_1,y_1)\neq f(x_2,y_2)) \vdash (x_1\neq x_2) \lor (y_1\neq y_2)$ . By contrast, an $\mathfrak {A}$ -function $f:A\boxtimes B\to C$ need only be strongly extensional in each variable separately: $(f(x,y_1) \neq f(x,y_2)) \vdash (y_1\neq y_2)$ and $(f(x_1,y) \neq f(x_2,y)) \vdash (x_1\neq x_2)$ . Both are useful notions; see Example 7.8.

Example 6.16. In particular, functions from B to $\mathscr {P} A = (A\to \Omega )$ classify subsets not of $A\times B$ , but of $A\boxtimes B$ . In the antithesis translation, an $\mathfrak {A}$ -subset of $A\boxtimes B$ is a pair of $\mathfrak {I}$ -subsets

such that

whereas an $\mathfrak {A}$ -subset of $A\times B$ satisfies the stronger condition

The $\mathfrak {A}$ -relations on an $\mathfrak {A}$ -set are closed under the additive connectives, as well as linear negation. This defines the additive operations of set algebra:

Here the index i in

and ${\textstyle \bigsqcup }_i$ belongs to some type I, while U is a predicate on $I\times A$ that respects the equality of A. In particular, it might be the case that I is itself an $\mathfrak {A}$ -set and U is a predicate on $I\times A$ or $I\boxtimes A$ .

We write to mean ; in the antithesis translation this means that $U\subseteq V$ and . Since commutes with $\sqcap $ , we have . By duality, means .

Like linear negation, the complement of $\mathfrak {A}$ -subsets is involutive ( ${U}^{\perp \perp } = U$ ) but not Boolean: and both assert that U is decidable.

Lemma 6.17. In the antithesis translation, an $\mathfrak {A}$ -subset is nonempty, i.e., , if and only if its affirmative part is $\mathfrak {I}$ -inhabited, i.e., $\exists x^A. (x\in U)$ .

Proof The definition of inequality on $\mathscr {P} A$ gives

Multiplicatives and exponentials do not generally preserve subsets, but they do induce operations on subsets by a reflection or coreflection process:

The poset of subsets of A thereby becomes semicartesian and $\ast $ -autonomous with a Seely comonad. In particular, as a replacement for the false equalities

and

we have the true ones

and

, and the

-coalgebras form a “Heyting algebra of affirmative subsets.” In the antithesis translation,

is the affirmative part of U with its inequality complement:

Thus an “affirmative subset” (i.e.,

) is determined by an ordinary $\mathfrak {I}$ -subset.

Remark 6.18. If $\mathfrak {A}\mathbf {Set}$ denotes the category of $\mathfrak {A}$ -sets and functions,Footnote 14 we have constructed a pseudofunctor $\mathcal {P} : \mathfrak {A}\mathbf {Set}^{\mathrm {op}}\to \mathcal {A}\mathit {ff}$ , which is in fact an affine hyperdoctrine—although, as suggested in Remark 4.1.4, we are generally more interested in quantifiers for the projections $A \boxtimes B \to A$ than $A\times B\to A$ . This affine hyperdoctrine over $\mathfrak {A}\mathbf {Set}$ seems analogous to the “tripos-to-topos” construction [Reference Hyland, Johnstone and Pitts24] in intuitionistic logic, but it differs in two important ways.

Firstly, it is unclear whether the relations on an $\mathfrak {A}$ -set A can be recovered from the category $\mathfrak {A}\mathbf {Set}$ as any sort of “subobject.” Proposition 4.2.5 is discouraging in this regard. Secondly, it is unclear whether the equality relation on an $\mathfrak {A}$ -set A admits any characterization in terms of this hyperdoctrine over $\mathfrak {A}\mathbf {Set}$ : it cannot be the Leibniz–Lawvere equality, since by Proposition 4.3.3 the latter is affirmative.

For these reasons, we will continue to work Bishop-style, with $\mathfrak {A}$ -sets defined to be types equipped with an equality predicate. However, it seems possible that this affine hyperdoctrine over $\mathfrak {A}\mathbf {Set}$ might shed some semantic light on the question of affine type dependency (see Remark 4.4.3).

Finally, we note that unique existence and “anafunctions” (see Definition 5.10) also behave sensibly. Recall that classically we can express “there is at most one x with $P(x)$ ” either as “for all $x,y$ , if $P(x)$ and $P(y)$ , then $x=y$ ” or “there do not exist $x,y$ with $x\neq y$ such that $P(x)$ and $P(y)$ .” Intuitionistically these are no longer equivalent (unless $\neq $ is tight), and only the former is “correct.” But linearly they are again equivalent:

In the antithesis translation, these statements yield the “correct” intuitionistic version augmented by a strong uniqueness “if $x\neq y$ and $P(x)$ , then

.” An even stronger sort of uniqueness would arise from the strong linear condition

which in the antithesis translation yields “if $x\neq y$ , then either

or

.”

Definition 6.19. For $\mathfrak {A}$ -sets $A,B$ , an anafunction from A to B is a relation F on $A\boxtimes B$ that is total and functional, i.e., such that

$$\begin{align*}\begin{array}{rll} &\vdash_{x: A}& {\textstyle\bigsqcup} y^B. F(x,y)\\ F(x,y_1) \boxtimes F(x,y_2) &\vdash_{x: A, y_1:B ,y_2: B}& (y_1\circeq y_2). \end{array}\end{align*}$$

Theorem 6.20. In the antithesis translation, an $\mathfrak {A}$ -anafunction from A to B corresponds to an $\mathfrak {I}$ -anafunction that is “strongly extensional” in the sense that

$$\begin{align*}F(x_1,y_1) \land F(x_2,y_2) \land (y_1\neq y_2) \vdash_{x_1,x_2: A, y_1,y_2: B} (x_1\neq x_2). \end{align*}$$

Proof An $\mathfrak {A}$ -anafunction consists of two $\mathfrak {I}$ -relations

on $A\times B$ such that

The fourth and sixth axioms say that F is an $\mathfrak {I}$ -anafunction. Given this, the second and seventh say

, which implies the first and fifth, and unravels the third to the claimed strong extensionality property.

A function is strongly extensional just when its corresponding anafunction is. Thus, a function comprehension principle is equally sensible affinely as intuitionistically, and in its absence we can once again work with anafunctions instead. Moreover, Theorem 6.20 implies that function comprehension is preserved by the antithesis construction: if an intuitionistic hyperdoctrine $\mathcal {P}$ satisfies function comprehension, so does the affine hyperdoctrine $\mathcal {P}_{\pm }$ .

7 Algebra

Roughly speaking, there are two approaches to intuitionistic constructive algebra. The first uses apartness only minimally; inequality usually means denial $\neg (x= y)$ and is avoided as much as possible. For instance, apartness relations are absent from [Reference Johnstone26], and are only rarely used in [Reference Mines, Richman and Ruitenburg35]. The second approach equips all sets with inequalities (often tight apartnesses),Footnote 15 and all classical definitions are augmented by “strong negative” information such as anti-subgroups and anti-ideals. This is the tradition of Heyting; see [Reference Troelstra and van Dalen50, Chapter 8].

The second approach gives more refined information. For instance, the real numbers are a field in the strong sense that any number apart from $0$ is invertible; but without apartness, all we can say is that they are a local ring in which every noninvertible element is zero. However, carrying apartness relations around is tedious and error-prone, and not every algebraic structure admits a natural apartness:

We could demand that every set come with an inequality, putting inequality on the same footing as equality…With such an approach, whenever we construct a set we must put an inequality on it, and we must check that our functions are strongly extensional. This is cumbersome and easily forgotten, resulting in incomplete constructions and incorrect proofs. [Reference Mines, Richman and Ruitenburg35, p. 31]

Moreover, rewriting all of algebra in “dual” form looks very unfamiliar to the classical mathematician, and even a constructive mathematician may find it unaesthetic.

The antithesis translation resolves this by automatically handling the “bookkeeping” of apartness relations, allowing familiar-looking definitions (written in affine logic) to nevertheless carry the correct constructive meaning (when translated into intuitionistic logic). It also reveals the above two approaches as ends of a continuum: $\mathfrak {I}$ -sets with denial inequality are the $\mathfrak {A}$ -sets with affirmative equality, while $\mathfrak {I}$ -sets with a (tight) apartness are the $\mathfrak {A}$ -sets with a (refutative) strong equality. There are also natural examples in between; see Example 7.8.

Definition 7.1. A group is an ( $\mathfrak {A}$ -)set G together with an element $e: G$ and functions $m:G\boxtimes G\to G$ and $i:G\to G$ such that

$$\begin{alignat*}{4} &\vdash_{x: G}&\;& m(x,e) \circeq x &\qquad &\vdash_{x: G}&\;& m(x,i(x)) \circeq e\\ &\vdash_{x: G}&\;& m(e,x) \circeq x &\qquad &\vdash_{x: G}&\;& m(i(x),x) \circeq e\\ &\vdash_{x,y,z: G}&\;& m(m(x,y),z) \circeq m(x,m(y,z)). \end{alignat*} $$

A group is strong if m is a function on $G\times G$ .

As usual, we write $xy$ and $x^{-1}$ instead of $m(x,y)$ and $i(x)$ .

Theorem 7.2. In the antithesis translation, an $\mathfrak {A}$ -group consists of an $\mathfrak {I}$ -group equipped with an inequality relation such that

$$\begin{align*}\begin{array}{rll} x^{-1}\neq y^{-1} &\vdash_{x,y: G} & x\neq y,\\ x u \neq x v &\vdash_{x,u,v: G} & u\neq v,\\ x u \neq y u &\vdash_{x,y,u: G} & x\neq y. \end{array}\end{align*}$$

The extra condition for G to be strong is

(7.3) $$ \begin{alignat}{2} (x u \neq y v) &\vdash_{x,y,u,v: G} &\;& (x \neq y) \lor (u\neq v), \end{alignat} $$

which is equivalent to $\neq $ being an apartness. In particular:

  • An $\mathfrak {A}$ -group with affirmative equality is precisely an $\mathfrak {I}$ -group.

  • A strong $\mathfrak {A}$ -group with refutative equality is precisely a group with apartness relation in the sense of [Reference Troelstra and van Dalen50, Definition 8.2.2], i.e., an $\mathfrak {I}$ -group with a tight apartness for which the group operations are strongly extensional.

  • An arbitrary $\mathfrak {A}$ -group is precisely an $\mathfrak {I}$ -group with a (symmetric irreflexive) translation invariant inequality as in [Reference Mines, Richman and Ruitenburg35, Exercise II.2.5].

The fact that (7.3) is equivalent to $\neq $ being an apartness is a standard exercise in constructive algebra. In fact, it can be proven internally in affine logic that an $\mathfrak {A}$ -group is strong if and only if it has strong equality.
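As a concrete rendering of Theorem 7.2, here is a hypothetical Lean 4 sketch of the inequality data carried by a group in the antithesis reading; the group laws themselves (stated up to a chosen equality) are elided to keep the sketch short, and all names are ours.

```lean
/-- A translation-invariant inequality on a group: irreflexive, symmetric,
    and reflected by inversion and by multiplication in each argument. -/
structure GroupIneq (G : Type) (mul : G → G → G) (inv : G → G) where
  neq : G → G → Prop
  neq_irrefl : ∀ x, ¬ neq x x
  neq_symm   : ∀ {x y}, neq x y → neq y x
  inv_ext    : ∀ {x y}, neq (inv x) (inv y) → neq x y
  mul_left_ext  : ∀ {x u v}, neq (mul x u) (mul x v) → neq u v
  mul_right_ext : ∀ {x y u}, neq (mul x u) (mul y u) → neq x y

/-- The strong condition (7.3): inequality of products splits disjunctively. -/
def GroupIneq.Strong {G : Type} {mul : G → G → G} {inv : G → G}
    (I : GroupIneq G mul inv) : Prop :=
  ∀ x y u v, I.neq (mul x u) (mul y v) → I.neq x y ∨ I.neq u v
```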

Definition 7.4. A subgroup of a group G is a subset

such that

A subgroup is strong if it satisfies the stronger condition

Theorem 7.5. In the antithesis translation, an $\mathfrak {A}$ -subgroup H of G is:

  1. (i) An $\mathfrak {I}$ -subgroup H of the $\mathfrak {I}$ -subgroup G; together with

  2. (ii) An $\mathfrak {I}$ -subset of G satisfying the following axioms:

Moreover:

  • H is strong iff the last two axioms are replaced by the following stronger one:

  • An affirmative $\mathfrak {A}$ -subgroup of an affirmative $\mathfrak {A}$ -group is precisely an $\mathfrak {I}$ -subgroup of an $\mathfrak {I}$ -group, together with its logical complement .

  • If G is refutative and strong, then H is refutative and strong if and only if is an antisubgroup compatible with the apartness in the sense of [Reference Troelstra and van Dalen50, Definition 8.2.4] together with its logical complement.

Definition 7.6. An $\mathfrak {A}$ -subgroup H is normal if .

In the antithesis translation, if H and G are affirmative then normality reduces to ordinary normality, whereas if they are strong and refutative it reduces to normality for an antisubgroup [Reference Troelstra and van Dalen50, Definition 8.2.7].

Theorem 7.7. Let H be a normal subgroup of G. Then defines a new equality predicate on the underlying type of G, and the resulting set is again a group, denoted $G/H$ .

Proof The closure axioms of a subgroup directly imply the axioms of an equality predicate. It remains to show that m and i are functions $G/H \boxtimes G/H \to G/H$ and $G/H \to G/H$ . For the first, we have

Similarly, for the second we have

Example 7.8. Let $G = \mathbf {2}^{\mathbb {N}}$ be the set of infinite binary sequences, with pointwise addition mod 2, and

. Then G is a strong group with refutative equality, while H is a normal subgroup that is neither strong, affirmative, nor refutative. In the quotient $G/H$ we have

That is, $x\circeq 0$ if x is eventually $0$ , and $x{{\not\circeq }} 0$ if x is $1$ infinitely often. Neither of these is the Heyting negation of the other, so $G/H$ is neither affirmative nor refutative. Similarly, $G/H$ is not strong, so in the antithesis translation its inequality is not an apartness and its multiplication is not strongly extensional for the disjunctive product inequality, though it is for the weaker equality on $G/H\boxtimes G/H$ .

In [Reference Mines, Richman and Ruitenburg35, p. 31] this example is used to argue that not all sets should have inequalities. From our perspective, it shows instead that not all groups should be required to be strong.
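The two halves of the quotient (in)equality with $0$ in this example can be written out in Lean 4 (names ours), together with the incompatibility that an affirmation/refutation pair must satisfy; neither half is the Heyting negation of the other.

```lean
/-- x ≗ 0 in G/H: the binary sequence is eventually 0 (here, eventually false). -/
def eventuallyZero (x : Nat → Bool) : Prop :=
  ∃ N, ∀ n, N ≤ n → x n = false

/-- x ≇ 0 in G/H: the sequence is 1 (here, true) infinitely often. -/
def infinitelyOftenOne (x : Nat → Bool) : Prop :=
  ∀ N, ∃ n, N ≤ n ∧ x n = true

/-- The affirmation and refutation are incompatible, as required. -/
theorem not_both (x : Nat → Bool) :
    eventuallyZero x → infinitelyOftenOne x → False :=
  fun ⟨N, hz⟩ hio =>
    match hio N with
    | ⟨n, hn, ht⟩ => Bool.noConfusion ((hz n hn).symm.trans ht)
```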

A (commutative) ring is an abelian group $(R,+,0)$ with a multiplication function $\cdot :R\boxtimes R\to R$ and unit $1:R$ satisfying the usual axioms; it is strong if both $+$ and $\cdot $ are defined on $R\times R$ . In the antithesis translation:

  • An affirmative $\mathfrak {A}$ -ring is an ordinary $\mathfrak {I}$ -ring.

  • A strong refutative $\mathfrak {A}$ -ring is a ring with apartness as in [Reference Troelstra and van Dalen50, Definition 8.3.1] (except that they also assume $0\neq 1$ ).

  • A general $\mathfrak {A}$ -ring is an $\mathfrak {I}$ -ring with an inequality such that $(x\neq y) \leftrightarrow (x-y \neq 0)$ and $(xy \neq 0) \to (y\neq 0)$ .

An ideal is an additive subgroup J with . In the antithesis translation, in the affirmative case this is an ordinary $\mathfrak {I}$ -ideal, while in the strong refutative case it is an anti-ideal [Reference Troelstra and van Dalen50, Definition 8.3.6]: an additive antisubgroup with . The quotient $R/J$ of an $\mathfrak {A}$ -ring by an ideal is straightforward, and its antithesis translation yields the apartness on the quotient of an apartness ring by the complement of an anti-ideal [Reference Troelstra and van Dalen50, Proposition 8.3.8].

Definition 7.9. Let J be an ideal of the $\mathfrak {A}$ -ring R that is proper, i.e., .

  • J is -prime if .

  • J is -prime if .

  • R is -integral if $(0)$ is proper and -prime.

  • R is -integral if $(0)$ is proper and -prime.

If J is proper, then $R/J$ is -integral or -integral exactly when J is -prime or -prime, respectively. In the antithesis translation:

  • A -prime affirmative $\mathfrak {A}$ -ideal in an affirmative $\mathfrak {A}$ -ring is a proper $\mathfrak {I}$ -ideal such that $(xy\in J) \vdash (x\in J) \lor (y\in J)$ . An affirmative $\mathfrak {A}$ -ring is -integral if $\neg (0= 1)$ and $(xy= 0) \vdash (x= 0)\lor (y= 0)$ ; this is [Reference Johnstone26, axiom I1].

  • Similarly, an affirmative $\mathfrak {A}$ -ring is -integral if it satisfies $\neg (0= 1)$ and [Reference Johnstone26, axiom I2]: $(xy= 0) \land \neg (x= 0) \to (y= 0)$ .

  • A -prime strong refutative $\mathfrak {A}$ -ideal in a strong refutative $\mathfrak {A}$ -ring is an anti-ideal in an $\mathfrak {I}$ -ring with apartness that is proper ( ) and such that , i.e., a prime anti-ideal as in [Reference Troelstra and van Dalen50, Proposition 8.3.10].

  • Finally, an arbitrary $\mathfrak {A}$ -ring is -integral if and only if $1\neq 0$ and we have $(x\neq 0) \land (xy = 0) \to (y= 0)$ and also $(x\neq 0) \land (y\neq 0) \to (x y \neq 0)$ . Combined with the above characterization of $\mathfrak {A}$ -rings, this is precisely an integral domain in the sense of [Reference Mines, Richman and Ruitenburg35, Exercise II.2.7].

Definition 7.10. Let J be a proper ideal of the $\mathfrak {A}$ -ring R.

  • J is -maximal if .

  • J is -maximal if .

  • R is a -field if $(0)$ is proper and -maximal.

  • R is a -field if $(0)$ is proper and -maximal.

We write $\mathsf {inv}(x)$ for “x is invertible”; this is the second disjunct in (either kind of) maximality for $(0)$ . The quotient $R/J$ is a -field or -field if and only if J is -maximal or -maximal, respectively. In the antithesis translation:

  • An affirmative $\mathfrak {A}$ -ring is a -field just when its corresponding $\mathfrak {I}$ -ring satisfies $\neg (0= 1)$ and $(x= 0) \lor \mathsf {inv}(x)$ . These are called discrete fields (since they necessarily have decidable equality) or geometric fields [Reference Johnstone26, axiom F1].

  • A general $\mathfrak {A}$ -ring is a -field just when its corresponding $\mathfrak {I}$ -ring with inequality satisfies $0\neq 1$ and $(x\neq 0) \to \mathsf {inv}(x)$ . This is precisely a field as in [Reference Mines, Richman and Ruitenburg35] with $\neq $ irreflexive (in [Reference Mines, Richman and Ruitenburg35] the zero ring is a “field” with $0\neq 0$ ).

  • A -field has strong refutative equality just when its $\mathfrak {I}$ -ring has a tight apartness; these are the Heyting fields of [Reference Mines, Richman and Ruitenburg35] and the fields of [Reference Troelstra and van Dalen50, Definition 8.3.1].

  • Strong refutative -maximal $\mathfrak {A}$ -ideals are the minimal anti-ideals of [Reference Troelstra and van Dalen50, Definition 8.3.10].

  • Finally, the affirmative $\mathfrak {A}$ -rings that are -fields are the $\mathfrak {I}$ -rings satisfying $\neg (1= 0)$ and $\neg (x= 0) \to \mathsf {inv}(x)$ , which is [Reference Johnstone26, axiom F2].

Remark 7.11. The name “geometric field” arises because such fields are the models of a geometric theory. However, antithesis translations of -fields are also a geometric theory if we include the inequality $\neq $ as part of the theory. The apartness axiom for $\neq $ is also geometric; only the tightness axiom $\neg (x\neq y) \vdash (x= y)$ fails to be so.

In fact, writing a classical definition in affine logic and passing across the antithesis translation often (though not always) produces a geometric theory. It is a sort of refinement of the “Morleyization” (see, e.g., [Reference Johnstone27, D1.5.13]).

8 Order

When equality is a defined relation, we can either introduce order and topology as structures on a type which induce an equality, or as structures on a set that might determine the equality by a “separation” axiom. We prefer the former.

Definition 8.1. A preorder on an $\mathfrak {A}$ -type A is a predicate

on $A\times A$ with

  • A preorder is strong if .

  • A linear order is a preorder such that .

  • A total order is a preorder such that .

If A has a preorder, then makes A into a set, and is then a relation defined on $A\boxtimes A$ . The sets-with-preorder we obtain in this way are exactly the partial orders: sets with a preorder such that is a relation on $A\boxtimes A$ and is $\sqcap $ -antisymmetric, i.e., .

Example 8.2. The equality on $\Omega $ from Example 6.7 is induced in this way from the natural preorder .

In the antithesis translation, an $\mathfrak {A}$ -partial-order contains two relations $\le $ and ${\not\le }$ , but it is often more suggestive to write $x<y$ instead of $y{\not\le } x$ .

Theorem 8.3. In the antithesis translation, a partial order on an $\mathfrak {A}$ -set A consists of two $\mathfrak {I}$ -relations $\le $ and $<$ such that

$$\begin{alignat*}{2} &\vdash_{x: A} &\;&(x\le x)\\[-1pt] (x\le y)\land (y\le z) &\vdash_{x,y,z: A} &\;&(x\le z)\\[-1pt] (x\le y) \land (y\le x) &\vdash_{x,y: A}&\;& (x= y)\\[-1pt] (x<y)\land (y\le z) &\vdash_{x,y,z: A}&\;& (x<z)\\ (x\le y) \land (y<z) &\vdash_{x,y,z: A}&\;& (x<z)\\[-1pt] (x<y) &\vdash_{x,y: A}&\;& (x\neq y)\\[-1pt] (x\neq y) &\vdash_{x,y: A}&\;& ((x<y) \lor (y<x)). \end{alignat*} $$

That is, $\le $ is an $\mathfrak {I}$ -partial-order, $<$ is a “bimodule” over it, and for $x,y: A$ we have $(x\neq y) \equiv ((x<y) \lor (y<x))$ . Moreover, the $\mathfrak {A}$ -partial-order is…

  • …strong if and only if $<$ is cotransitive: $(x<z) \vdash (x<y) \lor (y<z)$ .

  • …linear if and only if $(x<y)\to (x\le y)$ (hence $<$ is transitive).

  • …total if and only if $\le $ is total, $(x\le y)\lor (y\le x)$ .

Such “order pairs” appear often in constructive mathematics, but the only abstract such definition I know of was in the Lean 2 proof assistant.Footnote 16 Often either $\le $ or $<$ is the other’s negation (i.e., is affirmative or refutative), but not always:
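Since the Lean 2 library is mentioned above, here is what such an abstract “order pair” might look like as a Lean 4 structure (a sketch following Theorem 8.3; the field names are ours and are not taken from that library):

```lean
/-- An "order pair": a partial order `le` relative to a chosen equality `eq`,
    together with a strict relation `lt` that is a bimodule over `le`. -/
structure OrderPair where
  carrier : Type
  eq : carrier → carrier → Prop
  le : carrier → carrier → Prop
  lt : carrier → carrier → Prop
  le_refl     : ∀ x, le x x
  le_trans    : ∀ {x y z}, le x y → le y z → le x z
  le_antisymm : ∀ {x y}, le x y → le y x → eq x y
  lt_le_trans : ∀ {x y z}, lt x y → le y z → lt x z
  le_lt_trans : ∀ {x y z}, le x y → lt y z → lt x z

/-- The induced inequality of Theorem 8.3: x ≠ y iff x < y or y < x. -/
def OrderPair.neq (O : OrderPair) (x y : O.carrier) : Prop :=
  O.lt x y ∨ O.lt y x

/-- Strong, linear, and total, as in Theorem 8.3. -/
def OrderPair.Strong (O : OrderPair) : Prop :=
  ∀ x y z, O.lt x z → O.lt x y ∨ O.lt y z

def OrderPair.Linear (O : OrderPair) : Prop :=
  ∀ x y, O.lt x y → O.le x y

def OrderPair.Total (O : OrderPair) : Prop :=
  ∀ x y, O.le x y ∨ O.le y x
```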

Example 8.4. Conway’s surreal numbers [Reference Conway16] are defined in classical logic by:

  • If $L,R$ are any two sets of numbers, and no member of L is $\ge $ any member of R, then there is a number $\{ L | R \}$ . All numbers are constructed in this way.

  • $x\ge y$ iff (no $x^R\le y$ and $x\le $ no $y^L$ ).

(For $x = \{L|R\}$ , $x^L$ and $x^R$ denote typical members of L or R respectively.) Leaving aside the problematic inductive nature of this definition, we can write it affinely as

where

is its negation

In the antithesis translation, this yields a simultaneous inductive definition of $\le $ and $<$ , neither of which is the Heyting negation of the other; see [Reference Forsberg and Setzer18] and [52, Section 11.6]. Omitting R yields the plump ordinals (see [Reference Taylor47] and [52, Example 11.17]).

Recall that classically, if $\le $ is a total order we can define $x<y$ by $\neg (y\le x)$ or by $(x\le y)\land (x\neq y)$ , and recover $x\le y$ as $\neg (y< x)$ or as $(x=y)\lor (x<y)$ . For an “order pair” as in Theorem 8.3 that is linear, the former holds, but the latter generally fails. However, in affine logic we can say:

Theorem 8.5. Let be a refutative linear order, and write . Then we have

(8.6)
(8.7)

Proof For any partial order we have

This certainly implies

. Conversely, linearity of

means

, while refutativity implies

, so that

implies

. This gives (8.6), while (8.7) is simply its De Morgan dual.

Thus, while the constructive $\le $ does not mean “less than or equal to,” at least in some cases (such as $\mathbb {R}$ ; see Section 9) it does mean “less than par equal to” (or perhaps, as suggested in Remark 3.8, “less than unless equal to” or “less than or else equal to”).

Remark 8.8. In classical and intuitionistic mathematics, preorders can be identified with thin categories: categories in which there is at most one arrow with any given domain and codomain. The situation is a bit more subtle in affine logic, and depends on choosing a correct definition of “category.” Since in general we do not compare objects of a category for “equality,” the objects of a category should form only a type rather than a set. Similarly, since we only compare arrows for equality if they are known to have the same domain and codomain, rather than a single $\mathfrak {A}$ -set of arrows we should have a collection of such sets $\hom _{\mathscr {C}}(x,y)$ indexed by pairs of objects $x,y$ . (This requires our theory to have dependent types.)

Now an $\mathfrak {A}$ -set A in which “all elements are equal” carries no more information than the proposition ${\textstyle \bigsqcup } x^A.\top $ , which is affirmative. Thus, a “thin category” consists of a type together with an affirmative binary relation ${\textstyle \bigsqcup } f^{\hom _{\mathscr {C}}(x,y)}.\top $ that is transitive and reflexive, and hence coincides with an affirmative preorder. In general, therefore, whenever preorders are being treated like categories (for instance, when they are equipped with Grothendieck topologies to define sheaf toposes), they should be assumed affirmative.

9 Real analysis

Recall from Remark 6.9 that the natural numbers type $\mathbb {N}$ is a strong set. In fact it is a strong total order, with order relation defined recursively:

The integers $\mathbb {Z}$ are the type $\mathbb {N}\times \mathbb {N}$ with

and the rational numbers $\mathbb {Q}$ are $\mathbb {Z}\times \mathbb {N}$ with

These total orders are affirmative, refutative, strong, and decidable, as are the induced equalities

. In the antithesis translation, they yield the usual posets of numbers.
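For orientation, a minimal sketch of how such definitions usually go (my own phrasing of the clauses, not necessarily the exact displays intended above), with $(a,b):\mathbb {Z}$ representing $a-b$ and $(c,d):\mathbb {Q}$ representing $c/(d+1)$ :

$$ \begin{gather*} (0 \le n) :\equiv \top, \qquad (m+1 \le 0) :\equiv \bot, \qquad (m+1 \le n+1) :\equiv (m \le n),\\ \big((a,b) \le (a',b')\big) :\equiv (a + b' \le a' + b) \quad\text{on } \mathbb{Z} = \mathbb{N}\times\mathbb{N},\\ \big((c,d) \le (c',d')\big) :\equiv \big(c\cdot(d'+1) \le c'\cdot(d+1)\big) \quad\text{on } \mathbb{Q} = \mathbb{Z}\times\mathbb{N}. \end{gather*} $$

Decidability of these relations is then inherited from the recursion on $\mathbb {N}$ together with the computability of addition and multiplication.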

We define addition and multiplication by recursion on $\mathbb {N}$ , and then by the usual formulas on $\mathbb {Z}$ and $\mathbb {Q}$ , making $\mathbb {Z}$ a -integral strong ring and $\mathbb {Q}$ a strong -field.

Definition 9.1. The Cauchy real numbers are the partially ordered $\mathfrak {A}$ -set

The set $\mathbb {R}_c$ is a strong linear order and a strong ring that is a -field.

Theorem 9.2. In the antithesis translation, the $\mathfrak {A}$ -set $\mathbb {R}_c$ is the usual such $\mathfrak {I}$ -set with its usual linear order and induced equality and apartness.

The Dedekind real numbers are a little more surprising. We first note that, just as in classical logic (but not intuitionistic logic), the notion of “one-sided cut” in affine logic doesn’t depend on the side, or whether the cuts are open or closed.

Definition 9.3. Let .

  • L is a lower set if .

  • L is upwards-open if .

  • L is upwards-closed if .

Dually, we have upper sets, downwards-open, and downwards-closed.

Theorem 9.4. The following $\mathfrak {A}$ -sets are isomorphic:

Proof The isomorphisms are:

Definition 9.5. We write $\mathcal {C}$ for any of the sets in Theorem 9.4, and we call its elements cuts.

We give $\mathcal {C}$ the partial order induced from containment of lower sets. Thus, if we write $x_{\smash {\mathring {L}}},x_{\smash {\overline {L}}},x_{\smash {\mathring {U}}},x_{\smash {\overline {U}}}$ for the four representations of , we have

Using totality of the order on $\mathbb {Q}$ , we can show that this order on $\mathcal {C}$ is linear. If we identify

with the cut

, then $\mathbb {Q}$ is fully order-embedded in $\mathcal {C}$ , and moreover for any

and

we have

Thus, we can define a cut x by specifying any one of the relations

,

,

, and

on $\mathbb {Q}$ which has the appropriate property. This is usually more congenial than working explicitly with upper or lower subsets of $\mathbb {Q}$ .
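As a concrete illustration (my own example, not taken from the text), in the antithesis translation the cut corresponding to $\sqrt {2}$ is specified by the pair of relations

$$ (q < \sqrt{2}) \equiv (q < 0) \lor (q^2 < 2), \qquad (\sqrt{2} < q) \equiv (q > 0) \land (2 < q^2), $$

with the non-strict relations then determined as in Theorem 9.4.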

In intuitionistic logic, it is common to work with two-sided cuts instead. But because an $\mathfrak {A}$ -subset is a complemented $\mathfrak {I}$ -subset, in the antithesis translation our one-sided $\mathfrak {A}$ -cuts become two-sided $\mathfrak {I}$ -cuts.

Theorem 9.6. In the antithesis translation, $\mathcal {C}$ corresponds to the set of pairs $(L,U)$ of $\mathfrak {I}$ -subsets of $\mathbb {Q}$ such that L is an upwards-open lower set, U is a downwards-open upper set, and $L<U$ . Its induced order is

$$ \begin{align*} ((L_1,U_1) \le (L_2,U_2)) &\equiv ((L_1\subseteq L_2) \land (U_2 \subseteq U_1)),\\ ((L_1,U_1) < (L_2,U_2)) &\equiv \exists r. ((r\in L_2) \land (r\in U_1)). \end{align*} $$

Proof By Theorem 6.11, an element of $\mathscr {P}\mathbb {Q}$ is a disjoint pair

of subsets of $\mathbb {Q}$ . To say that it is an $\mathfrak {A}$ -lower-set means that L is an $\mathfrak {I}$ -lower-set and

is an $\mathfrak {I}$ -upper-set. Given this, disjointness is equivalent to

. And to say that it is $\mathfrak {A}$ -upwards-open means that L is $\mathfrak {I}$ -upwards-open and

is $\mathfrak {I}$ -downwards-closed. Finally, the bijection between open and closed upper cuts (or, dually, lower ones) is also true intuitionistically:

The $\mathfrak {I}$ -set of pairs $(L,U)$ in Theorem 9.6 is also called the set of (rational) cuts [Reference Richman43], or sometimes the interval domain. It is distinct from $\mathbb {R}$ even classically, containing additionally all closed intervals $[a,b]$ for $-\infty \le a\le b\le \infty $ .
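As for the bijection between open and closed upper cuts invoked at the end of the proof, a minimal sketch (with formulations of my own choosing, writing $\mathring {U}$ for the open and $\overline {U}$ for the closed version) is that they determine each other by

$$ \overline{U} = \{\, q \mid \forall r.\, (q < r \to r \in \mathring{U}) \,\}, \qquad \mathring{U} = \{\, q \mid \exists r.\, (r < q \land r \in \overline{U}) \,\}, $$

and one checks directly, without excluded middle, that these two assignments are mutually inverse on upper sets with the respective openness and closedness properties.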

Definition 9.7. The Dedekind real numbers $\mathbb {R}_d$ are the $\mathfrak {A}$ -set of $x:\mathcal {C}$ with

  • boundedness: ,

  • -locatedness: .

All cuts are “ -located,” , by -excluded-middle. In the antithesis translation, $\mathbb {R}_d$ is the usual set of Dedekind reals.
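Spelled out on the intuitionistic side (a standard rendering, which by the preceding sentence is what the translation produces), a pair $(L,U)$ as in Theorem 9.6 is a Dedekind real just when it is bounded and located in the usual sense:

$$ \exists q.\,(q\in L)\ \land\ \exists r.\,(r\in U) \qquad\text{and}\qquad \forall q\, r.\,\big((q<r) \to (q\in L)\lor(r\in U)\big). $$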

Remark 9.8. As we did for $\mathbb {N}$ in Remark 6.9, we can prove entirely in affine logic that the real numbers form a strong set. Suppose ; to show we assume $r:\mathbb {Q}$ with and must prove . Now there is an $s:\mathbb {Q}$ with , and since y is -located we have . Doing a case split on this, if we use to conclude , while if we use to conclude , a contradiction.

Now, there are at least two natural ways to define addition on $\mathcal {C}$ :

If $x,y:\mathbb {R}_d$ , one can prove that , and in the antithesis translation they give the usual addition on Dedekind reals:

(9.9)
(9.10)

However, for cuts, and are distinct. In the antithesis translation, with $+$ for $\mathfrak {I}$ -cuts defined using (9.9) and (9.10), we have , but is weaker than $q < x+y$ . One place where this matters is in defining metric spaces.
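For orientation, the familiar intuitionistic addition of two-sided cuts, which is presumably what (9.9) and (9.10) record (up to easy equivalences; the formulation is mine), is

$$ \begin{align*} (q < x+y) &\equiv \exists r\, s.\,\big((r<x)\land(s<y)\land(q \le r+s)\big),\\ (x+y < q) &\equiv \exists r\, s.\,\big((x<r)\land(y<s)\land(r+s \le q)\big). \end{align*} $$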

Definition 9.11. A cut-metric on an $\mathfrak {A}$ -type X is an operation $d:\mathcal {C}^{X\times X}$ with

For any cut-metric, defines a preorder. If d is symmetric, this is already an equality making X a set; otherwise we can symmetrize it as in Section 8. (We can also symmetrize d directly with .) If X is already a set and d a function, the usual metric separation condition $(d(x,y)\circeq 0) \vdash (x\circeq y)$ makes its equality coincide with that obtained in this way.

In particular, if $d(x,y): \mathbb {R}_d$ for all $x,y$ , then the antithesis translation of X is an $\mathfrak {I}$ -quasi-metric space, and an $\mathfrak {I}$ -metric space if we impose symmetry.

Now suppose X is a cut-metric space and we have

and

. As observed intuitionistically in [Reference Richman43], $\mathcal {C}$ is a complete lattice (which $\mathbb {R}_d$ is not, constructivelyFootnote 17 ); thus we can define the distance from a to B as an infimum:

Rather than defining infima in $\mathfrak {A}$ -posets in general, we simply make this explicit:

Even if each $d(a,b)$ is a Dedekind real, $d(a,B)$ may not be. But the observation of Richman [Reference Richman43] is that if we treat $d(a,B)$ as a cut, then its inequality relations to rational (hence also real) numbers are exactly what we would expect of such a “distance.” In the antithesis translation, these become:

If B is affirmative, $q \le d(a,B)$ becomes Richman’s $\forall b^B. (q \le d(a,b))$ . We also have

At least in the Dedekind-real case, we can then write

to get

whose antithesis translation, when $B,B'$ are affirmative, reduces to Richman’s:

$$ \begin{align*} (d(a,B) \le d(a,B')) \equiv \forall \varepsilon. \forall \smash{b'}^X. ((b'\in B') \to \exists b^X. ((b\in B) \land (d(a,b) < d(a,b') + \varepsilon))). \end{align*} $$

Still following [Reference Richman43], we can define the (directed) Hausdorff distance between two subsets as

However, unlike Richman, we can show:

Theorem 9.12. The Hausdorff distance is a cut-metric on $\mathscr {P} X$ .

Proof The proof of the triangle inequality is essentially the same as that of its “upper portion” in [Reference Richman43, Section 6]. We must show if

then

. By definition of

, we have

. Now

and

yield

and

such that

Thus, for any

we get

with

, then from b we get

with

. Hence

, so that

.

In [Reference Richman43, Section 6] Richman notes that the cut-valued Hausdorff distance fails the triangle inequality if addition of cuts is defined by (9.9) and (9.10). He concludes that one should forget the “lower cut” part of the Hausdorff distance. We conclude instead that the relevant addition of cuts is , not (9.9) and (9.10). Indeed, (9.9) and (9.10) are suspect right away, as it is not even clear whether they can be obtained simultaneously as the antithesis translation of any single definition of addition for $\mathfrak {A}$ -cuts.
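For comparison, the classical quantity that this whole discussion generalizes is the directed Hausdorff distance

$$ d(A,B) \;=\; \sup_{a\in A}\, d(a,B) \;=\; \sup_{a\in A}\inf_{b\in B} d(a,b), $$

which matches the unfolding of the cut-valued version given in Example 10.22 below.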

The other example in [Reference Richman43, Section 6] where cuts seem to have problems can also be resolved with affine logic. Suppose (intuitionistically) A is an abelian group and p a prime with $\bigcap _n p^n A = 0$ , i.e., $(\forall n. \exists c. (a = p^n c)) \vdash _{a\in A} (a= 0)$ . Define

In classical mathematics, this defines an “ultranorm,” i.e., $ |a+b| \le \sup (|a|,|b|).$ Intuitionistically, if we interpret $|a|$ as a cut and $\sup $ as the binary supremum (the union of lower parts and intersection of upper parts), then the upper part of $|a+b| \le \sup (|a|,|b|)$ holds but the lower part can fail.
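Concretely (a reconstruction in the standard p-adic style, not necessarily the exact display intended above), the cut $|a|$ can be specified by the relation

$$ (|a| \le p^{-n}) :\equiv \exists c.\,(a = p^n c), $$

extended to all positive rationals by monotonicity; classically this gives $|a| = \inf \{\, p^{-n} \mid a \in p^n A \,\}$ , and the hypothesis $\bigcap _n p^n A = 0$ ensures that $|a|=0$ only for $a=0$ .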

Our solution is to replace this “additive” binary supremum with a multiplicative one. Returning to affine logic, for cuts $x,y$ we define

Now let A be an abelian $\mathfrak {A}$ -group and

, and define

I claim that then we have

This is again just like the “upper part” proof from [Reference Richman43]: if

and

, then $a \circeq p^n c$ and $b \circeq p^n d$ for some $c,d$ , so that $a+b \circeq p^n(c+d)$ and hence

. So in both cases, it is not that cuts are inadequate, but that the operations on cuts sometimes need to use multiplicative connectives rather than additive ones.

Remark 9.13. Another problematic area of analysis for constructive mathematics is the theory of measure spaces. Already in [Reference Bishop and Bridges10] complemented subsets were used as the domain for a constructive measure, and [Reference Bartels and Trimble8] formulates an abstract notion of measurable space based on a Chu construction like ours. This suggests that affine logic would also be a natural context for constructive measure theory.

Remark 9.14. We end this section with an example where proof-relevance matters. An $\mathfrak {I}$ -sequence of real (or rational) numbers is Cauchy if

(9.15) $$ \begin{align} \forall \varepsilon. \exists k. \forall nm. (n>k\land m>k \to |x_n-x_m|\le \varepsilon), \end{align} $$

and diverges [Reference Bishop and Bridges10, Section 2.3] if

(9.16) $$ \begin{align} \exists \varepsilon. \forall k. \exists nm. (n>k \land m>k \land |x_n-x_m|>\varepsilon). \end{align} $$

These are formal De Morgan duals, so if we define Cauchy-ness of an $\mathfrak {A}$ -sequence by

(9.17)

then its linear negation is divergence.

However, in the absence of countable choice, it is often better to consider a Cauchy sequence as coming with a function $K_{\varepsilon }$ , and Bishop presumably understands a divergent sequence to come with functions $N_k,M_k$ . But if we write out the assertions of such functions by hand, the corresponding formulas

(9.18) $$ \begin{align}\exists K. \forall \varepsilon. \forall nm. (n>K_{\varepsilon} \land m>K_{\varepsilon} \to |x_n-x_m|\le\varepsilon), \end{align} $$
(9.19) $$ \begin{align}\exists \varepsilon. \exists NM. \forall k. (N_k>k \land M_k > k \land |x_{N_k}-x_{M_k}|>\varepsilon) \end{align}$$

are no longer De Morgan duals. Gödel’s “Dialectica” interpretation [Reference Gödel20, Reference Hofstra23, Reference de Paiva40, Reference de Paiva41] automatically does this sort of “Skolemization,” so that (9.15) and (9.16) would be interpreted as (9.18) and (9.19) respectively; but this doesn’t solve the problem that the two pairs are not each other’s negations.

Instead, we can write (9.15) and (9.16) using the propositions-as-types interpretation into dependent type theory. This gives

(9.20) $$ \begin{align}\textstyle \prod_{\varepsilon} \sum_k \prod_{n,m} (n>k\land m>k \to |x_n-x_m|\le \varepsilon), \end{align}$$
(9.21) $$ \begin{align}\textstyle\sum_{\varepsilon} \prod_k \sum_{n,m} (n>k \land m>k \land |x_n-x_m|>\varepsilon), \end{align}$$

which include the Skolem functions automatically, due to the “type-theoretic axiom of choice” $\prod _{x:A} \sum _{y:B} C(x,y) \cong \sum _{f:A\to B} \prod _{x:A} C(x,f(x))$ . Moreover, (9.20) and (9.21) are still De Morgan duals with respect to $\Sigma $ and $\Pi $ . Therefore, we can obtain them from the antithesis translation of (9.17) applied to the propositions-as-types hyperdoctrine mentioned in Examples 4.1.3.

10 Topology

Finally, we consider point-set topologies. There are many classically equivalent ways to define a topology; first we consider neighborhood relations.

Note that the preorder on $\Omega ^A$ makes sense even if A is only a type, making $\Omega ^A$ into a set.

Definition 10.1. A topology on a type A is a predicate

on $A\times \Omega ^A$ with

Isotony implies each

is a relation on the set $\Omega ^A$ . We define a preorder on A by

, making A into a set as well, such that

is a relation on $A \boxtimes \Omega ^A$ . Moreover, an arbitrary predicate $U:\Omega ^A$ is contained in a smallest relation

, and by definition of equality on A we have

. Thus,

is determined by its behavior on subsets, so we may consider it to be a relation on $A\boxtimes \mathscr {P} A$ . If we instead assume A is given as a set and

as a relation on $A\boxtimes \mathscr {P} A$ , then to ensure that the equality coincides with the one constructed above we must impose the $T_0$ axiom

Now a relation on $A\boxtimes \mathscr {P} A$ is equivalently a function $\mathsf {int} : \mathscr {P} A \to \mathscr {P} A$ , and Definition 10.1 translates into an affine version of an “interior operator”:

  (i) ,

  (ii) ,

  (iii) $\mathsf {int}(A) \circeq A$ ,

  (iv) ,

  (v) $\mathsf {int}(\mathsf {int}(U)) \circeq \mathsf {int}(U)$

plus the following form of the $T_0$ axiom:

  (vi) .

The most interesting of these axioms is (iv), which is a version of the classical

(10.2) $$ \begin{align} \mathsf{int}(U)\cap \mathsf{int}(V) \subseteq \mathsf{int}(U\cap V) \end{align} $$

that uses an additive intersection on the right but a multiplicative one on the left. This may look more surprising than the (equivalent) binary additivity axiom for in Definition 10.1, since in the latter $\boxtimes $ is a logical connective while $\sqcap $ is a set operation. However, the following example suggests that this odd-looking mixture of additive and multiplicative intersections is exactly right:

Example 10.3. Any cut-metric space (Definition 9.11) has a topology defined by

To prove binary additivity, from

we can get $\varepsilon _U$ and $\varepsilon _V$ , and then choose

to prove

. Note that here we need to use both hypotheses at once, so they must be combined with $\boxtimes $ rather than $\sqcap $ . We then have to show that given y with

we have

, i.e., that

. For

we use

and the hypothesis from

, and dually. Note that here we need to use the same hypothesis

in proving both subgoals

and

, so they must be combined with $\sqcap $ rather than $\boxtimes $ .
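Putting the description of (iv) before Example 10.3 together with the example itself, axiom (iv) presumably takes the shape

$$ \mathsf{int}(U) \boxtimes \mathsf{int}(V) \;\subseteq\; \mathsf{int}(U \sqcap V), $$

that is, (10.2) with the intersection on the left taken multiplicatively (pointwise $\boxtimes $ ) and the one on the right additively.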

Axiom (iv) is further clarified by writing it in terms of :

(10.4)

Since ${(-)}^{\perp }$ is involutive, in linear logic $\mathsf {cl}$ and $\mathsf {int}$ contain the same data. But intuitionistically, “closure operators” do not respect unions: a point may lie in the closure of $U\cup V$ without our being able to decide which of U or V it lies in the closure of. Our (10.4) remedies this by taking one of the unions to be multiplicative.

On the other hand, classically and intuitionistically the converse of (10.2) always holds, so (10.2) is equivalent to closure of the fixed points of $\mathsf {int}$ (the open sets) under binary intersections. It is harder to express (iv) using “open $\mathfrak {A}$ -subsets.”

We now move on to the antithesis translation of an $\mathfrak {A}$ -topology. This is rather complicated, since not only does  give rise to two relations, but each $\mathfrak {A}$ -subset U is actually a pair of (disjoint) $\mathfrak {I}$ -subsets. We start with some familiar special cases.

Theorem 10.5. Under the antithesis translation, an $\mathfrak {A}$ -topology such that

corresponds exactly to a $T_0 \mathfrak {I}$ -topology on a type A. If we write $x \ll U$ for the $\mathfrak {I}$ -relation “x is in the interior of U,” then the induced inequality on A is

Proof The assumption implies that is determined by a single $\mathfrak {I}$ -relation $\ll $ between points of A and $\mathfrak {I}$ -subsets of A. The axioms on then translate to the usual definition of an $\mathfrak {I}$ -topology in terms of a neighborhood relation. Finally, our definition of equality in a topological space corresponds to the $T_0$ axiom.

If U is an $\mathfrak {I}$ -subset of an $\mathfrak {I}$ -set A with inequality $\neq $ , we write

This is the same as saying that x belongs to the inequality complement of U, i.e.,

in the antithesis translation.

Theorem 10.6. Under the antithesis translation, an $\mathfrak {A}$ -topology such thatFootnote 18

(10.7)
(10.8)

corresponds to a point-set pre-apartness space satisfying the reverse Kolmogorov property in the sense of [Reference Bridges and Vîţă11, p. 20],Footnote 19 i.e., an $\mathfrak {I}$ -set with an inequality $\neq $ and a relation $\bowtie $ between points and $\mathfrak {I}$ -subsets such that

(10.9) $$ \begin{alignat}{2} \hspace{-47pt}(x \bowtie K)& \vdash &\;& (x\notin K) \end{alignat} $$
(10.10) $$ \begin{alignat}{2} \hspace{-95pt}(x\bowtie K) \land (L\subseteq K) &\vdash &\;& (x\bowtie L) \end{alignat} $$
(10.11) $$ \begin{alignat}{2} &\kern1pt\vdash&\;& (x\bowtie \emptyset) \end{alignat} $$
(10.12) $$ \begin{alignat}{2} \hspace{-77pt}(x\bowtie K) \land (x\bowtie L) &\vdash&\;& (x\bowtie K\cup L) \end{alignat} $$
(10.13) $$ \begin{alignat}{2} \forall x.((x\bowtie K) \to (x\notin L)) &\vdash &\;& \forall x.((x\bowtie K) \to (x\bowtie L)) \end{alignat} $$
(10.14) $$ \begin{alignat}{2} \hspace{-111.5pt}(x\bowtie K) \land \neg(y\bowtie K) &\vdash &\;& (x\neq y), \end{alignat} $$

and which also satisfies the additional “forwards Kolmogorov property” that

(10.15) $$ \begin{alignat}{2} (x\neq y) &\vdash &\;& (x\bowtie\{y\}) \lor (y\bowtie \{x\}). \end{alignat} $$

Proof Since , and is affirmative by (10.7), the converse of (10.8) also holds. Thus is determined by its behavior on refutative $\mathfrak {A}$ -subsets, and hence by one $\mathfrak {I}$ -relation between points and $\mathfrak {I}$ -subsets. We define . But note that $(\neg K,K)$ is not an $\mathfrak {A}$ -subset, and as noted above is determined by its behavior on $\mathfrak {A}$ -subsets; thus $x\bowtie K$ is also equivalent to . In particular, reflexivity of implies (10.9).

Statements (10.10)–(10.12) are straightforward translations of $\mathfrak {A}$ -isotony and additivity. The direct translation of $\mathfrak {A}$ -transitivity is $ (x\bowtie K) \vdash (x \bowtie \{ y | \neg (y\bowtie K) \}),$ which is equivalent to (10.13) and (10.14) together. Our definition of equality yields

$$\begin{align*}(x\neq y) \equiv \exists K.((x\bowtie K) \land \neg (y\bowtie K)) \lor \exists K.(\neg(x\bowtie K) \land (y\bowtie K)). \end{align*}$$

However, if $x\bowtie K$ and $\neg (y\bowtie K)$ , then $y\in \{ z|\neg (z\bowtie K) \}$ , whence $x\bowtie \{y\}$ , whereas conversely if $x\bowtie \{y\}$ then we can take . Thus, we have

$$\begin{align*}(x\neq y) \equiv (x\bowtie\{y\}) \lor (y\bowtie \{x\}).\end{align*}$$

The right-to-left implication follows from (10.9), while the left-to-right is (10.15).

Thus, $\mathfrak {I}$ -topologies and apartnesses are special cases of $\mathfrak {A}$ -topologies. But in some sense these restrictions on $\mathfrak {A}$ -topologies miss the point, because virtually no naturally defined $\mathfrak {A}$ -topologies satisfy them! In the antithesis translation, a general $\mathfrak {A}$ -topology consists of two relations $\ll $ and ${\not\ll }$ between points and complemented subsets (Theorem 6.11); and even for Dedekind-metric spaces neither is the Heyting negation of the other, and both parts of a complemented subset are used.

Example 10.16. Recall that in Example 10.3 we showed that any $\mathfrak {A}$ -cut-metric space has an underlying $\mathfrak {A}$ -topology. In the antithesis translation, this topology becomes

This is degenerate only in that the relation ${\not\ll }$ only depends on

, not on U. But since both conjuncts in

remain true under shrinking $\varepsilon $ , we can distribute the quantifiers and take a minimum of the two $\varepsilon $ ’s to write

where the first conjunct depends only on U and the second only on

. Thus, we may think of

as “x is in the interior of U and is apart from

.”

In the general case, we can write the axioms of an $\mathfrak {A}$ -topology in terms of $\ll $ and ${\not\ll }$ , as in Figure 3. But they are not very familiar, because we are used to spaces that are degenerate in the manner of Example 10.16: with ${\not\ll }$ depending only on , and $\ll $ the conjunction of two properties depending on U and respectively. This suggests the following definition.

Figure 3 The antithesis translation of an $\mathfrak {A}$ -topology.

Definition 10.17. A unified topology on an $\mathfrak {I}$ -type A consists of three predicates $\ll ,\bowtie ,\approx $ on $A\times \Omega ^A$ such that:

  • $\ll $ is a topology in the usual sense:

    $$\begin{alignat*}{2} (x\ll U) &\vdash&\;& (x\in U) \notag\\ (x\ll U) \land (U\subseteq V) &\vdash&\;& (x\ll V)\notag\\ &\vdash&\;& (x\ll A) \notag\\ (x\ll U) \land (x\ll V) &\vdash&\;& (x\ll U\cap V)\notag\\ (x \ll U) &\vdash&\;& (x\ll \{ y | y \ll U \}). \notag \end{alignat*} $$
  • $\bowtie $ satisfies the following apartness axioms:

    (*) $$ \begin{align} (x\bowtie K) &\vdash \neg (x\in K)\qquad \end{align} $$
    (10.18) $$ \begin{alignat}{2} (x\bowtie K) \land (L\subseteq K) &\vdash&\;& (x\bowtie L)\notag\\ &\vdash&\;& (x\bowtie \emptyset) \notag\\ (x\bowtie K) \land (x\bowtie L) &\vdash&\;& (x\bowtie K\cup L)\notag\\ (x \bowtie K) &\vdash&\;& (x \bowtie \{ y | y \approx K \}). \end{alignat} $$
  • $\approx $ satisfies the following “closure space” axioms:

    (*) $$ \begin{alignat}{2} (x\in K) &\vdash&\;& (x\approx K)\notag\\ (x\approx K) \land (K\subseteq L) &\vdash&\;& (x\approx L) \notag\\ &\vdash&\;& \neg (x\approx \emptyset) \end{alignat} $$
    (10.19) $$ \begin{align} \hspace{-33pt}(x\approx (K \cup L)) \land (x \bowtie K) &\vdash (x\approx L) \\ \hspace{-33pt}(x \approx \{ y | y \approx K \}) &\vdash (x \approx K).\notag \end{align} $$
  • The following compatibility condition holds:

    (10.20) $$ \begin{alignat}{2} (x\ll U) \land (x\approx K) &\vdash&\;& \exists y. (y\in U\cap K). \end{alignat} $$

In the presence of the other axioms, either of the axioms ( $*$ ) implies the other. Note that transitivity for $\bowtie $ (10.18) involves $\approx $ , while binary additivity for $\approx $ (10.19) (in constructively sensible form derived from ) involves $\bowtie $ .

Theorem 10.21. Given a unified topology, if we define

then we obtain an $\mathfrak {A}$ -topology (in the antithesis translation) as in Figure 3.

Not every $\mathfrak {A}$ -topology has this form, but those coming from cut-metrics do, with

Example 10.22. Recall the Hausdorff cut-metric on $\mathscr {P} X$ from Theorem 9.12:

In the antithesis translation, if $A,B$ are affirmative, then:

  • $d(A,B) < q$ means that there is a $q'<q$ such that for any point $a\in A$ , there exists a point $b\in B$ with $d(a,b)<q'$ .

  • $q\le d(A,B)$ means for any $q'<q$ , there is a point $a\in A$ such that every point $b\in B$ has $q' \le d(a,b)$ .

Thus, in this case:

  • $A\ll \mathcal {U}$ means there is an $\varepsilon>0$ such that $\mathcal {U}$ contains all subsets B for which there is an $\varepsilon '<\varepsilon $ such that every point of A is $\varepsilon '$ -close to some point of B.

  • $A \bowtie \mathcal {K}$ means there is an $\varepsilon>0$ such that for every $B\in \mathcal {K}$ and $\varepsilon '<\varepsilon $ there is a point of A that is at least $\varepsilon '$ -far from every point of B.

  • $A \approx \mathcal {K}$ means for any $\varepsilon>0$ there is a $B\in \mathcal {K}$ and an $\varepsilon '<\varepsilon $ such that every point of A is $\varepsilon '$ -close to a point of B.

Thus, the antithesis translation suggests that rather than taking one of neighborhoods, apartness, or nearness as primary, it is more natural to have all structures in parallel. Of course, Definition 10.17 is rather unwieldy; but Definition 10.1 is quite simple, suggesting it may be easier to just stay in affine logic. In the next section we consider this possibility more seriously.

11 Towards affine constructive mathematics

So far, we have viewed affine logic as a tool for producing definitions and theorems in intuitionistic logic, through the antithesis translation. However, there are other reasons one might care about the “affine constructive mathematics” we have started developing in this paper. One is that it admits other interesting models.

Example 11.1. Linear logicians are familiar with many $\ast $ -autonomous categories, such as coherence spaces and phase spaces. As in Example 4.1.2, any complete semicartesian $\ast $ -autonomous category with Seely comonad yields an affine hyperdoctrine over Set. I expect there are “realizability linear triposes” coming from linear combinatory algebras [Reference Abramsky and Lenisa1, Reference Abramsky, Haghverdi and Scott2]. In addition, Dialectica constructions [Reference de Paiva40] also act on fibrations [Reference Hofstra23]. However, many of these models are not semicartesian, and hence move beyond affine logic to general linear logic.

Example 11.2. Any Boolean algebra is semicartesian and $\ast $ -autonomous, with $\boxtimes \equiv \sqcap $ , the multiplicative disjunction coinciding with $\sqcup $ , and linear negation given by Boolean complementation. Thus, linear logic also specializes directly to classical logic.

More generally, on a Boolean algebra we can take any meet-preserving comonad to be , such as the interior operator of a topology acting on a powerset. Thus, any classical topological space X gives rise to an affine tripos whose propositions are subsets of X, with the affirmative and refutative ones being open and closed respectively. This relates to the “modal” view of sheaves from [Reference Awodey and Kishida4, Reference Awodey, Kishida and Kotzsch5].

Example 11.3. Łukasiewicz logic is a semicartesian $\ast $ -autonomous structure on the unit interval $[0,1]$ , with $\top =1$ , $\bot =0$ , and

It also admits a Seely comonad defined by

and

for $P<1$ . An $\mathfrak {A}$ -set in this model is precisely a metric space with all distances $\le 1$ . (The distance $d(x,y)$ is actually the inequality $x{{\not\circeq }} y$ .) It is strong iff it is an ultrametric space, and affirmative iff it is discrete. Functions are nonexpansive maps, and anafunctions (Definition 6.19) are nonexpansive maps between metric completions. The $\mathfrak {A}$ -set $\Omega =[0,1]$ has its usual metric $|x-y|$ , and the function set $A\to B$ has the supremum metric. For a fixed affirmative $\mathfrak {A}$ -set A, the $\mathfrak {A}$ -subsets of A are fuzzy sets with universe A, with their usual induced metric. Finally, (closed upper) $\mathfrak {A}$ -cuts $x:\mathcal {C}$ are non-decreasing right-continuous functions $\mathbb {R}\to \mathbb {R}$ ; hence bounded $\mathfrak {A}$ -cuts are cumulative distribution functions of random variables, with Dedekind $\mathfrak {A}$ -reals corresponding to constant random variables.
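For reference, the standard Łukasiewicz operations on $[0,1]$ , which I take to be the structure meant here, are

$$ P \boxtimes Q = \max(0,\, P+Q-1), \qquad {P}^{\perp} = 1-P, \qquad P \sqcap Q = \min(P,Q), \qquad P \sqcup Q = \max(P,Q), $$

with the multiplicative disjunction $\min (1,\, P+Q)$ and implication $\min (1,\, 1-P+Q)$ determined by these; the Seely comonad is then presumably the one sending $1$ to $1$ and every $P<1$ to $0$ .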

However, what about the philosophical constructivist, in the tradition of Bishop, say? I believe that one can also motivate affine constructive mathematics on purely philosophical grounds; what follows is one attempt.

We begin by agreeing with Brouwer’s critique of excluded middle, “P or not P,” as a source of non-constructivity. However, the classical mathematician’s belief in this law is not contentless; one may say that the constructivist and the classicist are using the word “or” to mean different things. The constructivist using intuitionistic logic expresses the classical mathematician’s “or” as $\neg \neg (P\lor Q)$ , but the classical mathematician may rebel against the implication that she is unconsciously inserting double negations everywhere. A more even-handed approach is to stipulate both kinds of “or” on an equal footing: the constructivist’s $P\sqcup Q$ says that we know which of P or Q holds; while the classicist’s says…something else.

Before addressing exactly what it says, we consider negation. Intuitionistically, $\neg P$ means that any proof of P would lead to an absurdity. But after this definition, one immediately observes that it is not very useful and should be avoided. So why did we bother defining negation in that way? A more useful notion of “negation” is the polar opposite of a statement, i.e., the most natural and emphatic way to disprove it. The opposite of “every x satisfies $P(x)$ ,” in this sense, is “there is an x that fails $P(x)$ ”: a respectable constructive disproof of a universal claim should provide a counterexample. Similarly, the opposite of “P and Q” is “either P fails or Q fails,” and so on. This negation is involutive, with strict De Morgan duality for quantifiers, conjunctions, and disjunctions.

The most natural way that “if P then Q” can fail is if P is true and Q is false. But the opposite of “P and not Q” is “Q or not P,” so the involutivity of negation means that the latter should be equivalent to “if P then Q.” In particular, the tautology “if P then P” is equivalent to “P or not P,” i.e., excluded middle. Thus, the “or” appearing here must be the classical one . That is, “if P then Q” (which we may as well start writing as ) is equivalent to .

This tells us what means: it means , i.e., if P fails then Q must be true. But any sort of disjunction is symmetric, so should also be equivalent to . Thus, contraposition must hold: is equivalent to . This, in turn, implies that we can do proofs by contradiction.

Proof by contradiction is generally considered non-constructive. For instance, a constructive proof of “there exists an x such that $P(x)$ ” ought to specify x, whereas proof by contradiction seems to subvert this. But does it really? If we try to prove “there exists an x such that $P(x)$ ” by contradiction, we would begin by assuming “for all x, not $P(x)$ ”…and we can only use that assumption by specifying an x!

Non-constructivity only enters if we use that assumption more than once, giving different values of x, and derive a contradiction without determining which value of x satisfies P. Thus, we can remain constructive in the presence of proof by contradiction by imposing an “affinity” restriction that each hypothesis can be used at most once.Footnote 20 This is essentially the content of Girard’s comment:

…take a proof of the existence or the disjunction property; we use the fact that the last rule used is an introduction, which we cannot do classically because of a possible contraction. Therefore, in the…intuitionistic case, $\vdash $ serves to mark a place where contraction…is forbidden…. Once we have recognized that the constructive features of intuitionistic logic come from the dumping of structural rules on a specific place in the sequents, we are ready to face the consequences of this remark: the limitation should be generalized to other rooms, i.e., weakening and contraction disappear. [Reference Girard19, p. 4]

We now let $\sqcap $ and $\boxtimes $ be the De Morgan duals of $\sqcup $ and

, and calculate

Thus, to maintain the “deduction theorem” that we prove

by proving R with Q as an extra hypothesis, we must implicitly combine hypotheses with $\boxtimes $ .
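The omitted calculation is presumably the usual currying of the affine implication: writing $P \multimap R$ for “if P then R” (notation of my own choosing here),

$$ P \multimap (Q \multimap R) \;\equiv\; (P \boxtimes Q) \multimap R, $$

which follows by expanding each implication as a multiplicative disjunction with the negated antecedent and using the involutivity of ${(-)}^{\perp }$ together with the De Morgan duality between $\boxtimes $ and its dual disjunction. A proof of R from hypotheses P and Q is thus literally a proof of $(P \boxtimes Q) \multimap R$ .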

The behavior of $\sqcap $ can be deduced by duality: a hypothesis $P\sqcap Q$ may as well be used by contradiction, requiring us to show $\smash {{(P\sqcap Q)}^{\perp }} \equiv \smash {{P}^{\perp }} \sqcup \smash {{Q}^{\perp }}$ ; and since this is the constructive “or” it requires us to either show $\smash {{P}^{\perp }}$ or $\smash {{Q}^{\perp }}$ . Thus, to use a hypothesis $P\sqcap Q$ we must either use P or Q, but not both. Note the utter reversal of the historical origin of the linear connectives:

The most hidden of all linear connectives is par [ ], which came to light purely formally as the De Morgan dual of [ $\boxtimes $ ] and which can be seen as the effective part of a classical disjunction. [Reference Girard19, p. 5].

Finally, the linearity/affinity restriction is sometimes too onerous. For instance, the axioms of a group must be used many times in the proof of any theorem in group theory. Since we are here regarding the affinity restriction as simply a syntactic discipline to which we subject ourselves in order to maintain constructivity, we may allow ourselves to ignore it in certain cases as long as we keep track of where this happens and prevent ourselves in some other way from introducing nonconstructivity in those cases. This is the purpose of the modality : it marks hypotheses, like the axioms of a group, that we allow ourselves to use more than once. The price we pay is that when checking an axiom of the form , we cannot use proof by contradiction (or more precisely, if we try to do so, the hypothesis we get to contradict is not $\smash {{P}^{\perp }}$ but the weaker ). But this is rarely bothersome: when was the last time you saw someone prove that something is a group by assuming that it isn’t and deriving a contradiction? (See also Remark 4.7.1.)

Whether or not the reader finds the foregoing discussion convincing, I believe it proves that it is possible to argue for affine logic, rather than intuitionistic logic, on philosophical constructivist grounds. Ultimately, of course, the proof of the pudding is in the eating: whether affine constructive mathematics can stand on its own depends on how much useful mathematics can be developed purely in affine logic. In this paper we have only scratched the surface by exploring a few basic definitions, with the antithesis translation as a guide for their correctness.

Acknowledgments

My understanding of constructive mathematics and linear logic has been greatly influenced by Toby Bartels, Martín Escardó, Andrej Bauer, Dan Licata, Valeria de Paiva, Todd Trimble, and Peter LeFanu Lumsdaine. I am also grateful to the referees for stimulating comments and suggestions, and for suggesting the word “antithesis.” This material is based on research sponsored by the United States Air Force Research Laboratory under agreement numbers FA9550-15-1-0053, FA9550-16-1-0292, and FA9550-21-1-0009. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon. The views and conclusions contained herein are those of the author and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the United States Air Force Research Laboratory, the U.S. Government, or Carnegie Mellon University.

Footnotes

1 I will use “intuitionistic” to refer to the formal logic codified by Heyting, and “constructive” for the programme of doing mathematics with “constructive content.” This is unfaithful to the original philosophical meaning of “intuitionistic,” but for better or for worse the phrase “intuitionistic logic” has come to refer to Heyting’s logic, and I have been unable to think of a satisfactory alternative. I will use “classical” for classical mathematics and classical nonlinear logic, and “linear” (resp. “affine”) for the “classical” form of linear (resp. affine) logic that has an involutive negation.

2 Here “nonzero” means “apart from zero.”

3 For instance, can one really formulate mathematics sufficiently carefully that it can all be done in affine logic, without contraction? Is there a form of affine dependent type theory that would be appropriate for such a mathematics?

4 The origin of the terminology is apparently the fact that the distributive law in linear logic is $P \boxtimes (Q\sqcup R) \equiv (P\boxtimes Q) \sqcup (P\boxtimes R)$ , i.e., “multiplication distributes over addition.”

5 There is no uniformity in notation for linear logic. The most common notation for $\boxtimes $ is $\otimes $ , but are also used, whereas has been denoted by . Notations for $\sqcap /\sqcup $ include $\&/\oplus $ , $\land /\lor $ , and $\times /+$ . Our and $\sqcap /\sqcup $ visually represent De Morgan duality, do not clash with other standard notations that I know of, and are easily distinguishable.

6 Univalent type theory [52] combines proof-relevance and proof-irrelevance in one framework, relating them by propositional truncation; thus one can choose between the two case-by-case rather than globally. But it is not clear to me how to incorporate this flexibility into the antithesis model.

7 The definition of Seely comonad is usually expanded out more explicitly in terms of coherent isomorphisms such as these, but we will not need that. This is also apparently the origin of the sobriquet “exponential” for the modalities such as , since “exponentials turn additives into multiplicatives” is akin to $\exp (a+b) = \exp (a)\cdot \exp (b)$ .

8 We do not actually get a translation of all of intuitionistic logic, because may not have coproducts. Thus, there is no obvious way to interpret $(P\lor Q)$ ; as noted already by Seely [Reference Seely44] it is hard to semantically justify Girard’s formula . However, as we will see, in our case of interest these coproducts exist automatically.

9 We will see in the rest of the paper that $\sqcap $ and do often appear in affine representations of concepts from intuitionistic constructive mathematics. But the mathematician using intuitionistic logic has to write out the corresponding more complicated statement using $\land $ , $\lor $ , and $\to $ , and hence is not used to using the words “and” and “or” for $\sqcap $ and respectively.

10 If we wanted to define comprehension for linear hyperdoctrines in addition to affine ones, we would need to replace the terminal object $1$ by the monoidal unit.

11 These rules must hold up to a judgmental equality of terms, the syntactic counterpart of equality of morphisms in T. This is distinct from the equality propositions of Section 4.3.

12 Called a “weak generic object” in [Reference Jacobs25, Section 5.2].

13 This term is inspired by the “anafunctors” of [Reference Makkai32].

14 Strictly speaking we should either quotient these functions by pointwise equality or consider $\mathfrak {A}\mathbf {Set}$ to be some sort of “e-category,” but we will not delve into these waters.

15 Recall that for us, an inequality relation is irreflexive and symmetric, an apartness relation additionally satisfies $(x\neq z) \vdash (x\neq y) \lor (y\neq z)$ , and is tight if $\neg (x\neq y) \vdash (x= y)$ . In [Reference Troelstra and van Dalen50] an “apartness” is necessarily tight (otherwise they speak of a “pre-apartness”), and in [Reference Mines, Richman and Ruitenburg35] it seems that no axioms are demanded in general of a relation called “inequality.”

16 https://github.com/leanprover/lean2/blob/master/library/algebra/order.lean#L102; it was removed in Lean 3. I am indebted to Floris van Doorn for pointing this out.

17 The “strongly monotonic cuts” or “MacNeille reals” are also a complete lattice, but their meets and joins involve double-negation, making them less useful than those of $\mathcal {C}$ ; see [Reference Richman43, Section 3].

18 A simpler attempt at (10.8) would be , but that is inconsistent with reflexivity at least in the antithesis translation, since .

19 [Reference Bridges and Vîţă11] writes $x\notin K$ to mean $\neg (x\in K)$ ; our $x\notin K$ is written there as $x\in \mathord {\sim }K$ .

20 Or a “linearity” restriction that it must be used exactly once, but this is harder to justify philosophically, since affinity is sufficient to ensure constructivity.

References

Abramsky, S. and Lenisa, M., Linear realizability and full completeness for typed lambda-calculi. Annals of Pure and Applied Logic, vol. 134 (2005), no. 2, pp. 122–168.
Abramsky, S., Haghverdi, E., and Scott, P., Geometry of interaction and linear combinatory algebras. Mathematical Structures in Computer Science, vol. 12 (2002), no. 5, pp. 625–665.
Aczel, P. and Gambino, N., Collection principles in dependent type theory, Types for Proofs and Programs (P. Callaghan, Z. Luo, J. McKinna, and R. Pollack, editors), Springer, Berlin, 2002, pp. 1–23.
Awodey, S. and Kishida, K., Topology and modality: The topological interpretation of first-order modal logic. The Review of Symbolic Logic, vol. 1 (2008), no. 2, pp. 146–166.
Awodey, S., Kishida, K., and Kotzsch, H.-C., Topos semantics for higher-order modal logic. Logique et Analyse, vol. 57 (2014), no. 228, pp. 591–636.
Barr, M., *-Autonomous Categories, Lecture Notes in Mathematics, vol. 752, Springer, Berlin, 1979.
Barr, M., $\ast$-autonomous categories and linear logic. Mathematical Structures in Computer Science, vol. 1 (1991), no. 2, pp. 159–178.
Bartels, T. and Trimble, T., Cheng space, 2012. Available at https://ncatlab.org/nlab/show/Cheng+space.
Bishop, E., Foundations of Constructive Analysis, McGraw-Hill Series in Higher Mathematics, McGraw-Hill, New York, 1967.
Bishop, E. and Bridges, D., Constructive Analysis, Springer, Heidelberg, 1985.
Bridges, D. S. and Vîţă, L. S., Apartness and Uniformity: A Constructive Development, Springer, Berlin–Heidelberg, 2011.
Carboni, A., Lack, S., and Walters, R. F. C., Introduction to extensive and distributive categories. Journal of Pure and Applied Algebra, vol. 84 (1993), no. 2, pp. 145–158.
Chu, P.-H., Constructing *-autonomous categories, M.Sc. thesis, McGill University, 1978.
Chu, P.-H., Constructing *-autonomous categories, *-Autonomous Categories, Lecture Notes in Mathematics, vol. 752, Springer, Berlin, 1979, Appendix.
Cockett, J. R. B. and Seely, R. A. G., Proof theory for full intuitionistic linear logic, bilinear logic, and MIX categories. Theory and Applications of Categories, vol. 3 (1997), no. 5, pp. 85–131.
Conway, J. H., On Numbers and Games, second ed., A K Peters, Natick, 2001.
Escardó, M. and Xu, C., The inconsistency of a Brouwerian continuity principle with the Curry–Howard interpretation, 13th International Conference on Typed Lambda Calculi and Applications (T. Altenkirch, editor), Schloss Dagstuhl-Leibniz-Zentrum für Informatik, Dagstuhl, 2015, pp. 153–164.
Forsberg, F. N. and Setzer, A., A finite axiomatisation of inductive-inductive definitions, Logic, Construction, Computation (U. Berger, H. Diener, P. Schuster, and M. Seisenberger, editors), De Gruyter, Berlin, 2013, pp. 259–288.
Girard, J.-Y., Linear logic. Theoretical Computer Science, vol. 50 (1987), no. 1, pp. 1–101.
Gödel, K., Über eine bisher noch nicht benützte Erweiterung des finiten Standpunktes. Dialectica, vol. 12 (1958), pp. 280–287.
Grišin, V. N., Predicate and set-theoretic calculi based on logic without contractions. Mathematics of the USSR-Izvestiya, vol. 18 (1982), no. 1, pp. 41–59.
Hofmann, M., On the interpretation of type theory in locally Cartesian closed categories, Proceedings of Computer Science Logic (L. Pacholski and J. Tiuryn, editors), Lecture Notes in Computer Science, Springer, Berlin, 1994, pp. 427–441.
Hofstra, P., The Dialectica monad and its cousins, Models, Logics, and Higher-Dimensional Categories: A Tribute to the Work of Mihály Makkai (B. Hart, T. G. Kucera, A. Pillay, P. J. Scott, and R. A. G. Seely, editors), American Mathematical Society, 2011, pp. 107–137.
Hyland, J. M. E., Johnstone, P. T., and Pitts, A. M., Tripos theory. Mathematical Proceedings of the Cambridge Philosophical Society, vol. 88 (1980), no. 2, pp. 205–231.
Jacobs, B., Categorical Logic and Type Theory, Studies in Logic and the Foundations of Mathematics, vol. 141, North-Holland, Amsterdam, 1999.
Johnstone, P. T., Rings, fields, and spectra. Journal of Algebra, vol. 49 (1977), no. 1, pp. 238–260.
Johnstone, P. T., Sketches of an Elephant: A Topos Theory Compendium, vol. 2, Oxford Logic Guides, vol. 43, Oxford Science Publications, Oxford, 2002.
Lawvere, F. W., Equality in hyperdoctrines and comprehension schema as an adjoint functor, Applications of Categorical Algebra, American Mathematical Society, Providence, 1970, pp. 1–14.
Lawvere, F. W., Adjointness in foundations. Reprints in Theory and Applications of Categories, vol. 16 (2006), pp. 1–16 (electronic). Reprinted from Dialectica, vol. 23 (1969).
Lovas, W. and Crary, K., Structural normalization for classical natural deduction, 2006. Available at https://www.cs.cmu.edu/~wlovas/papers/clnorm.pdf.
Lumsdaine, P. L. and Warren, M. A., The local universes model: An overlooked coherence construction for dependent type theories. ACM Transactions on Computational Logic, vol. 16 (2015), no. 3, pp. 23:1–23:31.
Makkai, M., Avoiding the axiom of choice in general category theory. Journal of Pure and Applied Algebra, vol. 108 (1996), no. 2, pp. 109–173.
McKinna, J., Deliverables: A categorical approach to program development in type theory, Ph.D. thesis, University of Edinburgh, 1992.
Melliès, P.-A., Categorical semantics of linear logic, Interactive Models of Computation and Program Behaviour (P.-L. Curien, H. Herbelin, J.-L. Krivine, and P.-A. Melliès, editors), Panoramas et Synthèses, vol. 27, Société Mathématique de France, Paris, 2009, pp. 1–196.
Mines, R., Richman, F., and Ruitenburg, W., A Course in Constructive Algebra, Springer, Berlin, 1988.
Nelson, D., Constructible falsity. The Journal of Symbolic Logic, vol. 14 (1949), no. 1, pp. 16–26.
O’Hearn, P. W. and Pym, D. J., The logic of bunched implications, this Journal, vol. 5 (1999), no. 2, pp. 215–244.
Oliva, P., An analysis of Gödel’s Dialectica interpretation via linear logic. Dialectica, vol. 62 (2008), no. 2, pp. 269–290.
de Paiva, V., The Dialectica categories, Categories in Computer Science and Logic (J. Gray and A. Scedrov, editors), Contemporary Mathematics, vol. 92, American Mathematical Society, Providence, 1989.
de Paiva, V., A Dialectica-like model of linear logic, Category Theory and Computer Science (D. H. Pitt, D. E. Rydeheard, P. Dybjer, A. M. Pitts, and A. Poigné, editors), Springer, Berlin–Heidelberg, 1989, pp. 341–356.
de Paiva, V., Dialectica and Chu constructions: Cousins? Theory and Applications of Categories, vol. 17 (2006), no. 7, pp. 127–152.
Patterson, A. L., Implicit programming and the logic of constructible duality, Ph.D. thesis, University of Illinois at Urbana-Champaign, 1998.
Richman, F., Generalized real numbers in constructive mathematics. Indagationes Mathematicae, vol. 9 (1998), no. 4, pp. 595–606.
Seely, R., Linear logic, *-autonomous categories and cofree coalgebras, Categories in Logic and Computer Science (J. W. Gray and A. Scedrov, editors), Contemporary Mathematics, vol. 92, American Mathematical Society, Providence, 1989.
Shramko, Y., Dual intuitionistic logic and a variety of negations: The logic of scientific research. Studia Logica, vol. 80 (2005), pp. 347–367.
Shulman, M., The 2-Chu–Dialectica construction and the polycategory of multivariable adjunctions. Theory and Applications of Categories, vol. 35 (2020), no. 4, pp. 89–136.
Taylor, P., Intuitionistic sets and ordinals. The Journal of Symbolic Logic, vol. 61 (1996), no. 3, pp. 705–744.
Trafford, J., Co-constructive logics for proofs and refutations. Studia Humana, vol. 3 (2015), no. 4, pp. 22–40.
Troelstra, A. S. and van Dalen, D., Constructivism in Mathematics, vol. I, Studies in Logic and the Foundations of Mathematics, vol. 121, North-Holland, Amsterdam, 1988.
Troelstra, A. S. and van Dalen, D., Constructivism in Mathematics, vol. II, Studies in Logic and the Foundations of Mathematics, vol. 123, North-Holland, Amsterdam, 1988.
Trotta, D., An algebraic approach to the completions of elementary doctrines, preprint, 2021, arXiv:2108.03415.
Univalent Foundations Program, Homotopy Type Theory: Univalent Foundations of Mathematics, first ed., 2013. Available at http://homotopytypetheory.org/book/.
Vickers, S., Topology via Logic, Cambridge Tracts in Theoretical Computer Science, vol. 5, Cambridge University Press, Cambridge, 1996.