1 Introduction

This paper follows the line of work that focuses on determining whether there is a weakest theory that is essentially undecidable. We formulate two natural essentially undecidable theories in the language of concatenation that are purely universally axiomatised.

A countable first-order theory is called essentially undecidable if any consistent extension, in the same language, is undecidable (there is no algorithm for deciding whether an arbitrary sentence is a theorem). A countable first-order theory is called essentially incomplete if any recursively axiomatizable consistent extension is incomplete. It is known that a theory is essentially undecidable if and only if it is essentially incomplete. Indeed, if a theory is not essentially undecidable, then by definition it has a consistent decidable extension which can be extended to a decidable complete consistent theory (see Chapter 1 of Tarski et al. [18]). On the other hand, if a theory is not essentially incomplete, then by definition it has a complete consistent recursively axiomatizable extension which clearly is decidable. Two theories that are known to be essentially undecidable are Robinson arithmetic \( {\textsf {Q}} \) and the related theory \( {\textsf {R}} \). Rosser‘s generalization of Gödel‘s first incompleteness theorem is usually taken as the statement that \( {\textsf {Q}} \) is essentially undecidable.

$$\begin{aligned} \begin{array}{r l c c r l } &{}{\textsf {The Axioms of } {\textsf {R}} } &{} &{} &{} &{} {\textsf {The Axioms of } {\textsf {Q}} } \\ {{\textsf {R}}}_{1} &{} {\overline{n}} + {\overline{m}} = \overline{n+m} &{} &{} &{} {\textsf {Q}}_{1} &{} \forall x y \; [ \ x \ne y \rightarrow \mathrm {S} x \ne \mathrm {S} y \ ] \\ {\textsf {R}}_{{2} } &{} {\overline{n}} \times {\overline{m}} = \overline{n \times m} &{} &{} &{} {\textsf {Q}}_{2} &{} \forall x \; [ \ \mathrm {S} x \ne 0 \ ] \\ {\textsf {R}}_{{3} } &{} {\overline{n}} \ne {\overline{m}} \text{ if } n \ne m &{} &{} &{} {\textsf {Q}}_{3} &{} \forall x \; [ \ x=0 \vee \exists y \; [ \ x = \mathrm {S} y \ ] \ ] \\ {\textsf {R}}_{{4} } &{} \forall x \; [ \ x \le {\overline{n}} \rightarrow \bigvee _{k \le n } x = {\overline{k}} \ ] &{} &{} &{} {\textsf {Q}}_{4} &{} \forall x \; [ \ x+0 = x \ ] \\ {\textsf {R}}_{{5} } &{} \forall x \; [ \ x \le {\overline{n}} \vee {\overline{n}} \le x \ ] &{} &{} &{} {\textsf {Q}}_{5} &{} \forall x y \; [ \ x+ \mathrm {S} y = \mathrm {S} ( x+y) \ ] \\ &{} &{} &{} &{} {\textsf {Q}}_{6} &{} \forall x \; [ \ x \times 0 = 0 \ ] \\ &{} &{} &{} &{} {\textsf {Q}}_{7} &{} \forall x y \; [ \ x \times \mathrm {S} y = (x \times y) + x \ ] \end{array} \end{aligned}$$

The main objective of this paper is to show that two theories, \( \textsf {WD} \) and \( {\textsf {D}} \), are mutually interpretable with \( {\textsf {R}} \) and \( {\textsf {Q}} \), respectively. The axioms of \( \textsf {WD} \) and \( {\textsf {D}} \) are given below using juxtaposition instead of the binary function symbol of the formal language. Later we regard \( {\textsf {D}} \) as the theory without the axiom \( {\textsf {D}}_{5} \). We do this because the two versions of \( {\textsf {D}} \) are mutually interpretable. The theories \( \textsf {WD} \) and \( {\textsf {D}} \) are theories in the language of concatenation extended with a binary relation symbol. That is, the language \( \lbrace 0, 1, \circ , \preceq \rbrace \) where 0 and 1 are constant symbols, \( \circ \) is a binary function symbol and \( \preceq \) is a binary relation symbol. The intended structure \( {\mathfrak {D}} \) is the free semigroup with two generators extended with the prefix relation which we denote \( \preceq ^{ {\mathfrak {D}} } \). As in number theory, each element \( \alpha \) in the universe is associated with a canonical term \( {\overline{\alpha }} \). The inclusion of a binary relation in the languages makes it easier to define \(\varSigma _1\)-formulas and to give a purely universal \(\varSigma _1\)-complete axiomatization of \( {\mathfrak {D}} \). We observe that any theory that proves all true \( \varSigma _1\)-sentences is an extension of \( \textsf {WD} \). Indeed, instances of \( \textsf {WD}_{1} \) and \( \textsf {WD}_{2} \) are true \( \varSigma _1\)-sentences, and each instance of \( \textsf {WD}_{3} \) follows from the true \( \varSigma _1\)-sentences \( \forall x \preceq {\overline{\alpha }} \; [ \ \bigvee _{ \gamma \preceq ^{ {\mathfrak {D}} } \alpha } x = \overline{ \gamma } \ ] \) and \( \bigwedge _{ \gamma \preceq ^{ {\mathfrak {D}} } \alpha } {\overline{\gamma }} \preceq {\overline{\alpha }} \). The theory \( \textsf {WD} \) is thus the weakest \( \varSigma _1\)-complete axiomatization of \( {\mathfrak {D}} \) (modulo closure under logical implication) and the theory \( {\textsf {D}} \) is a natural finitely axiomatizable extension of \( \textsf {WD} \). A variant of \( {\textsf {D}} \) where we have an identity element was introduced in Kristiansen and Murwanashyaka [9] as a \(\varSigma _1 \)-complete axiomatization of the structure \( {\mathfrak {D}} \) extended with the empty string. In [9, 10], we identify a number of decidable and undecidable fragments of \( {\mathfrak {D}} \) and related structures.

$$\begin{aligned} \begin{array}{r l c r l } &{}{\textsf {The Axioms of } \textsf {WD} } &{} &{} &{} {\textsf {The Axioms of } {\textsf {D}} } \\ \textsf {WD}_{1} &{} {\overline{\alpha }} \ {\overline{\beta }} = \overline{ \alpha \beta } &{} &{} {\textsf {D}}_{1} &{} \forall x y z \; [ \ (x y) z = x (y z) \ ] \\ \textsf {WD}_{2} &{} {\overline{\alpha }} \ne {\overline{\beta }} \text{ if } \alpha \ne \beta &{} &{} {\textsf {D}}_{2} &{} \forall x y \; [ \ x \ne y \rightarrow ( \ x 0 \ne y 0 \wedge x 1 \ne y 1 \ ) \ ] \\ \textsf {WD}_{3} &{} \forall x \; [ \ x \preceq {\overline{\alpha }} \leftrightarrow \bigvee _{ \gamma \preceq ^{ {\mathfrak {D}} } \alpha } x= {\overline{\gamma }} \ ] &{} &{} {\textsf {D}}_{3} &{} \forall x y \; [ \ x0 \ne y1 \ ] \\ &{} &{} &{} {\textsf {D}}_{4} &{} \forall x \; [ \ x \preceq 0 \leftrightarrow x=0 \ ] \\ &{} &{} &{} {\textsf {D}}_{5} &{} \forall x \; [ \ x \preceq 1 \leftrightarrow x=1 \ ] \\ &{} &{} &{} {\textsf {D}}_{6} &{} \forall x y \; [ \ x \preceq y 0 \leftrightarrow ( \ x= y 0 \vee x \preceq y \ ) \ ] \\ &{} &{} &{} {\textsf {D}}_{7} &{} \forall x y \; [ \ x \preceq y 1 \leftrightarrow ( \ x= y 1 \vee x \preceq y \ ) \ ] \end{array} \end{aligned}$$

The theory of concatenation \( \textsf {TC} \) was introduced by Grzegorczyk in [4] where he also showed that it is undecidable. The language of \( \textsf {TC} \) consists only of the two constant symbols 0 and 1 and the binary function symbol \( \circ \). The intended model of \( \textsf {TC} \) is a free semigroup with at least two generators. That is, a structure of the form \( ( \varGamma ^+, a_1, \ldots , a_n, ^\frown )\) where \(\varGamma = \lbrace a_1, \ldots , a_n \rbrace \) is a finite alphabet with at least two symbols, \( \varGamma ^+ \) is the set of all finite non-empty strings over \(\varGamma \) and \( ^\frown \) is the binary operator that concatenates elements of \(\varGamma ^+\). Grzegorczyk‘s motivation for introducing the theory \( \textsf {TC} \) was that, as computation involves manipulation of text, the notion of computation can be formulated on the basis of discernibility of text without reference to natural numbers. Then, undecidability of first-order logic and essential undecidability can be explained using a theory of strings thereby avoiding complicated coding of syntax based on natural numbers. In [5], Grzegorczyk and Zdanowski showed that TC is essentially undecidable. This was further improved in Ganea [3], Visser [19] and Švejdar [16] where it was shown that TC is mutually interpretable with Robinson arithmetic. In Higuchi and Horihata [6], it was shown that TC is minimal essentially undecidable. That is, removing any one of the axioms of \( \textsf {TC} \) gives a theory that is not essentially undecidable. We will refer to \( \textsf {TC} \) as \(\textsf {TC}^{{-\varepsilon } } \) and we will let \( \textsf {TC} \) refer to the theory where we have an identity element. The two theories are known to be mutually interpretable (see Grzegorczyk and Zdanowski [5] and Visser [19]). In the article [6], it was also shown that a weak theory of concatenation \({\textsf {WTC}^{-\varepsilon } } \) is minimal essentially undecidable and mutually interpretable with \( {\textsf {R}} \). That \({\textsf {WTC}^{-\varepsilon } } \) is minimal essentially undecidable means that removal of any one of the axiom schemas of \({\textsf {WTC}^{-\varepsilon } } \) gives a theory that is not essentially undecidable.

$$\begin{aligned}&\begin{array}{r l } &{}{\textsf {The Axioms of } \textsf {TC}^{{-\varepsilon } } } \\ \\ {\textsf {TC}^{-\varepsilon }_1 } &{} \forall x y z \; [ \ x(yz) = (xy)z \ ] \\ {\textsf {TC}^{-\varepsilon }_2 } &{} \forall x y z w \; [ \ ( \ xy = zw \rightarrow \big ( \ ( \ x=z \wedge y=w) \vee \ \\ &{} \exists u \; [ \ ( \ z= xu \wedge uw = y \ ) \vee ( \ x = zu \wedge uy = w \ ) \ ] \ \big ) \ ] \\ {\textsf {TC}^{-\varepsilon }_3 } &{} \forall x y \; [ \ xy \ne 0 \ ] \\ {\textsf {TC}^{-\varepsilon }_4 } &{} \forall x y \; [ \ xy \ne 1 \ ] \\ {\textsf {TC}^{-\varepsilon }_5 } &{} 0 \ne 1 \end{array} \\&\begin{array}{r l } &{}{\textsf {The Axioms of } {\textsf {WTC}^{-\varepsilon } } } \\ \\ {\textsf {WTC}^{-\varepsilon }_1 } &{} \forall x y z \; [ \ ( \ x(yz) \sqsubseteq _{ {\textsf {s}} } {\overline{\alpha }} \vee (xy)z \sqsubseteq _{ {\textsf {s}} } {\overline{\alpha }} \ ) \rightarrow x(yz) = (xy)z \ ] \\ {\textsf {WTC}^{-\varepsilon }_2 } &{} \forall x y z w \; [ \ ( \ xy = zw \wedge xy \sqsubseteq _{ {\textsf {s}} } {\overline{\alpha }} \ ) \rightarrow \big ( \ ( \ x=z \wedge y=w) \vee \ \\ &{} \exists u \; [ \ ( \ z= xu \wedge uw = y \ ) \vee ( \ x = zu \wedge uy = w \ ) \ ] \ \big ) \ ] \\ {\textsf {WTC}^{-\varepsilon }_3 } &{} \forall x y \; [ \ xy \ne 0 \ ] \\ {\textsf {WTC}^{-\varepsilon }_4 } &{} \forall x y \; [ \ xy \ne 1 \ ] \\ {\textsf {WTC}^{-\varepsilon }_5 } &{} 0 \ne 1 \end{array} \end{aligned}$$

We use \( x \sqsubseteq _{ {\textsf {s}} } y \) as shorthand for

$$\begin{aligned} y = x \vee \exists uv \; [ \ y = ux \vee y = xv \vee y = (ux) v \vee y= u (xv) \ ] \ . \end{aligned}$$

The diagram in Fig. 1 summarizes the relationships between \( {\textsf {R}}\), \( {\textsf {Q}} \), \( \textsf {WD} \), \( {\textsf {D}} \), \( \textsf {WTC}^{ -\varepsilon } \) and \(\textsf {TC}^{ -\varepsilon } \).

Fig. 1
figure 1

\(S \rightarrow T\) means S is interpretable in T but S does not interpret T. \(S \leftrightarrow T\) means S and T are mutually interpretable

Kristiansen and Murwanashyaka [11] have also introduced two essentially undecidable theories, \( \textsf {WT} \) and \( {\textsf {T}} \), that are purely universally axiomatised. While \( \textsf {WD} \) and \( {\textsf {D}} \) are theories of concatenation, \( \textsf {WT} \) and \( {\textsf {T}} \) are theories of terms. The intended model of \( \textsf {WT} \) and \( {\textsf {T}} \) is the extended term algebra given by the language \({\mathscr {L}}_{ {\textsf {T}} } = \lbrace \perp , \langle \cdot , \cdot \rangle , \sqsubseteq \rbrace \) where \( \perp \) is a constant symbol, \( \langle \cdot , \cdot \rangle \) is a binary function symbol and \( \sqsubseteq \) is a binary relation symbol. The universe is the set of all variable-free \( {\mathscr {L}}_{ {\textsf {T}} } \)-terms and \( \langle \cdot , \cdot \rangle \) is interpreted as the function that maps a pair (st) to the term \( \langle s, t \rangle \). Every variable-free term is thus realized as itself. The relation symbol \( \sqsubseteq \) is interpreted as the subterm relation. The theory \( \textsf {WT} \) has a compact axiomatization that consists only of analogues of \( \textsf {WD}_{2} \) and \( \textsf {WD}_{3} \). An analogue of \( \textsf {WD}_{1} \) is not necessary since it holds by pure logic. The theory \( {\textsf {T}} \) has two axioms

$$\begin{aligned} \textsf {T}_{1} \equiv \ \forall x y \; [ \ \langle x, y \rangle \ne \perp \ ] \ \text{ and } \textsf {T}_{2} \equiv \ \forall x y z w \; [ \ \langle x, y \rangle = \langle z, w \rangle \rightarrow ( \ x=z \wedge y=w \ ) ] \end{aligned}$$

that describe \( \langle \cdot , \cdot \rangle \); and two axioms

$$\begin{aligned}&\textsf {T}_{3} \equiv \ \forall x \; [ \ x \sqsubseteq \perp \leftrightarrow x= \perp \ ] \ \text{ and } \\&\textsf {T}_{4} \equiv \ \forall x y z \; [ \ x \sqsubseteq \langle y , z \rangle \leftrightarrow ( \ x = \langle y , z \rangle \vee x \sqsubseteq y \vee x \sqsubseteq z \ ) \ ] \end{aligned}$$

that describe \( \sqsubseteq \). Kristiansen and Murwanashyaka give interpretations of \( {\textsf {R}} \) and \( {\textsf {Q}} \) in \( \textsf {WT} \) and \( {\textsf {T}} \), respectively. A result of Visser [20] ensures that \( \textsf {WT} \) is interpretable in \( {\textsf {R}} \) since it is locally finitely satisfiable. They conjecture that \( {\textsf {T}} \) and \( {\textsf {Q}} \) are mutually interpretable.

2 Preliminaries

In this section, we clarify a number of notions that we only glossed over in the previous section. We also introduce a number of intermediate theories that will be useful in showing that \( \textsf {WD} \) and \( {\textsf {R}} \) are mutually interpretable and that \( {\textsf {D}} \) and \( {\textsf {Q}} \) are mutually interpretable.

We consider the structures

$$\begin{aligned} \mathfrak {D^{-} } = ( \lbrace {\varvec{0}}, {\varvec{1}} \rbrace ^{ + } , {\varvec{0}}, {\varvec{1}}, ^\frown ) \ \ \text{ and } \ \ {\mathfrak {D}} = ( \lbrace {\varvec{0}}, {\varvec{1}} \rbrace ^{ + } , {\varvec{0}}, {\varvec{1}}, ^\frown , \preceq ^{ {\mathfrak {D}} } ) \end{aligned}$$

where \( \lbrace {\varvec{0}}, {\varvec{1}} \rbrace ^{ + } \) is the set of all finite non-empty strings over the alphabet \(\lbrace {\varvec{0}}, {\varvec{1}} \rbrace \), the binary operator \(^\frown \) concatenates elements of \( \lbrace {\varvec{0}}, {\varvec{1}} \rbrace ^{ + } \) and \( \preceq ^{ {\mathfrak {D}} } \) denotes the prefix relation, i.e., \(x \preceq ^{ {\mathfrak {D}} } y \) if and only if \(y = x\) or there exists \(z \in \lbrace {\varvec{0}}, {\varvec{1}} \rbrace ^{ + } \) such that \(y = x ^\frown z \). The structure \( \mathfrak {D^{-} } \) is thus the free semigroup with two generators. We call elements of \( \lbrace {\varvec{0}}, {\varvec{1}} \rbrace ^{ + } \) bit strings. The structures \( \mathfrak {D^{-} } \) and \( {\mathfrak {D}} \) are first-order structures over the languages \({\mathscr {L}}_{ \textsf {BT} }^{-} = \lbrace 0, 1, \circ \rbrace \) and \({\mathscr {L}}_{ \textsf {BT} } = \lbrace 0, 1, \circ , \preceq \rbrace \), respectively.

In first-order number theory, each natural number n is associated with a numeral \( {\overline{n}} \). Each non-empty bit string \( \alpha \in \lbrace {\varvec{0}}, {\varvec{1}} \rbrace ^{ + } \) is associated by recursion with a unique \({\mathscr {L}}_{ \textsf {BT} }^{-}\)-term \( {\overline{\alpha }} \), called a biteral, as follows: \( \overline{ {\varvec{0}} } \equiv 0\), \( \overline{ {\varvec{1}} } \equiv 1\), \( \overline{ \alpha {\varvec{0}} } \equiv ( {\overline{\alpha }} \circ 0 )\) and \( \overline{ \alpha {\varvec{1}} } \equiv ( {\overline{\alpha }} \circ 1 )\). The biterals are important if we, for example, want to show that certain sets are definable since we then need to talk about elements of \( \lbrace {\varvec{0}}, {\varvec{1}} \rbrace ^{ + } \) in the formal theory.

In the language \({\mathscr {L}}_{ \textsf {BT} } \), we can define the \(\varSigma _1\)-formulas as follows: atomic formulas and their negations are \(\varSigma _1\)-formulas. If \( \alpha \) and \( \beta \) are \(\varSigma _1\)-formulas, then \( \alpha \wedge \beta \), \( \alpha \vee \beta \) and \( \exists x \; \alpha \) are \(\varSigma _1\)-formulas. Furthermore, if the variable x does not occur in the term t, then \(\exists x \; [ \ x \preceq t \wedge \alpha \ ] \) and \(\forall x \; [ \ x \preceq t \rightarrow \alpha \ ] \) are also \(\varSigma _1\)-formulas. We use \(\exists x \preceq t \ \alpha \) and \(\forall x \preceq t \ \alpha \) as abbreviations for \(\exists x \; [ \ x \preceq t \wedge \alpha \ ] \) and \(\forall x \; [ \ x \preceq t \rightarrow \alpha \ ] \), respectively. To define \(\varSigma _1\)-formulas over the language \({\mathscr {L}}_{ \textsf {BT} }^{-} \), we first need to define a binary relation in order to have bounded quantifiers. Two natural choices are

  • \( x \preceq _{ {\textsf {i}} } y \equiv \ y = x \vee \exists z \; [ \ y = x z \ ] \)

  • \( x \sqsubseteq _{ {\textsf {s}} } y \equiv \ y = x \vee \exists uv \; [ \ y = ux \vee y = xv \vee y = (ux) v \vee y= u (xv) \ ] \).

Over the structure \( {\mathfrak {D}}^{-}\), the defined relation \( \preceq _{ {\textsf {i}} } \) is realized as the prefix relation while \( \sqsubseteq _{ {\textsf {s}} } \) is realized as the substring relation. Given a bit string \(\alpha \), the set of those bit strings that are substrings of \(\alpha \) is denoted \( \langle ... \alpha \rangle \) and consists of those bit strings \(\beta \) such that \( \beta = \alpha \) or there exist bit strings u and v such that \(\alpha = u \beta \) or \( \alpha = \beta v \) or \(\alpha = u \beta v \). We choose to work with \( \sqsubseteq _{ {\textsf {s}} } \) since the intended interpretation of \(\preceq \) is the prefix relation. We observe that the number of substrings of a string \( \alpha \) is quadratic in the length of \( \alpha \), whereas the number of prefixes of \( \alpha \) is linear in the length of \( \alpha \). This means that the choice between \( \preceq _{ {\textsf {i}} } \) and \( \sqsubseteq _{ {\textsf {s}} } \) could make a difference in the context of very weak theories.

Having introduced \(\varSigma _1\)-formulas, it is natural to try to find \(\varSigma _1\)-complete axiomatizations of the structures \({\mathfrak {D}}^{-} \) and \({\mathfrak {D}}\). That is, to find theories that prove all true \(\varSigma _1\)-sentences (sentences are formulas without free variables) and that are such that the non-logical axioms are true over the intended structure. A natural first step is to introduce the theories \( \textsf {WBT} \) and \( \textsf {WD} \) defined below. It is not difficult to see that these two theories are \(\varSigma _1\)-complete. The theories \( \textsf {WBT} \) and \( \textsf {WD} \) are not finitely axiomatizable but they are the weakest possible \(\varSigma _1\)-complete axiomatizations of \({\mathfrak {D}}^{-} \) and \({\mathfrak {D}}\), respectively, modulo closure under logical implication. Once we have \( \textsf {WBT} \) and \( \textsf {WD} \), the theories \( \textsf {BTQ} \) and \( {\textsf {D}} + \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \) defined below are natural finitely axiomatizable extensions (T is an extension of S if the language of S is a subset of the language of T and every theorem of S is a theorem of T). The reason for not having \( \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \) as an axiom of \( {\textsf {D}} \) is that it is not necessary for essential undecidability (we could, of course, very well have worked with the theory where we have \( \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \) instead of \( \forall x \; [ \ x \preceq 0 \leftrightarrow x = 0 \ ] \)). Although the theories \( \textsf {WBT} \), \( \textsf {BTQ}\), \( \textsf {WD} \) and \( {\textsf {D}} \) are \(\varSigma _1\)-complete, it is not at all obvious that they are essentially undecidable. When proving that \( {\textsf {R}} \) is essentially undecidable, the axiom schema \( \textsf {R}_{5} \equiv \forall x \; [ \ x \le {\overline{n}} \vee {\overline{n}} \le x \ ] \) is essential. It is however not straightforward to define a binary relation that provably satisfies the analogue of \( \textsf {R}_{5} \). The method of relative interpretability then becomes important for establishing that these theories are essentially undecidable.

$$\begin{aligned}&\begin{array}{r l c c r l } &{}{\textsf {The Axioms of } \textsf {WBT } } &{} &{} &{} &{}{\textsf {The Axioms of } \textsf {WD } } \\ \textsf {WBT}_{1} &{} {\overline{\alpha }} \ {\overline{\beta }} = \overline{ \alpha \beta } &{} &{} &{} \textsf { WD}_{1 } &{} {\overline{\alpha }} \ {\overline{\beta }} = \overline{ \alpha \beta } \\ \textsf {WBT}_{2} &{} {\overline{\alpha }} \ne {\overline{\beta }} \text{ if } \alpha \ne \beta &{} &{} &{} \textsf {WD}_{2} &{} {\overline{\alpha }} \ne {\overline{\beta }} \text{ if } \alpha \ne \beta \\ \textsf {WBT}_{3} &{} \forall x \; [ \ x \sqsubseteq _{ {\textsf {s}} } {\overline{\alpha }} \rightarrow \bigvee _{ \gamma \in \langle ... \alpha \rangle } x= {\overline{\gamma }} \ ] &{} &{} &{} \textsf {WD}_{3} &{} \forall x \; [ \ x \preceq {\overline{\alpha }} \leftrightarrow \bigvee _{ \gamma \preceq ^{ {\mathfrak {D}} } \alpha } x= {\overline{\gamma }} \ ] \end{array} \\&\begin{array}{r l } &{}{\textsf {The Axioms of } \textsf {BTQ } } \\ \\ \textsf {BTQ}_{1} &{} \forall x y z \; [ \ (xy)z = x (yz) \ ] \\ \textsf {BTQ}_{2} &{} \forall x y \; [ \ x \ne y \rightarrow ( \ x 0 \ne y 0 \wedge x 1 \ne y 1 \ ) \ ] \\ \textsf {BTQ}_{3} &{} \forall x y \; [ \ x0 \ne y1 \ ] \\ \textsf {BTQ}_{4} &{} \forall x \; [ \ x =0 \vee x=1 \vee \exists y \; [ \ x=y0 \vee x= y1 \ ] \ ] \end{array} \\&\begin{array}{r l } &{}{\textsf {The Axioms of } {\textsf {D}} } \\ \textsf {D}_{1 } &{} \forall x y z \; [ \ (x y) z = x (y z) \ ] \\ {\textsf {D}}_{2} &{} \forall x y \; [ \ x \ne y \rightarrow ( \ x 0 \ne y 0 \wedge x 1 \ne y 1 \ ) \ ] \\ {\textsf {D}}_{3} &{} \forall x y \; [ \ x0 \ne y1 \ ] \\ {\textsf {D}}_{4} &{} \forall x \; [ \ x \preceq 0 \leftrightarrow x=0 \ ] \\ {\textsf {D}}_{5} &{} \forall x y \; [ \ x \preceq y 0 \leftrightarrow ( \ x= y 0 \vee x \preceq y \ ) \ ] \\ {\textsf {D}}_{6} &{} \forall x y \; [ \ x \preceq y 1 \leftrightarrow ( \ x= y 1 \vee x \preceq y \ ) \ ] \end{array} \\&\begin{array}{r l c c r l } &{}{\textsf {The Axioms of } \textsf {BT } } &{} &{} &{} &{}{\textsf {The Axioms of } {\textsf {C}} } \\ &{} \textsf {BTQ}_{1} , \ \textsf {BTQ}_{2} , \ \textsf {BTQ}_{3} , \ \textsf {WBT}_{3} &{} &{} &{} &{} {\textsf {D}}_{1} , \ {\textsf {D}}_{2} , \ {\textsf {D}}_{3} , \ \textsf {WD}_{3} \end{array} \end{aligned}$$

The overall relationship among the various theories introduced so far is summarized in Fig.  2. The constants 0 and 1 are atoms in \( \textsf {BTQ} \). This means that \( \textsf {BTQ} \vdash \forall xy \; [ \ xy \ne 0 \ ] \wedge \forall xy \; [ \ xy \ne 1 \ ] \) (see Lemma 8 in Ganea [3] for a proof). This observation is used in the proof of Theorem 7 where we show that \( {\textsf {D}} \) is interpretable in \( \textsf {BTQ} \).

Fig. 2
figure 2

\(S \rightarrow T\) means S is interpretable in T but S does not interpret T. \(S \leftrightarrow T\) means S and T are mutually interpretable. \(S \dashrightarrow T\) means S is interpretable in T but it is unknown whether S interprets T

We recall the method of relative interpretability introduced by Tarski [18] for showing that first-order theories are essentially undecidable. Let \({\mathscr {L}}_{1}\) and \({\mathscr {L}}_{2}\) be computable first-order languages. A relative translation \(\tau \) from \({\mathscr {L}}_{1}\) to \({\mathscr {L}}_{2}\) is a computable map given by:

  1. 1.

    An \({\mathscr {L}}_{2}\)-formula \(\delta (x)\) with exactly one free variable. The formula \(\delta (x)\) is called a domain.

  2. 2.

    For each n-ary relation symbol R of \({\mathscr {L}}_{1}\), an \({\mathscr {L}}_{2}\)-formula \( \psi _{R}(x_{1},...,x_{n})\) with exactly n free variables. The equality symbol \(=\) is treated as a binary relation symbol.

  3. 3.

    For each n-ary function symbol f of \({\mathscr {L}}_{1}\), an \({\mathscr {L}}_{2}\)-formula \(\psi _{f}(x_{1},...,x_{n},y)\) with exactly \(n+1\) free variables.

  4. 4.

    For each constant symbol c of \({\mathscr {L}}_{1}\), an \({\mathscr {L}}_{2}\)-formula \(\psi _{c}(y)\) with exactly one free variable.

We extend \(\tau \) to a translation of atomic \({\mathscr {L}}_{1}\)-formulas by mapping a \({\mathscr {L}}_{1}\)-term t to a \({\mathscr {L}}_{2}\)-formula \( ( t )^{ \tau , w } \) with a free variable w that denotes the value of t:

  1. 5.

    For each n-ary relation symbol R of \({\mathscr {L}}_{1}\)

    $$\begin{aligned} \big ( R(t_1, \ldots , t_n ) \big )^{ \tau } \equiv \ \exists v_1 \ldots v_n \; \left[ \ \bigwedge _{ i = 1 }^{ n } \delta (v_i ) \ \wedge \ \bigwedge _{ j = 1 }^{ n } ( t_j )^{ \tau , v_j } \ \wedge \ \psi _{ R } ( v_1, \ldots , v_n ) \ \right] \end{aligned}$$

    where \( v_1, \ldots , v_n \) are distinct variable symbols that do not occur in \( t_1, \ldots , t_n \) and

    1. (a)

      for each variable symbol x of \({\mathscr {L}}_{1}\), \( \; (x)^{ \tau , w } \equiv \ w = x \; \)

    2. (b)

      for each constant symbol c of \({\mathscr {L}}_{1}\), \( \; (c)^{ \tau , w } \equiv \ \psi _{c} (w) \; \)

    3. (c)

      for each n-ary function symbol f of \({\mathscr {L}}_{1}\)

      $$\begin{aligned} \big ( f(t_1, \ldots , t_n ) \big )^{ \tau , w } \equiv \ \exists w_1 \ldots w_n \; \left[ \ \bigwedge _{ i = 1 }^{ n } \delta (w_i ) \ \wedge \ \bigwedge _{ j = 1 }^{ n } ( t_j )^{ \tau , w_j } \ \wedge \ \psi _{ f } ( w_1, \ldots , w_n , w ) \ \right] \end{aligned}$$

      where \( w_1, \ldots , w_n \) are distinct variable symbols that do not occur in \( \bigwedge _{ j = 1 }^{ n } ( t_j )^{ \tau , w } \; \).

We extend \(\tau \) to a translation of all \({\mathscr {L}}_{1}\)-formulas as follows:

  1. 6.

    \(( \lnot \phi )^{\tau } \equiv \ \lnot \phi ^{\tau } \)

  2. 7.

    \(( \phi \oslash \psi )^{\tau } \equiv \phi ^{\tau } \oslash \psi ^{\tau } \) for \(\oslash \in \lbrace \wedge , \vee , \rightarrow , \leftrightarrow \rbrace \)

  3. 8.

    \(( \exists x \; \phi )^{\tau } \equiv \ \exists x \; [ \ \delta (x) \wedge \phi ^{\tau } \ ] \)

  4. 9.

    \(( \forall x \; \phi )^{\tau } \equiv \ \forall x \; [ \ \delta (x) \rightarrow \phi ^{\tau } \ ] \; \).

Let \( {S} \) be an \({\mathscr {L}}_{1}\)-theory and let \( {T} \) be an \({\mathscr {L}}_{2}\)-theory. We say that \( {S} \) is (relatively) interpretable in \( {T} \) if there exists a relative translation \(\tau \) such that

  • \( {T} \vdash \exists x \; \delta (x) \)

  • For each function symbol f of \({\mathscr {L}}_{1}\)

    $$\begin{aligned} {T} \vdash \bigwedge _{i=1}^{n} \delta (x_{i}) \rightarrow \exists ! y \; [ \ \delta (y) \wedge \psi _{f}(x_{1},...,x_{n}, y) \ ] \ . \end{aligned}$$
  • For each constant symbol c of \({\mathscr {L}}_{1}\)

    $$\begin{aligned} {T} \vdash \exists ! y \; [ \delta (y) \wedge \psi _{c}(y) ] \ . \end{aligned}$$
  • \( {T} \) proves \(\phi ^{\tau }\) for each non-logical axiom \(\phi \) of \( {S} \). If equality is not translated as equality, then \( {T} \) must prove the translation of each equality axiom.

If \( {S} \) is relatively interpretable in \( {T} \) and \( {T} \) is relatively interpretable in \( {S} \), we say that \( {S} \) and \( {T} \) are mutually interpretable.

The following proposition summarizes important properties of relative interpretability (see Tarski et al. [18] for the details).

Proposition 1

Let \( {S} \), \( {T} \) and \( {U} \) be computably enumerable first-order theories.

  1. 1.

    If \( {S} \) is interpretable in \( {T} \) and \( {T} \) is consistent, then \( {S} \) is consistent.

  2. 2.

    If \( {S} \) is interpretable in \( {T} \) and \( {T} \) is interpretable in \( {U} \), then \( {S} \) is interpretable in \( {U} \).

  3. 3.

    If \( {S} \) is interpretable in \( {T} \) and \( {S} \) is essentially undecidable, then \( {T} \) is essentially undecidable.

3 Intermediate theories

We show in our Master’s thesis [12] that \( \textsf {WBT} \) and \( \textsf {WTC}^{{ -\varepsilon } } \) are equivalent, i.e., they prove the same formulas. We do not include a proof of this result, but the interested reader may find the fairly straightforward proof in [12] . We also show in [12] that \( \textsf {BTQ} \) and \( {\textsf {TC}^{ -\varepsilon } } \) are mutually interpretable. Although this result is not trivial, we omit a proof since there is another way of seeing that \( \textsf {BTQ} \) is mutually interpretable with \( {\textsf {Q}} \). The theory \( \textsf {BTQ} \) is a fragment of the theory \( {\textsf {F}} \), introduced first by Alfred Tarski at the end of Chapter 3 of [18]. The theory \( \textsf {BTQ} \) also resembles the theory \(\textsf {Q}^{\textsf {bin}} \) in Visser [19]. One can view \(\textsf {Q}^{\textsf {bin}} \) as the analogue of \( \textsf {BTQ} \) where we have an identity element and associativity has been weakened to \( \forall x y \; [ \ (xy) 0 = x (y0) \wedge (xy) 1 = x (y 1 ) \ ] \). The theory \( {\textsf {F}} \) differs from \( \textsf {BTQ} \) in that the axiom \( \textsf {BTQ}_{2} \) is replaced with the axioms

$$\begin{aligned} \textsf {F}_{2} \equiv \ \forall x y z \; [ \ zx = zy \rightarrow x = y \ ] \ \text{ and } \ \textsf {F}_{3} \equiv \ \forall x y z \; [ \ xz = y z \rightarrow x = y \ ] \ . \end{aligned}$$

In Ganea [3], it is shown that \( {\textsf {F}} \! - \! \textsf {F}_{2} \) is mutually interpretable with \( {\textsf {Q}} \). Clearly, \( \textsf {BTQ} \) is interpretable in \( {\textsf {F}} \! - \! \textsf {F}_{2} \). The other way, we can interpret \( {\textsf {F}} \! - \! \textsf {F}_{2} \) in \( \textsf {BTQ} \) by simply relativizing quantification to the domain

$$\begin{aligned} J = \lbrace z : \ \forall x y \; [ \ xz = yz \rightarrow x =y \ ] \rbrace \ . \end{aligned}$$

It follows from \( \textsf {BTQ}_{2} \) that \(0,1 \in J \). We now show that J is closed under \(\circ \). Suppose \(z , w \in J \). We need to show that \(zw \in J\). We have

$$\begin{aligned} \begin{array}{r c l c l} x (zw) = y (zw) &{} \Rightarrow &{} (x z) w = (y z) w &{} &{} ( \text{ by } \textsf {BTQ}_{1} \ ) \\ &{} \Rightarrow &{} x z = y z &{} &{} ( \text{ since } w \in J \ ) \\ &{} \Rightarrow &{} x = y &{} &{} ( \text{ since } z \in J \ ) \ . \end{array} \end{aligned}$$

Hence, \( zw \in J \). Thus, J satisfies the domain condition. Clearly, the axioms of \( {\textsf {F}} \! - \! \textsf {F}_{2} \) hold on J. We show that the axiom \( \textsf {BTQ}_{4} \) holds on J. Suppose \(x \in J \). By \( \textsf {BTQ}_{4}\), if \( x \ne 0 \wedge x \ne 1\), then there exists y such that \( x=y0 \vee x= y1 \). We have

$$\begin{aligned} \begin{array}{r c l c c l } u y = v y &{} \Rightarrow &{} (uy ) 0 = (vy ) 0 \wedge (uy ) 1 = (vy ) 1 \\ &{} \Rightarrow &{} u (y 0 ) = v (y 0 ) \wedge u (y 1 ) = v (y 1 ) &{} &{} &{} ( \text{ by } \textsf {BTQ}_{1} \ ) \\ &{} \Rightarrow &{} u x = v x \\ &{} \Rightarrow &{} u = v &{} &{} &{} ( \text{ since } x \in J \ ) \ . \end{array} \end{aligned}$$

Hence, \( x \in J\) implies \( y \in J \). Thus, \( \textsf {BTQ}_{4} \) holds on J. It is not difficult to verify that the other axioms also hold on J. We can thus state the following theorem which will be used implicitly to show that \( {\textsf {D}} \) and \( {\textsf {Q}} \) are mutually interpretable.

Theorem 1

\( \textsf {BTQ} \) and \( {\textsf {Q}} \) are mutually interpretable.

The theories \( \textsf {BT} \) and \( {\textsf {C}} \) are examples of theories that lie strictly between \( {\textsf {R}} \) and \( {\textsf {Q}} \) w.r.t. relative interpretability. It follows from \( \forall x y \; [ \ x \ne y \rightarrow ( \ x 0 \ne y 0 \wedge x 1 \ne y 1 \ ) \ ] \) and \( \forall x y \; [ \ x0 \ne y1 \ ] \) that any model of \( \textsf {BT} \) or \( {\textsf {C}} \) is infinite. If \( \textsf {BT} \) (\( {\textsf {C}} \)) were interpretable in \( \textsf {WBT} \) (\( \textsf {WD} \)), it would be interpretable in a finite sub-theory. Since any finite sub-theory of \( \textsf {WBT} \) or \( \textsf {WD} \) clearly has a finite model, we cannot interpret \( \textsf {BT} \) (\( {\textsf {C}} \)) in \( \textsf {WBT} \) (\( \textsf {WD} \)). We now show that \(\textsf {BTQ} \) is not interpretable in \( \textsf {BT} \). Similar reasoning shows that \( {\textsf {D}} \) is not interpretable in \( {\textsf {C}} \). We let \( \textsf {BT}_{4} \) denote \( \textsf {WBT}_{3} \) and for \(1 \le i \le 3\) we let \(\textsf {BT}_{i} \) denote \( \textsf {BTQ}_{i}\).

Theorem 2

\( \textsf {BTQ} \) is not interpretable in \( \textsf {BT} \).

Proof

Suppose \(\tau \) is an interpretation of BTQ in BT. Then, there is a finite subset \(\varSigma \) of the axioms of BT such that

$$\begin{aligned} \varSigma \vdash \bigwedge _{j= 1}^{4} ( \textsf {BTQ}_{j} )^{ \tau } \ . \end{aligned}$$

To see that \(\tau \) cannot exist, it suffices to show that the theory given by \(\varSigma \) is interpretable in \(\textsf {BT} \! - \! \textsf {BT}_{4} \), which is not essentially undecidable by minimality of BTQ (see Lemma 11).

To see that \(\varSigma \) is interpretable in \(\textsf {BT} \! - \! \textsf {BT}_{4} \), it suffices to show that for each natural number \(n\ge 1\), the theory \(\textsf {BT}_{{ \le n} }\) is interpretable in BT. The theory \(\textsf {BT}_{{ \le n} }\) is like BT except that the axiom schema

$$\begin{aligned} \textsf {BT}_{4} \equiv \ \forall x \; [ \ x \sqsubseteq _{ {\textsf {s}} } {\overline{\alpha }} \rightarrow \bigvee _{ \gamma \in \langle ... \alpha \rangle } x= {\overline{\gamma }} \ ] \ . \end{aligned}$$

is limited to those \(\alpha \) such that the length of \(\alpha \), denoted \(\vert \alpha \vert \), is bounded by n. To do this, we define by recursion a sequence of domains

$$\begin{aligned} I_1 \supseteq I_2 \supseteq I_3 \supseteq ...... \end{aligned}$$

such that we obtain an interpretation of \(\textsf {BT}_{{ \le n } }\) in BT by simply relativizing quantification to \(I_n\). So, we proceed to construct these domains. We will omit parentheses most of the time since we have the axiom

$$\begin{aligned} \textsf {BT}_{1} \equiv \ \forall x y z \; [ \ (xy) z = x (yz) \ ] \end{aligned}$$

in \( \textsf {BT} \! - \! \textsf {BT}_{4} \).

Construction of \(I_1\):

We let

$$\begin{aligned} A_1 = \lbrace y : \ \forall x \; [ \ xy0 \ne 0 \wedge xy1 \ne 1 \ ] \ \rbrace . \end{aligned}$$

Suppose \(x00=0 \vee x01 = 1\). Then, \(11x00=110 \vee 11x01 = 111\). By \({\textsf {BT}}_{2}\), we then have \(11x0=11 \), contradicting \({\textsf {BT}}_{3}\). Hence, \(0 \in A_1\). By similar reasoning, we have \(1\in A_1\). We now show that \(A_1\) is closed under \(\circ \). Suppose \(y, y^{\prime } \in A_1\). By \( \textsf {BT}_{1} \), we have

$$\begin{aligned} x(y y^{\prime }) 0 = (xy) y^{\prime } 0 \ \ \text{ and } \ \ x(y y^{\prime }) 1 = (xy) y^{\prime } 1 \ . \end{aligned}$$

Hence, \(x(y y^{\prime }) 0 = 0 \vee x(y y^{\prime }) 1 = 1\) contradicts \(y^{\prime } \in A_1\). Thus, \(y y^{\prime } \in A_1\). Hence, \(A_1\) is closed under \(\circ \).

We let

$$\begin{aligned} B_1 = \lbrace x\in A_1 : \ x0 \ne 0 \wedge x1 \ne 1 \ \rbrace . \end{aligned}$$

Suppose \(00=0 \vee 01=1\). Then, \(1100= 110 \vee 1101=111\). By \({\textsf {BT}}_{2}\), we then have \(110= 11 \), contradicting \({\textsf {BT}}_{3}\). Thus, \(0 \in B_1\). By similar reasoning, we have \(1 \in B_1\). We now show that \(B_1\) is closed under \(\circ \). Suppose \(y, y^{\prime } \in B_1 \). We observe that

$$\begin{aligned} y y^{\prime } 0 = 0 \ \vee \ y y^{\prime } 1 = 1 \end{aligned}$$

contradicts \(y^{\prime } \in A_1\). Hence, \(y y^{\prime } \in B_1\). Thus, \(B_1\) is closed under \(\circ \).

We let

$$\begin{aligned} I_1 = \lbrace y \in B_1 : \ \forall x \in B_1 \; [ \ xy \ne 0 \wedge xy \ne 1 \ ] \ \rbrace \ . \end{aligned}$$

We clearly cannot have \(x1=0 \vee x0=1\) since \(1x1=10 \vee 1x0=11\) would follow, contradicting \({\textsf {BT}}_{3}\). It follows from the definition of \(B_1\) that we have

$$\begin{aligned} \forall x \in B_1 \; [ \ x0 \ne 0 \wedge x1 \ne 1 \ ] \ . \end{aligned}$$

Hence, \(0,1 \in I_1\). We now show that \(I_1\) is closed under \(\circ \). Suppose \(y, y^{\prime } \in I_1\), \( \; x\in B_1\) and \(x (y y^{\prime }) = 0 \vee x (y y^{\prime }) = 1\). Since \(y \in I_1 \subseteq B_1 \) and \(B_1 \) is closed under \(\circ \), we have \(x y \in B_1\). By \( \textsf {BT}_{1}\), we have have \((x y) y^{\prime } = 0 \vee (xy) y^{\prime } = 1\), contradicting \(y^{\prime } \in I_1\). Hence, \(y y^{\prime } \in I_1\). Thus, \(I_1\) is closed under \(\circ \). This means that \(I_1\) satisfies the domain conditions.

We interpret \(\textsf {BT}_{{ \le 1 } } \) in \( \textsf {BT} \! - \! \textsf {BT}_{4} \) by simply relativizing quantification to \(I_1\). Since \(\textsf {BT}_{1} \), \( \textsf {BT}_{2} \) and \( \textsf {BT}_{3} \) are universal sentences that are theorems of \( \textsf {BT} \! - \! \textsf {BT}_{4} \), they hold in \(I_1\). It remains to show that \( \textsf {BT} \! - \! \textsf {BT}_{4} \) proves the translation of

  1. (1)

    \(\forall x \; [ \ x \sqsubseteq _{ {\textsf {s}} } 0 \rightarrow x = 0 \ ] \)

  2. (2)

    \(\forall x \; [ \ x \sqsubseteq _{ {\textsf {s}} } 1 \rightarrow x = 1 \ ] \) .

We only show (1). The case (2) is handled similarly. So, suppose \(x \in I_1\) and the translation of \( x \sqsubseteq _{ {\textsf {s}} } 0\) holds. We then have one of the following cases for some \(z, w \in I_1\):

  1. (i)

    \(0 = x \)

  2. (ii)

    \( 0 = zx \)

  3. (iii)

    \(0 = xw \)

  4. (iv)

    \(0 = (zx) w\)

  5. (v)

    \(0 = z (xw) \).

We notice that (ii)-(v) contradict the definition of \(I_1\). We thus see that the translation of (1) is a theorem of \( \textsf {BT} \! - \! \textsf {BT}_{4} \). Similarly, the translation of (2) is a theorem of \( \textsf {BT} \! - \! \textsf {BT}_{4} \).

Construction of \(I_{n+1}\) :

Let \(\varGamma _n\) denote the set of all nonempty biterals of length at most n. Let \( \sqsubseteq _{ {\textsf {s}} }^{ I_n} \) denote the realization of \( \sqsubseteq _{ {\textsf {s}} } \) in \(I_n \), i.e.

$$\begin{aligned} x \sqsubseteq _{ {\textsf {s}} }^{ I_n} y \equiv \ y = x \vee \exists uv \in I_n \; [ \ y = ux \vee y= xv \vee y = u x v \ ] \ . \end{aligned}$$

We assume \(I_n\) has been constructed and satisfies

$$\begin{aligned} \bigwedge _{ \alpha \in \varGamma _n } \forall x \in I_n \; [ \ x \sqsubseteq _{ {\textsf {s}} }^{ I_n } {\overline{\alpha }} \rightarrow \bigvee _{ \gamma \in \langle ... \alpha \rangle } x= {\overline{\gamma }} \ ] \ . \end{aligned}$$

We let

$$\begin{aligned} I_{n+1}= & {} \lbrace y \in I_n : \bigwedge _{ \alpha \in \varGamma _n } \Big ( \ \forall x \in I_n \; [ \ xy = {\overline{\alpha }} 0 \rightarrow \bigvee _{ \gamma , \delta \in \langle ... \alpha {\varvec{0}} \rangle } ( \ x= {\overline{\gamma }} \wedge y = {\overline{\delta }} \ ) \ ]\\&\wedge \forall x \in I_n \; [ \ xy = {\overline{\alpha }} 1 \rightarrow \bigvee _{ \gamma , \delta \in \langle ... \alpha {\varvec{1}} \rangle } ( \ x= {\overline{\gamma }} \wedge y = {\overline{\delta }} \ ) \ ] \ \Big ) \ \rbrace \ . \end{aligned}$$

By \(\textsf {BT}_{3}\), we cannot have \(x1 = {\overline{\alpha }} 0 \vee x0 = {\overline{\alpha }} 1 \). By \(\textsf {BT}_{2}\), we have

$$\begin{aligned} x0 = {\overline{\alpha }} 0 \ \Rightarrow \ x = {\overline{\alpha }} \end{aligned}$$

and

$$\begin{aligned} x1 = {\overline{\alpha }} 1 \ \Rightarrow \ x = {\overline{\alpha }} \ . \end{aligned}$$

Hence, \(0,1 \in I_{ n+1} \). We now show that \(I_{ n+1} \) is closed under \(\circ \). Suppose \(y_0, y_1 \in I_{ n+1 } \). Let \(x \in I_n \). We observe that \(x y_0 \in I_n \) since \(x, y_0 \in I_n \) and \(I_n\) is closed under \( \circ \). By \( \textsf {BT}_{1} \), we have \( x (y_0 y_1) = (x y_0) y_1 \). Then

$$\begin{aligned} \begin{array}{r c l c c l} x ( y_0 y_1 ) = {\overline{\alpha }} 0 &{} \Rightarrow &{} \bigvee _{ \gamma , \delta \in \langle ... \alpha {\varvec{0}} \rangle } ( \ xy_0= {\overline{\gamma }} \wedge y_1 = {\overline{\delta }} \ ) &{}&{}&{} ( \ y_1 \in I_{ n+1 } \text{ and } xy_0 \in I_n \ ) \\ &{} \Rightarrow &{} \bigvee _{ \zeta , \eta , \delta \in \langle ... \alpha {\varvec{0}} \rangle } ( \ x= {\overline{\zeta }} \wedge y_0 = {\overline{\eta }} \wedge y_1 = {\overline{\delta }} \ ) &{}&{} &{} ( \ x, y_0 \in I_n \text{ and } \gamma \in \varGamma _n \ ) \end{array} \end{aligned}$$

and

$$\begin{aligned} \begin{array}{r c l c c l} x ( y_0 y_1 ) = {\overline{\alpha }} 1 &{} \Rightarrow &{} \bigvee _{ \gamma , \delta \in \langle ... \alpha {\varvec{1}} \rangle } ( \ xy_0= {\overline{\gamma }} \wedge y_1 = {\overline{\delta }} \ ) &{}&{} &{} ( \ y_1 \in I_{ n+1 } \text{ and } xy_0 \in I_n \ ) \\ &{} \Rightarrow &{} \bigvee _{ \zeta , \eta , \delta \in \langle ... \alpha {\varvec{1}} \rangle } ( \ x= {\overline{\zeta }} \wedge y_0 = {\overline{\eta }} \wedge y_1 = {\overline{\delta }} \ ) &{}&{} &{} ( \ x, y_0 \in I_n \text{ and } \gamma \in \varGamma _n \ ) \ . \end{array} \end{aligned}$$

It is not difficult to see that \( \textsf {BT} \! - \! \textsf {BT}_{4} \) proves each instance of the axiom schemas

\(\textsf {WBT}_{1} \):

\( {\overline{\alpha }} \ {\overline{\beta }} = \overline{ \alpha \beta }\)

\(\textsf {WBT}_{2} \):

\( {\overline{\alpha }} \ne {\overline{\beta }} \) if \( \alpha \ne \beta \).

This implies

$$\begin{aligned} x ( y_0 y_1 ) = {\overline{\alpha }} 0 \ \Rightarrow \ \bigvee _{ \beta , \zeta \in \langle ... \alpha {\varvec{0}} \rangle } ( \ x= {\overline{\zeta }} \wedge y_0 y_1 = {\overline{\beta }} \ ) \end{aligned}$$

and

$$\begin{aligned} x ( y_0 y_1 ) = {\overline{\alpha }} 1 \ \Rightarrow \ \bigvee _{ \beta , \zeta \in \langle ... \alpha {\varvec{1}} \rangle } ( \ x= {\overline{\zeta }} \wedge y_0 y_1 = {\overline{\beta }} \ ) \ . \end{aligned}$$

Hence, \(y_0 y_1 \in I_{ n+1} \). Thus, \(I_{n+1}\) is closed under \(\circ \). This proves that \(I_{n+1}\) satisfies the domain conditions. It is clear that we get an interpretation of \(\textsf {BT}_{{ \le n+1} } \) in \( \textsf {BT} \! - \! \textsf {BT}_{4} \) by simply relativizing quantification to \(I_{ n+1} \). \(\square \)

By a similar argument, we see that \( {\textsf {D}} \) is not interpretable in \( {\textsf {C}} \).

Theorem 3

\( {\textsf {D}} \) is not interpretable in \( {\textsf {C}} \).

The theories \( {\textsf {C}} \) and \( \textsf {BT} \) are strictly between \( {\textsf {R}} \) and \( {\textsf {Q}} \) w.r.t. interpretability. A natural question is whether they are mutually interpretable. It is not difficult to see that our interpretation of \( \textsf {WD} \) in \( \textsf {WBT} \) given at the end of Sect. 4 is also an interpretation of \( {\textsf {C}} \) in \( \textsf {BT} \). It is not clear however whether \( \textsf {BT} \) is interpretable in \( {\textsf {C}} \).

Open Problem 1

Is \( \textsf {BT} \) interpretable in \( {\textsf {C}} \)?

4 The theory \(\textsf {WD}\)

We now show that the theory \(\textsf {WD}\) is mutually interpretable with \({\textsf {R}}\). We give an interpretation of \( {\textsf {R}} \) in \( \textsf {WD} \). To show that \( \textsf {WD} \) is interpretable in \( {\textsf {R}} \) we invoke the result of Visser [20]: a recursively enumerable theory is interpretable in \( {\textsf {R}} \) if and only if it is locally finitely satisfiable (each finite sub-theory has a finite model). We also show how to interpret \( \textsf {WD} \) in \( \textsf {WBT } \), from which follows that \( \textsf {WD} \) is interpretable in \({\textsf {R}} \) since \( \textsf {WBT} \) and \( \textsf {WTC}^{{ -\varepsilon } } \) are equivalent, and \( \textsf {WTC}^{{ -\varepsilon } } \) and \( {\textsf {R}} \) are mutually interpretable (see Fig. 2). The theory \(\textsf {WD}\) is purely universally axiomatised, in contrast to \( \textsf {WTC}^{{ -\varepsilon } } \) which has \(\varPi _2\)-axioms.

Definition 1

The first-order theory \(\textsf {WD}\) contains the following non-logical axioms: for each \(\alpha , \beta \in \lbrace {\varvec{0}}, {\varvec{1}} \rbrace ^{+}\)

$$\begin{aligned} \begin{array}{r c l } \textsf {WD}_{1} &{} &{} {\overline{\alpha }} \ {\overline{\beta }} = \overline{ \alpha \beta }\\ \textsf {WD}_{2} &{} &{} {\overline{\alpha }} \ne {\overline{\beta }} \text{ if } \alpha \ne \beta \\ \textsf {WD}_{3} &{} &{} \forall x \; [ \ x \preceq {\overline{\alpha }} \leftrightarrow \bigvee _{ \gamma \preceq ^{ {\mathfrak {D}} } \alpha } x= {\overline{\gamma }} \ ] \ . \end{array} \end{aligned}$$

We start by showing that a theory of arithmetic we call \(\textsf {R}^{-}\) is interpretable in \(\textsf {WD}\). The theory \(\textsf {R}^{-}\) has been shown to be mutually interpretable with \({\textsf {R}}\) (see Jones and Shepherdson [8]).

Definition 2

The first-order theory \(\textsf {R}^{-}\) contains the following non-logical axioms: for each \(n, m \in {\mathbb {N}}\)

$$\begin{aligned} \begin{array}{r c l } \textsf {R}^{{-}_1} &{} &{} {\overline{n}} \times {\overline{m}} = \overline{ n m } \\ \textsf {R}^{{-}_2} &{} &{} {\overline{n}} \ne {\overline{m}} \text{ if } n \ne m \\ \textsf {R}^{{-}_3} &{} &{} \forall x \; [ \ x \le {\overline{n}} \leftrightarrow \bigvee _{ k \le n } x = {\overline{k}} \ ] \ . \end{array} \end{aligned}$$

The most difficult step in interpreting \(\textsf {R}^{-}\) in \(\textsf {WD}\) is translating \(\times \). We start by proving two lemmas which show how we intend to translate \(\times \). In the structure \({\mathfrak {D}}\), we can associate each natural number k with the bit string \( {\varvec{1}}^k {\varvec{0}} \). For \(n \ne 0\) and \(m >1\), we can then translate

$$\begin{aligned} {\overline{n}} \times {\overline{m}} = \overline{n m} \end{aligned}$$

by saying that the sequence

$$\begin{aligned} \left( \overline{ {{1}}^n }, \overline{ {{1}}^{n+n} } , \overline{ {{1}}^{n+n+n} } , \ldots , \overline{ {{1}}^{nm} } \right) \end{aligned}$$

exists. The formulas given in Lemmas 1 and 2 try to capture this way of viewing multiplication. But we have to be careful since we are reasoning in a weak theory. For example, we lack full associativity. For readability, we dispense with parentheses whenever possible and expect terms to be read from right to left. That is, xyz should be regarded as shorthand for (xy)z. Although we do not have full associativity, by \( \textsf {WD}_{1} \), we have \( \textsf {WD} \vdash ( {\overline{\alpha }} {\overline{\beta }} ) {\overline{\gamma }} = {\overline{\alpha }} ( {\overline{\beta }} {\overline{\gamma }} ) \) for all \( \alpha , \beta , \gamma \in \lbrace {\varvec{0}}, {\varvec{1}} \rbrace ^{+}\). Then, by \( \textsf {WD}_{3} \), we have

$$\begin{aligned} \textsf {WD} \vdash \forall x \preceq {\overline{\alpha }} \; \forall y \preceq {\overline{\beta }} \; \forall z \preceq {\overline{\gamma }} \; [ \ (xy)z = x (yz) \ ] \ . \end{aligned}$$

The bounded quantifiers in Lemmas 1 and 2 are there to make full use of the axiom scheme \( \textsf {WD}_{3} \).

Lemma 1

Let \(n \ne 0\) and \(m \ge 2\). Then, \( \textsf {WD} \) proves

  1. (1)

    \( \overline{ {{1}}^{nm} } \preceq \overline{ {{1}}^{nm} } \) and \(\overline{ {{1}}^n } \), \( \overline{ {{1}}^m }\) and \(\overline{ {{1}}^{nm} } \) are the unique elements such that

    1. (a)

      \( \overline{ {{1}}^n } \preceq \overline{ {{1}}^n {{0}} } \ \wedge \ \overline{ {{1}}^n {{0}} } = \overline{ {{1}}^n } 0 \)

    2. (b)

      \( \overline{ {{1}}^m } \preceq \overline{ {{1}}^m {{0}} } \ \wedge \ \overline{ {{1}}^m {{0}} } = \overline{ {{1}}^m } 0 \)

    3. (c)

      \( \overline{ {{1}}^{nm} } \preceq \overline{ {{1}}^{nm} {{0}} } \ \wedge \ \overline{ {{1}}^{nm} {{0}} } = \overline{ {{1}}^{nm} } 0 \).

Furthermore, let

$$\begin{aligned} {\mathfrak {w}} \equiv \ 0 00 \overline{ {{1}}^n } 0 1 00 \overline{ {{1}}^{n+n} } 0 11 00 \overline{ {{1}}^{n+n+n} } 0 111 00 \ldots 00 \overline{ {{1}}^{nm} } 0 \overline{ {{1}}^{m} } 00 \ . \end{aligned}$$

Then, \( {\mathfrak {w}} \) provably (in \(\textsf {WD} \)) satisfies:

  1. (2)

    \(\exists u \preceq {\mathfrak {w}} \; [ \ u 00 \overline{ {{1}}^{nm } } 0 \overline{ {{1}}^{m } } 00 \preceq {\mathfrak {w}} \ ] \)

  2. (3)

    it holds that \( ( \forall w^{ \prime } \preceq {\mathfrak {w}} ) ( \forall y^{ \prime } \preceq \overline{ {{1}}^m } ) ( \forall z^{ \prime } \preceq \overline{ {{1}}^{nm } } ) \ \Psi _0 ( w^{ \prime }, y^{ \prime } , z^{ \prime } ) \) where

    $$\begin{aligned} \Psi _0 ( w^{ \prime }, y^{ \prime } , z^{ \prime } )\equiv & {} \ ( \ y^{ \prime } \ne 1 \ \wedge \ w^{ \prime } 00 z^{ \prime } 0 y^{ \prime } 00 \preceq {\mathfrak {w}} \ ) \rightarrow \\&( \exists w^{ \prime \prime } \preceq {\mathfrak {w}} ) ( \exists y^{ \prime \prime } \preceq y^{ \prime } ) ( \exists z^{ \prime \prime } \preceq \overline{ {{1}}^{nm } } ) \big [ \ \\ z^{ \prime }= & {} z^{ \prime \prime } \overline{ {{1}}^n } \ \wedge \ y^{ \prime } = y^{ \prime \prime } 1 \ \wedge \ w^{ \prime \prime } 00 z^{ \prime \prime } 0 y^{ \prime \prime } 00 \preceq {\mathfrak {w}} \ \wedge \ \Phi _0 ( y^{ \prime \prime }, z^{ \prime \prime } ) \ \big ] \end{aligned}$$

    and

    $$\begin{aligned} \Phi _0 ( y^{ \prime \prime }, z^{ \prime \prime } ) \equiv \ y^{ \prime \prime } = 1 \rightarrow z^{ \prime \prime } = \overline{ {{1}}^n } \ . \end{aligned}$$

Proof

Since we have the axiom schema \(\textsf {WD}_{1}\), we can skip parentheses in \( {\mathfrak {w}} \). Clause (1) holds due to the axiom schemas \(\textsf {WD}_{3}\) and \(\textsf {WD}_{2}\). We verify clause (a). By the right-left implication of \(\textsf {WD}_{3}\), we have \(\overline{ {{1}}^n } \preceq \overline{ {{1}}^n {{0}} } \). By how biterals are defined, we have \( \overline{ {{1}}^n {{0}} } = \overline{ {{1}}^n } 0 \). For uniqueness, suppose

$$\begin{aligned} x \preceq \overline{ {{1}}^n {{0}} } \ \wedge \ x 0 = \overline{ {{1}}^n {{0}} } \ . \end{aligned}$$

By \(\textsf {WD}_{3}\), we have

$$\begin{aligned} x \preceq \overline{ {{1}}^n {{0}} } \rightarrow \bigvee _{ \gamma \preceq ^{ {\mathfrak {D}} } {{1}}^n {{0}} } x = {\overline{\gamma }} \ . \end{aligned}$$

Then, by \(\textsf {WD}_{2}\), in order for \(x 0 = \overline{ {{1}}^n {{0}} } \) to hold we must have \(x = \overline{ {{1}}^n } \). By similar reasoning, one verifies clauses (b) and (c).

Clause (2) holds due to the axiom schema \(\textsf {WD}_{1}\) and the right-left implication of \(\textsf {WD}_{3}\). Let

$$\begin{aligned} u = 0 00 \overline{ {{1}}^n } 0 1 00 \overline{ {{1}}^{n+n} } 0 11 00 \overline{ {{1}}^{n+n+n} } 0 111 00 \ldots 00 \overline{ {{1}}^{n (m-1)} } 0 \overline{ {{1}}^{m-1} } \ . \end{aligned}$$

By the right-left implication of \(\textsf {WD}_{3 }\), we have \(u \preceq {\mathfrak {w}} \). By \(\textsf {WD}_{1}\), we have

$$\begin{aligned} u 00 \overline{ {{1}}^{nm } } 0 \overline{ {{1}}^{m } } 00 = {\mathfrak {w}} \ . \end{aligned}$$

Then, by the right-left implication of \(\textsf {WD}_{3 }\), we have

$$\begin{aligned} u 00 \overline{ {{1}}^{nm } } 0 \overline{ {{1}}^{m } } 00 \preceq {\mathfrak {w}} \ . \end{aligned}$$

To see that clause (3) holds, suppose

$$\begin{aligned} w^{ \prime } \preceq {\mathfrak {w}} \ \wedge \ z^{ \prime } \preceq \overline{ {{1}}^{nm } } \ \wedge \ y^{ \prime } \preceq \overline{ {{1}}^m } \ \wedge \ y^{ \prime } \ne 1 \ \wedge \ w^{ \prime } 00 z^{ \prime } 0 y^{ \prime } 00 \preceq {\mathfrak {w}} \qquad \quad (*) \end{aligned}$$

holds. By the axiom schemas \(\textsf {WD}_{3}\) and \(\textsf {WD}_{2}\), the third and fourth conjunct of (*) imply \( y^{ \prime } = \overline{ {{1}}^{k+1} } \) where \(2 \le k+1 \le m\). Given \( w^{ \prime } \preceq {\mathfrak {w}} \ \wedge \ z^{ \prime } \preceq \overline{ {{1}}^{nm } } \), the axiom schema \(\textsf {WD}_{3}\) gives us a set \(\varGamma \) of pairs (ab) where \(a \preceq {\mathfrak {w}} \) and \(b \preceq \overline{ {{1}}^{nm } } \) such that \(( w^{ \prime } , z^{ \prime } ) \in \varGamma \). For each pair \((a, b) \in \varGamma \), we use \(\textsf {WD}_{1}\) to compute \(a 00 b 0 y^{ \prime } 00 \) and then use the left-right implication of \(\textsf {WD}_{3}\) and \(\textsf {WD}_{2}\) to determine whether \(a 00 b 0 y^{ \prime } 00 \preceq {\mathfrak {w}} \). We are then led to the conclusion

  • \(w^{ \prime } = 0 00 \overline{ {{1}}^n } 0 1 00 \overline{ {{1}}^{n+n} } 0 11 00 \overline{ {{1}}^{n+n+n} } 0 111 00 \ldots 00 \overline{ {{1}}^{nk} } 0 \overline{ {{1}}^{k} } \)

  • \(z^{ \prime } = \overline{ {{1}}^{n(k+1) } } \).

By \(\textsf {WD}_{1}\) and the right-left implication of \(\textsf {WD}_{3}\), we have

  1. (I)

    \(w^{ \prime } = 0 00 \overline{ {{1}}^n } 0 1 00 \overline{ {{1}}^{n+n} } 0 11 00 \overline{ {{1}}^{n+n+n} } 0 111 00 \ldots 00 \overline{ {{1}}^{nk} } 0 \overline{ {{1}}^{k} } \preceq {\mathfrak {w}} \)

  2. (II)

    \(z^{ \prime } = \overline{ {{1}}^{n (k+1) } } = \overline{ {{1}}^{nk } } \ \overline{ {{1}}^{n } }\)

  3. (III)

    \(y^{ \prime } = \overline{ {{1}}^{k+1} } = \overline{ {{1}}^{k} } \ 1 \).

We can then let

$$\begin{aligned} w^{ \prime \prime } = 0 00 \overline{ {{1}}^n } 0 1 00 \overline{ {{1}}^{n+n} } 0 11 00 \overline{ {{1}}^{n+n+n} } 0 111 00 \ldots 00 \overline{ {{1}}^{n (k-1) } } 0 \overline{ {{1}}^{k-1} } \end{aligned}$$

and \( z^{ \prime \prime } = \overline{ {{1}}^{n k } } \ \wedge \ y^{ \prime \prime } = \overline{ {{1}}^{ k } } \). If \(k =1 \), we let \( w^{ \prime \prime } = 0 \). If \( \overline{ {{1}}^{k} } = 1 \), then by \(\textsf {WD}_{2}\) we must have \(k=1\). We thus see that clause (3) holds. \(\square \)

Lemma 2

Let

$$\begin{aligned}&\phi _{ M } (X, Y, Z ) \equiv \ \big ( \ ( \ X = 0 \ \vee \ Y = 0 \ ) \ \wedge \ Z = 0 \ \big ) \ \vee \ \big ( \ Y = 10 \ \wedge \ Z = X \ \big ) \\&\quad \vee \Big ( \ X \ne 0 \ \wedge \ Y \ne 0 \ \wedge \ Y \ne 10 \ \wedge \ \exists x \preceq X \ \exists y \preceq Y \; \exists z \preceq Z \; \exists w \; \\&\qquad \Big [X = x0 \ \wedge \ Y = y0 \ \wedge \ Z= z0 \ \wedge \ z\preceq z \ \ \\&\quad \wedge \exists u \preceq w \; [ \ u 00 z 0 y 00 \preceq w \ ] \\&\quad \wedge ( \forall w^{ \prime } \preceq w ) ( \forall y^{ \prime } \preceq y ) ( \forall z^{ \prime } \preceq z ) \ \Psi ( w^{ \prime }, y^{ \prime } , z^{ \prime } , x, w ) \ \Big ] \ \Big ) \end{aligned}$$

where

$$\begin{aligned}&\Psi ( w^{ \prime }, y^{ \prime } , z^{ \prime } , x, w ) \equiv \ (y^{ \prime } \ne 1 \ \wedge \ w^{ \prime } 00 z^{ \prime } 0 y^{ \prime } 00 \preceq w)\\&\quad \rightarrow ( \exists w^{ \prime \prime } \preceq w ) ( \exists y^{ \prime \prime } \preceq y^{ \prime } ) ( \exists z^{ \prime \prime } \preceq z ) \\&\quad \big [z^{ \prime } = z^{ \prime \prime } x \ \wedge \ y^{ \prime } = y^{ \prime \prime } 1 \ \wedge \ w^{ \prime \prime } 00 z^{ \prime \prime } 0 y^{ \prime \prime } 00 \preceq w \ \wedge \ \Phi ( y^{ \prime \prime }, z^{ \prime \prime } , x ) \ \big ] \end{aligned}$$

and

$$\begin{aligned} \Phi ( y^{ \prime \prime }, z^{ \prime \prime } , x ) \equiv \ y^{ \prime \prime } = 1 \rightarrow z^{ \prime \prime } = x \ . \end{aligned}$$

Then, for each natural number n and m, we have

$$\begin{aligned} \textsf {WD} \vdash \forall Z \; [ \ \phi _{M} ( \overline{ {{1}}^n {{0}} }, \overline{ {{1}}^m {{0}} }, Z) \leftrightarrow Z = \overline{ {{1}}^{nm } {{0}} } \ ] \ . \end{aligned}$$

Proof

We start by showing that

$$\begin{aligned} \textsf {WD} \vdash \forall Z \; [ \ Z = \overline{ {{1}}^{nm } {{0}} } \rightarrow \phi _{M} ( \overline{ {{1}}^n {{0}} }, \overline{ {{1}}^m {{0}} }, Z) \ ] \ . \qquad \quad (**) \end{aligned}$$
  • If \(n= 0\) or \(m= 0\), then the first disjunct in the definition of \( \phi _{ M } (X, Y, Z )\) holds.

  • If \(m= 1\), then the second disjunct in the definition of \( \phi _{ M } (X, Y, Z )\) holds.

  • If \(n \ne 0\) and \(m \ge 2\), then by \(\textsf {WD}_{1}\) and \(\textsf {WD}_{2}\) we have

    $$\begin{aligned} \overline{ {{1}}^n } 0 = \overline{ {{1}}^n {{0}} } \ne 0 \ \wedge \ \overline{ {{1}}^m } 0 = \overline{ {{1}}^m {{0}} } \ne 0 \ \wedge \ \overline{ {{1}}^m {{0}} } \ne 1 0 \ . \end{aligned}$$

    Then, by Lemma 1, the third disjunct in the definition of \( \phi _{ M } (X, Y, Z )\) holds since we can choose

    $$\begin{aligned} w= 000 \overline{ {{1}}^n } 0 1 00 \overline{ {{1}}^{n+n} } 0 11 00 \ldots 00 \overline{ {{1}}^{nm} } 0 \overline{ {{1}}^{m} } 00 \ . \end{aligned}$$

Thus, (**) holds.

We now prove that

$$\begin{aligned} \textsf {WD} \vdash \forall Z \; [ \ \phi _{M} ( \overline{ {{1}}^n {{0}} }, \overline{ {{1}}^m {{0}} }, Z) \rightarrow Z = \overline{ {{1}}^{nm } {{0}} } \ ] \ . \qquad \quad (***) \end{aligned}$$

So, suppose \(\phi _{M} ( \overline{ {{1}}^n {{0}} } , \overline{ {{1}}^m {{0}} }, Z)\) holds. If \(n=0\) or \(m \le 1\), then it follows from the definition of \(\phi _{M} ( \overline{ {{1}}^n {{0}} } , \overline{ {{1}}^m {{0}} }, Z)\) that \(Z = \overline{ {{1}}^{nm} {{0}} }\). We therefore assume that \(n\ne 0\) and \(m > 1\). Then, there exist \( x \preceq \overline{ {{1}}^n {{0}} } \), \( y_1 \preceq \overline{ {{1}}^m {{0}} } \) and \( z_1 \preceq Z \) such that

$$\begin{aligned} \overline{ {{1}}^n {{0}} } = x 0 \wedge \overline{ {{1}}^m {{0}} } = y_1 0 \wedge Z = z_1 0 \wedge z_1 \preceq z_1 \ . \end{aligned}$$

By \(\textsf {WD}_{3}\) and \(\textsf {WD}_{2}\), we have

$$\begin{aligned} x = \overline{ {{1}}^{n} } \wedge y_1 = \overline{ {\varvec{1}}^{m} } \ . \end{aligned}$$

Furthermore, by the definition of \(\phi _{M} ( \overline{ {{1}}^n {{0}} } , \overline{ {{1}}^m {{0}} }, Z)\), there exist w and \(w_1 \preceq w \) such that

$$\begin{aligned} w_1 00 z_1 0 y_1 00 \preceq w \ . \end{aligned}$$

By \(\textsf {WD}_{1}\), it suffices to show that \(z_1 = \overline{ {{1}}^{nm} }\) to conclude that \(Z = \overline{ {{1}}^{nm} {{0}} }\). By the axiom schema \( \textsf {WD}_{2}\), we have \( \overline{ {{1}}^{m } } \ne 1\) since \(m>1\). By the axiom schema \( \textsf {WD}_{3}\), we have \( \overline{ {{1}}^{m } } \preceq \overline{ {{1}}^{m } } \). Then, \( \Psi ( w_1, y_1 , z_1, x, w ) \) in the definition of \(\phi _{M} ( \overline{ {{1}}^n {{0}} } , \overline{ {{1}}^m {{0}} }, Z)\) implies that there exist \( w_{ 2 } \preceq w \), \(y_{ 2 } \preceq y_1 \) and \( z_{ 2 } \preceq z_1 \) such that

$$\begin{aligned} z_1 = z_{ 2 } \overline{ {{1}}^{n } } \ \wedge \ y_1 = y_{ 2 } 1 \ \wedge \ w_{ 2 } 00 z_{2 } 0 y_{ 2 } 00 \preceq w \ . \end{aligned}$$

By the axiom schemas \(\textsf {WD}_{3}\) and \(\textsf {WD}_{2}\), we must have

$$\begin{aligned} y_{ 2} = \overline{ {{1}}^{m -1 } } \ . \end{aligned}$$

If \(m-1 = 1\), then \( \Phi ( y_2 , z_2 , x ) \) in the definition of \(\phi _{M} ( \overline{ {{1}}^n {{0}} } , \overline{ {{1}}^m {{0}} }, Z)\) implies \( z_{ 2 } = \overline{ {{1}}^{n } } \). By \(\textsf {WD}_{1}\), this implies

$$\begin{aligned} z_1 = z_2 \overline{ {{1}}^{n } } = \overline{ {{1}}^{n } } \ \overline{ {{1}}^{n } } = \overline{ {{1}}^{n+n} } = \overline{ {{1}}^{nm} } \end{aligned}$$

since \(m=2\). If \(m-1 \ne 1\), we use \( \Psi ( w_2 , y_2 , z_2 , x, w ) \) to conclude that there exist

$$\begin{aligned} w_{ 3 } \preceq w , \ y_{ 3 } \preceq y_2 \text{ and } z_{3 } \preceq z_1 \end{aligned}$$

such that

$$\begin{aligned} z_{ 2} = z_{3 } \overline{ {{1}}^{n } } \ \wedge \ y_2 = y_{ 3 } 1 \ \wedge \ w_{ 3 } 00 z_{ 3 } 0 y_{3 } 00 \preceq w \ . \end{aligned}$$

By the axiom schemas \(\textsf {WD}_{3}\) and \(\textsf {WD}_{2}\), we must have

$$\begin{aligned} y_{ 3} = \overline{ {{1}}^{m -2 } } \ . \end{aligned}$$

If \(m-2 = 1\), then by \( \varPhi ( y_3 , z_3 , x ) \) we have \( z_{ 3 } = \overline{ {{1}}^{n } } \). By \(\textsf {WD}_{1}\), this implies

$$\begin{aligned} z_1 = z_2 \overline{ {{1}}^{n } } = ( \ z_3 \overline{ {{1}}^{n } } \ ) \overline{ {{1}}^{n } } = \overline{ {{1}}^{n } } \ \overline{ {{1}}^{n } } \ \overline{ {{1}}^{n } } = \overline{ {{1}}^{n+n+n} } = \overline{ {{1}}^{nm} } \end{aligned}$$

since \(m=3\). If \(m-2 \ne 1\), we use \( \Psi ( w_3, y_3 , z_3 , x, w ) \) to repeat this procedure. We notice that after a finite number of steps, we have that there exist \( w_{ m } \preceq w \), \( y_{ m } \preceq y_{ m-1} \) and \( z_{ m } \preceq z_1\) such that

$$\begin{aligned} z_{m-1} = z_{ m} \overline{ {{1}}^{n } } \ \wedge \ y_{ m-1} = \overline{ {{1}}^{m-(m-2) } } \wedge y_{m-1} = y_{ m } 1 \ \wedge \ w_{m } 00 z_{ m } 0 y_{m } 00 \preceq w \ . \end{aligned}$$

By the axiom schemas \(\textsf {WD}_{3}\) and \(\textsf {WD}_{2}\), we must have

$$\begin{aligned} y_{ m} = 1 \ . \end{aligned}$$

The condition \(\Phi ( y_m, z_m , x ) \) in \( \Psi ( w_{ m-1 }, y_{ m-1} , z_{ m-1 } , x , w ) \) then implies

$$\begin{aligned} z_{m} = \overline{ {{1}}^{n } } \ . \end{aligned}$$

By backtracking, we observe that

$$\begin{aligned} z_1 = \overline{ {{1}}^{n } } \; \overline{ {{1}}^{n } } \ldots \overline{ {{1}}^{n } } \end{aligned}$$

where \(\overline{ {{1}}^{n } }\) occurs m times. It then follows from the axiom schema \(\textsf {WD}_{1}\) that \(z_1 = \overline{{{1}}^{nm} } \). Thus, by \(\textsf {WD}_{1}\), we have \(Z = \overline{{{1}}^{nm} {{0}} } \). Hence, (***) holds. \(\square \)

Theorem 4

\( \textsf {R}^{-}\) is interpretable in WD.

Proof

We choose the domain \( \delta (x) \equiv \ x=x \). We translate, 0, \( \mathrm {S} \), \(\times \) and \(\le \) as follows:

$$\begin{aligned} \phi _{0} (x)&\equiv \ x = 0\\ \phi _{ \mathrm {S} } (x, y )&\equiv \ y = 1 x\\ \phi _{ \times } (x, y, z )&\equiv \ ( \ \exists ! w \; [ \ \phi _{M} (x, y, w) \ ] \ \wedge \phi _{M} (x, y, z) \ ) \vee \\&\qquad ( \ \lnot \exists ! w \; [ \ \phi _{M} (x, y, w) \ ] \ \wedge z=0 \ )\\ \phi _{ \le } (x, y )&\equiv \ x = 0 \vee \exists z \; \exists w \preceq y \; [ \ x = z0 \wedge y = w0 \wedge z \preceq w \ ] \ . \end{aligned}$$

The formula \(\phi _{M} \) is defined in Lemma 2. By the axiom schema \(\textsf {WD}_{1}\), our translation of \( \mathrm {S} \) implies that we translate each numeral \({\overline{n}}\) as the biteral \( \overline{{{1}}^{ n} {{0}} } \). By the axiom schema \(\textsf {WD}_{2}\), the translation of each instance of the axiom schema \( \textsf {R}^{{-}_{2} }\) is a theorem of WD.

We now show that the translation of each instance of the axiom schema \( \textsf {R}^{{-}_{1} }\) is a theorem of WD. By Lemma 2, for each natural number n and m, we have

$$\begin{aligned} \textsf {WD} \vdash \exists ! w \; [ \ \phi _{ M } ( \overline{ {{1}}^n {{0}} }, \overline{ {{1}}^m {{0}} }, w \ ] \wedge \phi _{ M } ( \overline{ {{1}}^n {{0}} }, \overline{ {{1}}^m {{0}} }, \overline{ {{1}}^{nm } {{0}} } ) \ . \end{aligned}$$

Hence

$$\begin{aligned} \textsf {WD} \vdash \phi _{ \times } ( \overline{ {{1}}^n {{0}} }, \overline{ {{1}}^m {{0}} }, \overline{ {{1}}^{nm } {{0}} } ) \ . \end{aligned}$$

Thus, the translation of each instance of the axiom schema \( \textsf {R}^{{-}_{1} }\) is a theorem of WD. We observe that \(\textsf {WD} \) can have models where not all elements in the universe are realization of terms of the form \(\overline{ {{1}}^k {{0}} }\). Hence, the condition

$$\begin{aligned} \lnot \exists ! w \; [ \ \phi _{M} (x, y, w) \ ] \ \wedge z=0 \end{aligned}$$

in the definition of \(\phi _{ \times } (x,y,z) \) ensures that \(\times \) is translated as a total function.

We now show that the translation of each instance of the axiom schema \( \textsf {R}^{{-}_{3} }\) is a theorem of WD. We have

$$\begin{aligned} \begin{array}{l c l c l } \phi _{ \le } (x, \overline{{{1}}^{ n} {{0}} } ) &{} \Leftrightarrow &{} x = 0 \vee \exists z \; \exists w \preceq \overline{{{1}}^{ n} {{0}} } \; [ \ x = z0 \wedge \overline{{{1}}^{ n} {{0}} } = w0 \wedge z \preceq w \ ] \\ &{} \Leftrightarrow &{} x = 0 \vee \exists z \; [ \ x = z0 \wedge z \preceq \overline{{{1}}^{ n} } \ ] &{} &{} ( \text{ by } \textsf {WD}_{3} , \textsf {WD}_{2} ) \\ &{} \Leftrightarrow &{} x = 0 \vee \bigvee _{ \gamma \preceq ^{ {\mathfrak {D}} } {{1}}^{ n} } x = {\overline{\gamma }} 0 &{} &{} ( \text{ by } \textsf {WD}_{3} ) \\ &{} \Leftrightarrow &{} x = 0 \vee \bigvee _{ \gamma \preceq ^{ {\mathfrak {D}} } {{1}}^{ n} } x = \overline{ \gamma {{0}} } \ . \end{array} \end{aligned}$$

The last equivalence is due to how biterals are defined. We thus see that the translation of each instance of the axiom schema \( \textsf {R}^{{-}_{3} }\) is a theorem of WD. \(\square \)

Theorem 5

\(\textsf {WD}\) and \({\textsf {R}}\) are mutually interpretable.

Proof

We have shown that \(\textsf {R}^{-}\) is interpretable in \(\textsf {WD}\). Since \(\textsf {R}^{-} \) and \({\textsf {R}}\) are mutually interpretable, \( {\textsf {R}} \) is interpretable in \( \textsf {WD} \). To see that \( \textsf {WD} \) is interpretable in \( {\textsf {R}} \), we first observe that \( \textsf {WD} \) is locally finitely satisfiable, i.e., any finite subset of the axioms has a finite model. In [20], Albert Visser shows that a recursively enumerable theory is interpretable in \( {\textsf {R}} \) if and only if it is locally finitely satisfiable. Hence, \( \textsf {WD} \) is interpretable in \( {\textsf {R}} \). Thus, \(\textsf {WD}\) and \({\textsf {R}}\) are mutually interpretable. \(\square \)

We could have shown that \( \textsf {WD} \) is interpretable in \( {\textsf {R}} \) by showing that it is interpretable in \( \textsf {WTC}^{{ -\varepsilon } } \), which we know is mutually interpretable with \( {\textsf {R}} \). Since we in our Master‘s thesis [12] show that \(\textsf {WBT} \) and \( \textsf {WTC}^{{ -\varepsilon } } \) are equivalent, it suffices to show how to interpret \(\textsf {WD}\) in \(\textsf {WBT}\). We choose the domain \( \delta (x) \equiv x=x\). We translate 0, 1, \(\circ \) and \(\preceq \) as follows:

$$\begin{aligned} \phi _{0} (x)&\equiv \ x = 0\\ \phi _{1} (x)&\equiv \ x = 1\\ \phi _{\circ } (x, y, z )&\equiv \ z = xy\\ \phi _{ \preceq } (x, y )&\equiv \ y = x \vee \exists z \; [ \ y = xz \ ] \ . \end{aligned}$$

Clearly, the translation of each instance of the axiom schemas \(\textsf {WD}_{1}\) and \(\textsf {WD}_{2}\) is a theorem of \(\textsf {WBT} \). We now show that the translation of each instance of the axiom schema \(\textsf {WD}_{3}\) is a theorem of \(\textsf {WBT} \). We have

$$\begin{aligned} \begin{array}{l c l c l } \phi _{ \preceq } (x, {\overline{\alpha }} ) &{} \Leftrightarrow &{} {\overline{\alpha }} = x \vee \exists z \; [ \ {\overline{\alpha }} = x z \ ] \\ &{} \Leftrightarrow &{} {\overline{\alpha }} = x \vee \exists z \; [ \ x \sqsubseteq _{ {\textsf {s}} } {\overline{\alpha }} \wedge z \sqsubseteq _{ {\textsf {s}} } {\overline{\alpha }} \wedge {\overline{\alpha }} = x z \ ] &{} &{} ( \text{ by } \text{ def. } \text{ of } \sqsubseteq _{ {\textsf {s}} } \ ) \\ &{} \Leftrightarrow &{} {\overline{\alpha }} = x \vee \\ &{}&{} \bigvee _{ \beta , \gamma \in \langle ... \alpha \rangle } ( \ x= {\overline{\beta }} \wedge z = {\overline{\gamma }} \wedge {\overline{\beta }} \ {\overline{\gamma }} = {\overline{\alpha }} \ ) &{} &{} ( \textsf {WBT}_{3} ) \\ &{} \Leftrightarrow &{} \bigvee _{ \beta \preceq ^{ {\mathfrak {D}} } \alpha } x = {\overline{\beta }} &{} &{} ( \textsf {WBT}_{1}, \ \textsf {WBT}_{2} ) \ . \end{array} \end{aligned}$$

We thus see that the translation of each instance of the axiom schema \( \textsf {WD}_{3}\) is a theorem of WBT. Hence, \( \textsf {WD} \) is interpretable in \( \textsf {WBT} \).

5 The theory \({\textsf {D}}\)

We now show that the theory \({\textsf {D}}\) is mutually interpretable with \({\textsf {Q}}\). In contrast to \( {\textsf {Q}} \) and \( {\textsf {TC}^{ -\varepsilon } } \), which have \(\varPi _2\)-axioms, the theory \( {\textsf {D}} \) is purely universally axiomatised. When interpreting one theory into another, handling existential quantifiers can become cumbersome. This is clearly illustrated in the proof of Theorem  6. Therefore, having a theory with purely universal axiomatization and that is mutually interpretable with \( {\textsf {Q}} \) could be advantageous in some circumstances.

Definition 3

The first-order theory D is defined by the following non-logical axioms:

$$\begin{aligned} \begin{array}{r c l } {\textsf {D}}_{1} &{} &{} \forall x y z \; [ \ (xy) z = x (yz) \ ] \\ {\textsf {D}}_{2} &{}&{} \forall x y \; [ \ x \ne y \rightarrow ( \ x0 \ne y0 \wedge x1 \ne y1 \ ) \ ] \\ {\textsf {D}}_{3} &{}&{} \forall x y \; [ \ x0 \ne y1 \ ] \\ {\textsf {D}}_{4} &{} &{} \forall x \; [ \ x \preceq 0 \leftrightarrow x=0 \ ] \\ {\textsf {D}}_{5} &{} &{} \forall x y \; [ \ x \preceq y 0 \leftrightarrow ( \ x = y 0 \vee x \preceq y \ ) \ ] \\ {\textsf {D}}_{6} &{} &{} \forall x y \; [ \ x \preceq y 1 \leftrightarrow ( \ x = y 1 \vee x \preceq y \ ) \ ] \ . \end{array} \end{aligned}$$

For \( {\textsf {D}} \) to be an extension of \( \textsf {WD} \), we need the axiom \( \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \). The theory \({\textsf {D}} \) extended with this axiom is what we call \( {\textsf {D}} \) at page3. The next lemma shows why we have decided to not include this axiom. We could very well have replaced \( {\textsf {D}}_{4} \) by \( \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \). The proof of the next lemma also illustrates some of the advantages of not having to worry about existential quantifiers when defining a domain.

Lemma 3

\( {\textsf {D}} + \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \) is interpretable in \({\textsf {D}} \).

Proof

We translate 0, 1, \(\circ \) and \(\preceq \) as

$$\begin{aligned} \phi _{0} (x)&\equiv \ x = 0100\\ \phi _{1} (x)&\equiv \ x = 0110\\ \phi _{ \preceq } (x, y )&\equiv \ x \preceq y\\ \phi _{\circ } (x, y, z )&\equiv xy = z \ . \end{aligned}$$

We choose the domain

$$\begin{aligned} I = \lbrace x : \ x=0100 \vee x= 0110 \vee \exists y \; [ \ x = y 0100 \vee x= y0110 \ ] \rbrace \ . \end{aligned}$$

Clearly, I contains 0100, 0110 and is closed under \(\circ \). We use \( {\textsf {D}}_{1} \) when showing that I is closed under \(\circ \). We now proceed to show that the translation of each non-logical axiom of \( {\textsf {D}} + \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \) is a theorem of \({\textsf {D}} \).

By \({\textsf {D}}_{1} \) and the definition of \(\phi _{\circ } (x, y, z ) \), the translation of \({\textsf {D}}_{1} \) is a theorem of \( {\textsf {D}} \). The translation of \( {\textsf {D}}_{2} \) is a theorem of \({\textsf {D}} \) since by \({\textsf {D}}_{2} \) we have

$$\begin{aligned} x 0100 = y 0100 \ \vee \ x 0110 = y 0110 \ \ \Rightarrow \ \ x=y \ . \end{aligned}$$

The translation of \( {\textsf {D}}_{3} \) is also a theorem of \({\textsf {D}} \) since by \({\textsf {D}}_{2} \) we have

$$\begin{aligned} x 0100 = y 0110 \ \ \Rightarrow \ \ x 010 = y 011 \end{aligned}$$

and \( x 010 = y 011 \) contradicts \( {\textsf {D}}_{3} \).

We now show that the translation of \( {\textsf {D}}_{4} \) is a theorem of \({\textsf {D}} \). We have

$$\begin{aligned} \begin{array}{r c l c c l } x \preceq 0100 &{} \Leftrightarrow &{} x = 0100 \vee x=010 \vee x \preceq 01 &{} &{} &{} ( \text{ by } {\textsf {D}}_{5} \ ) \\ &{} \Leftrightarrow &{} x = 0100 \vee x=010 \vee x = 01 \vee x \preceq 0 &{} &{} &{} ( \text{ by } {\textsf {D}}_{6} \ ) \\ &{} \Leftrightarrow &{} x = 0100 \vee x=010 \vee x = 01 \vee x = 0 &{} &{} &{} ( \text{ by } {\textsf {D}}_{4} \ ) \ . \end{array} \end{aligned}$$

We need to show that we cannot have

$$\begin{aligned} x=010 \vee x = 01 \vee x = 0 \ . \end{aligned}$$

Since \(x \in I\), by the definition I, we have that \(x = z 0100 \vee x= z 0110 \) where z could possibly be empty (\(x = zu \) with z empty means \(x= u\) since we do not have an empty string in \( {\textsf {D}} \)). We have

$$\begin{aligned} z 0100= & {} 010 \vee z 0100 =01 \vee z 0100 =0\\&\Downarrow \\ z 0100= & {} 010 \vee z 0100 =01 \vee 11z 0100 = 110\\&\Downarrow ( \text{ by } {\textsf {D}}_{2} \ )\\ z 010= & {} 01 \vee z 0100 =01 \vee 11z 010 = 11 \end{aligned}$$

contradicting \( {\textsf {D}}_{3} \). We also have

$$\begin{aligned} z 0110= & {} 010 \vee z 0110 =01 \vee z 0110 =0\\&\Downarrow \\ 0 z 0110= & {} 0010 \vee z 0110 =01 \vee 00z 0110 = 000\\&\Downarrow ( \text{ by } {\textsf {D}}_{2} \ )\\ 0z 01= & {} 00 \vee z 0100 =01 \vee 00z 011 = 00 \end{aligned}$$

contradicting \( {\textsf {D}}_{3} \). Hence, \( x \preceq 0100 \) if and only if \( x = 0100 \). Thus, the translation of \( {\textsf {D}}_{4} \) is a theorem of \({\textsf {D}} \). By similar reasoning, the translation of \( \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \) is a theorem of \({\textsf {D}} \).

We now show that the translation of \( {\textsf {D}}_{5} \) is a theorem of \({\textsf {D}} \). By \( {\textsf {D}}_{5} \), we have

$$\begin{aligned} x \preceq y 0100 \Leftrightarrow x = y 0100 \vee x=y 010 \vee x \preceq y 01 \ . \end{aligned}$$

By \( {\textsf {D}}_{6} \), we have

$$\begin{aligned} x \preceq y 0100 \ \Leftrightarrow \ x = y 0100 \vee x=y 010 \vee x = y 01 \vee x \preceq y 0 \ . \end{aligned}$$

By \( {\textsf {D}}_{5} \), we have

$$\begin{aligned} x \preceq y 0100 \ \Leftrightarrow \ x = y 0100 \vee x=y 010 \vee x = y 01 \vee x = y0 \vee x \preceq y \ . \end{aligned}$$

We need to show that we cannot have

$$\begin{aligned} x=y 010 \vee x=y 01 \vee x=y 0 \ . \end{aligned}$$

Since \(x, y \in I\), by the definition I, we have

$$\begin{aligned} x = z 0100 \ \vee \ x= z 0110 \text{ and } y = w 0100 \ \vee \ y = w 0110 \end{aligned}$$

where z and w could possibly be empty. Reasoning as in the preceding paragraph shows that \( x=y 010 \vee x=y 01 \vee x=y 0 \) leads to a contradiction. Hence,

$$\begin{aligned} x \preceq y 0100 \Leftrightarrow x = y 0100 \vee x \preceq y \ . \end{aligned}$$

Thus, the translation of \( {\textsf {D}}_{5} \) is a theorem of \({\textsf {D}} \). By similar reasoning, the translation of \( {\textsf {D}}_{6} \) is a theorem of \({\textsf {D}} \).

Since \({\textsf {D}} \) proves the translation of each axiom of \( {\textsf {D}} + \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \), it follows that \( {\textsf {D}} + \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \) is interpretable in \({\textsf {D}} \). \(\square \)

We now proceed to show that \( {\textsf {Q}} \) and \( {\textsf {D}} \) are mutually interpretable. We do this indirectly by showing that \( {\textsf {D}} \) is mutually interpretable with the theory \( \textsf {BTQ} \) which we have seen is mutually interpretable with \( {\textsf {Q}} \).

Theorem 6

\(\textsf {BTQ}\) is interpretable in \({\textsf {D}}\).

Proof

By Lemma 3, it suffices to show that \( \textsf {BTQ} \) is interpretable in \( {\textsf {D}} + {\textsf {D}}_{4}^{ \prime } \) where

$$\begin{aligned} {\textsf {D}}_{4}^{ \prime } \equiv \ \forall x \; [ \ x \preceq 1 \leftrightarrow x = 1 \ ] \ . \end{aligned}$$

We also observe that \({\textsf {D}}\) proves the axioms \(\textsf {BTQ}_{1}\), \(\textsf {BTQ}_{2}\) and \(\textsf {BTQ}_{3}\) (the axioms are identical with \( {\textsf {D}}_{1} \), \( {\textsf {D}}_{2} \), \( {\textsf {D}}_{3} \)). So, to translate \(\textsf {BTQ}\) in \({\textsf {D}} + {\textsf {D}}_{4}^{ \prime } \), we simply define a domain K such that the axiom

$$\begin{aligned} \textsf {BTQ}_{4} \equiv \ \forall x \; [ \ x =0 \vee x=1 \vee \exists y \; [ \ x=y0 \vee x= y1 \ ] \ ] \end{aligned}$$

holds restricted to K. Before defining K, we define auxiliary classes \(A \supseteq B \supseteq C \supseteq I \supseteq J \supseteq K\). We need to ensure that there is \(y \in K \) such that \(x = y0 \) or \(x = y1 \) if \(x \in K \) and \(x \ne 0, 1 \). We do this by relying on \( \preceq \). The idea is to first let I be such that if \(x \in I \) and \(x \ne 0, 1 \); then we can find \(y \preceq x \) such that \(x = y0 \) or \(x = y1 \). It will not necessarily be the case that \(y \in I \). What we then do is to restrict I so that we have a subclass K that is downward closed under \(\preceq \), that is, \(x \in K\) and \( y \preceq x \) implies \(y \in K \). Since K is a subclass of I, this immediately ensures that \( \textsf {BTQ}_{4} \) holds in K. We realize that in order for I to be closed under \( \circ \), it is useful if for all \(x_0 , x_1 \in I \), we have that \(x_0 \preceq x_0 x_1 \) and that \( y \preceq x_1 \) implies \(x_0 y \preceq x_0 x_1 \). We therefore let I be a subclass of a class C with this property.

We let

$$\begin{aligned} A = \lbrace y : \ \forall x \; [ \ xy \preceq xy \ ] \rbrace \ . \end{aligned}$$

By \({\textsf {D}}_{5}\) and \({\textsf {D}}_{6}\), we have \( {\textsf {D}} \vdash \forall x \; [ \ x0 \preceq x0 \wedge x1 \preceq x1 \ ] \). Hence, \(0,1 \in A\). We now show that A is closed under \(\circ \). Suppose \(y_0, y_1 \in A\). Since \(y_1 \in A\), we have

$$\begin{aligned} \forall x \; [ \ (x y_0) y_1 \preceq (x y_0) y_1 \ ] \ . \end{aligned}$$

By \({\textsf {D}}_{1}\), we then have \(\forall x \; [ \ x ( y_0 y_1) \preceq x (y_0 y_1) \ ] \). Hence, \(y_0 y_1 \in A\). Thus, A is closed under \(\circ \).

We let

$$\begin{aligned} B = \lbrace x \in A : \ x \preceq x \rbrace \ . \end{aligned}$$

By \({\textsf {D}}_{4} \) and \( {\textsf {D}}_{4}^{ \prime } \), we have \({\textsf {D}} \vdash 0 \preceq 0 \wedge 1 \preceq 1 \). Hence, \(0, 1 \in B\). We now show that B is closed under \(\circ \). Suppose \(x_0, x_1 \in B\). Since \(x_1 \in B \subseteq A\), we have \(x_0 x_1 \preceq x_0 x_1\). Hence, \(x_0 x_1 \in B\). Thus, B is closed under \(\circ \).

We let

$$\begin{aligned} C = \lbrace z \in B : \ \forall x y \; [ \ x \preceq y z \leftrightarrow ( \ x \preceq y \vee \exists u \preceq z \; [ \ x = y u \ ] \ ) \ ] \ \rbrace \ . \end{aligned}$$

By \({\textsf {D}}_{5}\) and \({\textsf {D}}_{4}\), we have

$$\begin{aligned} x \preceq y0 \&\Leftrightarrow \ x = y0 \vee x \preceq y\\&\Leftrightarrow \ x \preceq y \vee \exists u \preceq 0 \; [ \ x = yu \ ] \ . \end{aligned}$$

Similarly, by \({\textsf {D}}_{6}\) and \( {\textsf {D}}_{4}^{ \prime } \), we have

$$\begin{aligned} x \preceq y1 \&\Leftrightarrow \ x = y1 \vee x \preceq y\\&\Leftrightarrow \ x \preceq y \vee \exists u \preceq 1 \; [ \ x = yu \ ] \ . \end{aligned}$$

Hence, \(0,1 \in C\). Next we show that C is closed under \(\circ \). Suppose \(z_0, z_1 \in C\). Then

$$\begin{aligned} \begin{array}{l c l c l } x \preceq y(z_0 z_1) &{} \Leftrightarrow &{} x \preceq ( y z_0 ) z_1 &{} &{} ( {\textsf {D}}_{1} ) \\ &{} \Leftrightarrow &{} x \preceq y z_0 \vee \exists u_1 \preceq z_1 \; [ \ x = (y z_0) u_1 \ ] &{} &{} ( z_1 \in C ) \\ &{} \Leftrightarrow &{} x \preceq y \vee \exists u_0 \preceq z_0 \; [ \ x = y u_0 \ ] \vee \\ &{} &{} \exists u_1 \preceq z_1 \; [ \ x = (y z_0) u_1 \ ] &{} &{} ( z_0 \in C ) \\ &{} \Leftrightarrow &{} x \preceq y \vee \exists u_0 \preceq z_0 \; [ \ x = y u_0 \ ] \vee \\ &{} &{} \exists u_1 \preceq z_1 \; [ \ x = y ( z_0 u_1) \ ] &{} &{} ( {\textsf {D}}_{1} ) \\ &{} \Leftrightarrow &{} x \preceq y \vee \exists u \preceq z_0 z_1 \; [ \ x = y u \ ] &{} &{} ( z_1 \in C ) \ . \end{array} \end{aligned}$$

We justify the last equivalence as follows:

(\( \Rightarrow \)) :

Suppose \(u_0 \preceq z_0 \wedge x = y u_0 \). Since \(z_1 \in C\) and \(u_0 \preceq z_0 \), the right-left implication in the definition of C tells us that \(u_0 \preceq z_0 z_1 \). We can thus let \(u = u_0 \). Suppose now \(u_1 \preceq z_1 \wedge x = y (z_0 u_1) \). Since \(u_1 \preceq z_1 \) and \(z_1 \in C\), the right-left implication in the definition of C tells us that \( z_0 u_1 \preceq z_0 z_1 \). We can thus let \(u = z_0 u_1 \).

(\( \Leftarrow \)) :

Suppose \( u \preceq z_0 z_1 \wedge x = yu \). Since \(z_1 \in C\), by the left-right implication in the definition of C, we have that \(u \preceq z_0 \) or there exists \(u^{ \prime } \preceq z_1 \) such that \(u = z_0 u^{ \prime } \). If \(u \preceq z_0 \), then \(u_0 \preceq z_0 \wedge x = y u_0 \) by setting \(u_0 = u \). If \(u^{ \prime } \preceq z_1 \wedge u = z_0 u^{ \prime } \), then \(u_1 \preceq z_1 \wedge x = y (z_0 u_1 ) \) by setting \(u_1 = u^{ \prime } \).

Hence, \(z_0 z_1 \in C\). Thus, C is closed under \(\circ \).

We let

$$\begin{aligned} I = \lbrace x \in C : \ x = 0 \vee x=1 \vee \exists y \preceq x \; [ \ x= y0 \vee x=y1 \ ] \ \rbrace \ . \end{aligned}$$

By definition of I, we have \(0,1 \in I\). We now show that I is closed under \(\circ \). Suppose \(x_0, x_1 \in I\). Since \(x_1 \in I\), we have one of the following cases:

  1. (1)

    \(x_1 = 0 \vee x_1 = 1\)

  2. (2)

    \(x_1 = y_1 0 \vee x_1 = y_1 1\) for some \(y_1 \preceq x_1\).

We first consider (1). Since \(x_0 \in C \subseteq B\), we have \(x_0 \preceq x_0\). Since \(x_1 \in C\), this implies \(x_0 \preceq x_0 x_1\). Hence, \(\exists y \preceq x_0 x_1 \; [ \ x_0 x_1 = y 0 \vee x_0 x_1 = y 1 \ ] \). We now consider (2). We have \(\exists y_1 \preceq x_1 \; [ \ x_0 x_1 =( x_0 y_1 ) 0 \vee x_0 x_1 =( x_0 y_1 ) 1 \ ] \). Since \(x_1 \in C\), this implies

$$\begin{aligned} \exists y \preceq x_0 x_1 \; [ \ x_0 x_1 = y 0 \vee x_0 x_1 = y 1 \ ] \ . \end{aligned}$$

We thus see that both cases imply \(x_0 x_1 \in I\). Hence, I is closed under \(\circ \).

We let

$$\begin{aligned} J = \lbrace v \in I : \ \forall x \preceq v \; [ \ x \in I \ ] \ \rbrace \ . \end{aligned}$$

By \({\textsf {D}}_{4} \) and \( {\textsf {D}}_{4}^{ \prime } \), we have \({\textsf {D}} \vdash \forall x \preceq 0 \; [ \ x = 0 \ ] \wedge \forall x \preceq 1 \; [ \ x = 1 \ ] \). Hence, \(0,1 \in J\). We now show that J is closed under \(\circ \). Suppose, \(v_0, v_1 \in J\) and \(x \preceq v_0 v_1 \). Since \(v_1 \in C\), we have one of the following:

  1. (a)

    \(x \preceq v_0 \)

  2. (b)

    \(x = v_0 y \) for some \(y \preceq v_1\).

In case of (a), we have \(x \in I\) since \(v_0 \in J\). In case of (b), we have \(y \in I\) since \(v_1 \in J\). We also have \(v_0 \in I\) since \(v_0 \in J \subseteq I\). Since I is closed under \(\circ \), we then have \(x = v_0 y \in I\). We thus see that both cases imply \(v_0 v_1 \in J\). Hence, J is closed under \(\circ \).

We let

$$\begin{aligned} K = \lbrace v \in J : \ \forall w \preceq v \; \forall x y \; [ \ x \preceq y \wedge y \preceq w \rightarrow x \preceq w \ ] \ \rbrace \ . \end{aligned}$$

By \({\textsf {D}}_4\), we have

$$\begin{aligned} x \preceq y \wedge y \preceq w \wedge w \preceq 0 \ \Rightarrow \ x = y = w = 0 \preceq 0 \ . \end{aligned}$$

Hence, \(0 \in K \). By \( {\textsf {D}}_{4}^{ \prime } \), we have

$$\begin{aligned} x \preceq y \wedge y \preceq w \wedge w \preceq 1 \ \Rightarrow \ x = y = w = 1 \preceq 1 \ . \end{aligned}$$

Hence, \(1 \in K \).

We now show that K is closed under \(\circ \). Suppose \(v_0, v_1 \in K\) and

$$\begin{aligned} w \preceq v_0 v_1 \wedge x \preceq y \wedge y \preceq w \ . \end{aligned}$$

We need to show that \(x \preceq w\). From \(w \preceq v_0 v_1\) and \(v_1 \in K \subseteq C\), we have one of the following:

  1. (I)

    \(w \preceq v_0\)

  2. (II)

    \(w = v_0 u\) for some \(u \preceq v_1\).

In case of (I), since \(v_0 \in K\), we have

$$\begin{aligned} w \preceq v_0 \wedge x \preceq y \wedge y \preceq w \ \Rightarrow \ x \preceq w \ . \end{aligned}$$

In case of (II), we observe that \(y \preceq w = v_0 u\) and \( u \preceq v_1 \). Furthermore, since \(v_1 \in J\), we have that \(u \preceq v_1\) implies \(u \in I \subseteq C\). Then, \(y \preceq w= v_0 u\) and \(u \in C\) implies that we have one of the following:

  1. (IIa)

    \(y \preceq v_0\)

  2. (IIb)

    \(y = v_0 u^{ \prime } \) for some \(u^{ \prime } \preceq u\).

In case of (IIa), since \(v_0 \in K \subseteq B\), we have

$$\begin{aligned} v_0 \preceq v_0 \wedge x \preceq y \wedge y \preceq v_0 \ \Rightarrow \ x \preceq v_0 \end{aligned}$$

since \(v_0 \in K\). Then, since \(u \in C\), we have

$$\begin{aligned} x \preceq v_0 \ \Rightarrow \ x \preceq v_0 u = w \ . \end{aligned}$$

In case of (IIb), we observe that \(v_1 \preceq v_1\) since \(v_1 \in K \subseteq B\). Then, \(v_1 \in K\) implies

$$\begin{aligned} v_1 \preceq v_1 \wedge u^{ \prime } \preceq u \wedge u \preceq v_1 \ \Rightarrow \ u^{ \prime } \preceq v_1 \ . \end{aligned}$$

Since \(v_1 \in K \subseteq J\), we have that \(u^{ \prime } \preceq v_1 \) implies \(u^{ \prime } \in I \subseteq C\). Then, \(x \preceq y = v_0 u^{ \prime } \) and \(u^{ \prime } \in C\) implies that we have one of the following:

  1. (IIbi)

    \(x \preceq v_0\)

  2. (IIbii)

    \(x = v_0 u^{ \prime \prime } \) for some \(u^{ \prime \prime } \preceq u^{ \prime } \).

In case of (IIbi), since \(u \preceq v_1\) and \(v_1 \in K \subseteq J\) implies \(u \in I \subseteq C \), we have

$$\begin{aligned} x \preceq v_0 \ \Rightarrow \ x \preceq v_0 u = w \ . \end{aligned}$$

In case of (IIbii), we first observe that

$$\begin{aligned} u \preceq v_1 \wedge u^{ \prime \prime } \preceq u^{ \prime } \wedge u^{ \prime } \preceq u \&\Rightarrow \ u^{ \prime \prime } \preceq u \end{aligned}$$

since \(v_1 \in K\). Then, since \(u \in C \), we have

$$\begin{aligned} x = v_0 u^{ \prime \prime } \wedge u^{ \prime \prime } \preceq u \ \Rightarrow \ x\preceq v_0 u = w \ . \end{aligned}$$

We thus see that

$$\begin{aligned} x \preceq y \wedge y \preceq w \wedge w \preceq v_0 v_1 \ \Rightarrow \ x \preceq w \ . \end{aligned}$$

Hence, \(v_0 v_1 \in K\). Thus, K is closed under \(\circ \).

We now show that the class K has the following important property:

$$\begin{aligned} \forall v \in K \; \forall w \; [ \ w \preceq v \rightarrow w \in K \ ] \ . \qquad \quad (*) \end{aligned}$$

Indeed, suppose \( v \in K \) and \( w \preceq v \). We need to show that \(w \in K \). By definition of K, we need to prove

$$\begin{aligned} z \preceq w \wedge x \preceq y \wedge y \preceq z \ \Rightarrow \ x \preceq z \ . \end{aligned}$$

Since \(v \in K \subseteq B\), we have \(v \preceq v\). Then,

$$\begin{aligned} v \in K \wedge v \preceq v \wedge z \preceq w \wedge w \preceq v \ \Rightarrow \ z \preceq v \ . \end{aligned}$$

This in turn implies

$$\begin{aligned} v \in K \wedge z \preceq v \wedge x \preceq y \wedge y \preceq z \ \Rightarrow \ x \preceq z \ . \end{aligned}$$

Hence, \( w \in K \). Thus, (*) holds.

We are now ready to give an interpretation of \(\textsf {BTQ} \) in \({\textsf {D}}\). We choose the domain K. We translate 0, 1 and \(\circ \) as

$$\begin{aligned} \phi _{0} (x)&\equiv \ x = 0\\ \phi _{1} (x)&\equiv \ x = 1\\ \phi _{\circ } (x, y, z )&\equiv \ z = xy \ . \end{aligned}$$

It is clear that the translations of \(\textsf {BTQ}_{1} \), \(\textsf {BTQ}_{2} \) and \(\textsf {BTQ}_{3} \) are theorems of \({\textsf {D}}\). We now show that the translation of \(\textsf {BTQ}_{4} \) is a theorem of \({\textsf {D}}\). Let \(x \in K\). Since \(K \subseteq I\), we have

$$\begin{aligned} x = 0 \vee x= 1 \vee \exists y \preceq x \; [ \ x= y0 \vee x=y1 \ ] \ . \end{aligned}$$

If the third disjunct is the case, then we have \(y \in K\) by (*). Thus, the translation of \(\textsf {BTQ}_{4} \) is a theorem of \({\textsf {D}}\). Hence, \(\textsf {BTQ}\) is interpretable in \({\textsf {D}}\). \(\square \)

Theorem 7

\( {\textsf {D}}\) is interpretable in \( \textsf {BTQ}\).

Proof

We choose the domain \(J(x) \equiv \ x = x \). We translate 0, 1, \(\circ \) and and \(\preceq \) as follows:

$$\begin{aligned} \phi _{0} (x)&\equiv \ x = 0\\ \phi _{1} (x)&\equiv \ x = 1\\ \phi _{\circ } (x, y, z )&\equiv \ z = xy\\ \phi _{ \preceq } (x, y )&\equiv \ y = x \vee \exists z \; [ \ y = xz \ ] \ . \end{aligned}$$

It is clear that the translation of \({\textsf {D}}_{1}\), \({\textsf {D}}_{2}\) and \({\textsf {D}}_{3}\) are theorems of \(\textsf {BTQ}\). We now show that the translation of

$$\begin{aligned} {\textsf {D}}_{4} \equiv \ \forall x \; [ \ x \preceq 0 \leftrightarrow x=0 \ ] \end{aligned}$$

is a theorem of \(\textsf {BTQ}\). So, suppose \(0 = x \vee \exists z \; [ \ 0 = xz \ ] \). It is not difficult to see that we cannot have \(\exists z \; [ \ 0 = xz \ ] \). Hence, the translation of \({\textsf {D}}_{4}\) is a theorem of \(\textsf {BTQ}\).

We now show that the translation of

$$\begin{aligned} {\textsf {D}}_{5} \equiv \ \forall x y \; [ \ x \preceq y 0 \leftrightarrow ( \ x = y 0 \vee x \preceq y \ ) \ ] \end{aligned}$$

is a theorem of \(\textsf {BTQ}\). We need to show that

$$\begin{aligned} y0 = x \vee \exists z \; [ \ y0 = xz \ ] \ \Leftrightarrow \ y0 = x \vee y = x \vee \exists u \; [ \ y = xu \ ] \ . \end{aligned}$$

It suffices to show that

$$\begin{aligned} \exists z \; [ \ y0 = xz \ ] \ \Leftrightarrow \ y = x \vee \exists u \; [ \ y = xu \ ] \ . \qquad \quad (*) \end{aligned}$$

The right-left implication of (*) is trivial. The left-right implication holds since

$$\begin{aligned} \begin{array}{l c l c l } y0 = xz &{} \Rightarrow &{} ( \ y0 = x0 \wedge z=0 \ ) \vee \\ &{} &{} \exists z^{ \prime } \; [ \ y0 = x ( z^{ \prime }0 ) \wedge z = z^{ \prime } 0 \ ] &{} &{} ( \text{ by } \textsf {BTQ}_{4}, \textsf {BTQ}_{3} \ ) \\ &{} \Rightarrow &{} ( \ y0 = x0 \wedge z=0 \ ) \vee \\ &{} &{}\exists z^{ \prime } \; [ \ y0 =( x z^{ \prime } ) 0 \wedge z = z^{ \prime } 0 \ ] &{} &{} ( \text{ by } \textsf {BTQ}_{1} \ ) \\ &{} \Rightarrow &{}( \ y = x \wedge z=0 \ ) \vee \\ &{} &{} \exists z^{ \prime } \; [ \ y = x z^{ \prime } \wedge z = z^{ \prime } 0 \ ] &{} &{} ( \text{ by } \textsf {BTQ}_{2} \ ) \\ &{} \Rightarrow &{} y=x \vee \exists u \; [ \ y = xu \ ] \ . \end{array} \end{aligned}$$

Thus, the translation of \({\textsf {D}}_{5}\) is a theorem of \(\textsf {BTQ}\).

By similar reasoning, the translation of \({\textsf {D}}_{6}\) is a theorem of \(\textsf {BTQ}\).

Thus, \({\textsf {D}}\) is interpretable in \(\textsf {BTQ}\). \(\square \)

6 Minimality results

This section is devoted to show that the axiomatizations of \( \textsf {WD} \), \( \textsf {WBT} \) and \( \textsf {BTQ} \) are minimal essentially undecidable, which is to say that removing any one of the axioms (axiom schemas) gives a theory that is not essentially undecidable. We are not able to show that our axiomatization of \( {\textsf {D}} \) is minimal essentially undecidable, but we reduce the problem to showing that \(\textsf { D} \! - \! \textsf { D}_5 \) and \(\textsf { D} \! - \! \textsf { D}_6 \) are not essentially undecidable. However, as \( {\textsf {D}} \) has a finite axiomatization, we can make it minimal essentially undecidable by replacing some of the axioms with their conjunction.

We now proceed to show that \(\textsf { WBT }\! - \! \textsf {WBT}_1\), \(\textsf { WD} \! - \! \textsf {WD}_1\) and \(\textsf { D} \! - \! \textsf {D}_1 \) are not essentially undecidable by interpreting them in \(\textsf {S2S}\). S2S is a monadic second order theory whose language is \(\lbrace {\textsf {e}} , 0, 1 , \textsf {S}_{0 } , \textsf {S}_{1 } \rbrace \), where \({\textsf {e}} \), 0 and 1 are constant symbols and \( \textsf {S}_{0 }\) and \( \textsf {S}_{1 }\) are unary function symbols. The axioms of S2S are the true sentences in the standard second-order structure where the universe is \(\lbrace {{0}}, {{1}} \rbrace ^{*} \). The symbol \( {\textsf {e}} \) is interpreted as the empty string, 0 is interpreted as 0 and 1 is interpreted as 1. The function symbol \( \textsf {S}_{0 }\) is interpreted as the function that takes a bit string and concatenates it with the bit 0, and the function symbol \( \textsf {S}_{1 }\) is interpreted as the function that takes a bit string and concatenates it with the bit 1. We have quantifiers that range over \( \lbrace {{0}}, {{1}} \rbrace ^* \), and we have quantifiers that range over subsets of \( \lbrace {{0}}, {{1}} \rbrace ^* \). It was proved in Rabin [13] that S2S is decidable. Our interpretation of \(\textsf { WBT} \! - \! \textsf {WBT}_{1 }\) in \(\textsf {S2S}\) does not use the monadic second order part, and this makes the induced algorithm more efficient. It is known that extending S2S with the prefix relation does not change the expressive power of S2S (see Börger et al. [1] p. 317). We also show this when we interpret \(\textsf { WD} \! - \! \textsf {WD}_{1 }\) and \(\textsf { D } \! - \! \textsf {D}_1 \) in S2S.

We recall that biterals are associated to the left. So, \(( (0 \circ 0) \circ 0) \) is a biteral while \(( 0 \circ (0 \circ 0) )\) is not. Although we have so far not needed to take this into account, it now becomes important.

Lemma 4

\(\textsf { WBT }\! - \! \textsf {WBT}_1 \) and \(\textsf { WD} \! - \!\textsf {WD}_1 \) are not essentially undecidable.

Proof

We interpret the two theories in \( \textsf {S2S} \) as follows: We choose the domain \(\delta (x) \equiv \ x=x \). We map 0 and 1 to

$$\begin{aligned} \phi _{0} (x) \equiv&\ x = 0 \\ \phi _{1} (x) \equiv&\ x = 1 \ . \end{aligned}$$

So, 0 and 1 are realized as 0 and 1, respectively. We map \(\circ \) to

$$\begin{aligned} \phi _{\circ } (x, y , z )&\equiv \ \Big ( \ ( \ x = {\textsf {e}} \vee ( \ y \ne 0 \wedge y \ne 1 \ ) \ ) \wedge z= {\textsf {e}} \ \Big ) \vee \\&\quad \Big [ \ x \ne {\textsf {e}} \wedge \Big ( \ ( \ y = 0 \wedge z = \textsf {S}_{0 } x \ ) \vee \\&\quad ( \ y = 1 \wedge z = \textsf {S}_{1 } x \ ) \ \Big ) \ \Big ] \ . \end{aligned}$$

This means that \(\circ \) is realized as the function

$$\begin{aligned} (x, y ) \mapsto {\left\{ \begin{array}{ll} \varepsilon &{} \quad \text{ if } x = \varepsilon \text{ or } y \notin \lbrace {{0}}, {{1}} \rbrace \\ x {{0}} &{} \quad \text{ if } x \ne \varepsilon \text{ and } y = {{0}}\\ x {{1}} &{} \quad \text{ if } x \ne \varepsilon \text{ and } y = {{1}} \ . \end{array}\right. } \end{aligned}$$

Recalling that biterals are associated to the left, it is clear that the translation of each instance of \( \textsf {WBT}_{2} \) and \( \textsf {WD}_{2} \) is a theorem of \( \textsf {S2S} \). We now show that the translation of each instance of

$$\begin{aligned} \textsf {WBT}_{3} \equiv \ \forall x \; [ \ x \sqsubseteq _{ {\textsf {s}} } {\overline{\alpha }} \rightarrow \bigvee _{ \gamma \in \langle ... \alpha \rangle } x= {\overline{\gamma }} \ ] \end{aligned}$$

is a theorem of \( \textsf {S2S} \). We recall that

$$\begin{aligned} x \sqsubseteq _{ {\textsf {s}} } y \equiv \ y = x \vee \exists u v \; [ \ y=ux \vee y= xv \vee y= (ux) v \vee y=u (xv ) \ ] \ . \end{aligned}$$

Let \( \alpha \ne \varepsilon \). Suppose \(\beta \) is a substring of \( \alpha \) w.r.t. \( \phi _{ \circ } (x, y, z) \). We show that \( \beta \ne \varepsilon \) and that \( \beta \) is a substring of \( \alpha \) in the actual sense. We have one of the following cases:

  • \( \alpha = \beta \)

  • There exists u such that \( \phi _{ \circ } (u, \beta , \alpha ) \). Then, \( \alpha = u \beta \), \( \ \beta \in \lbrace {{0}}, {{1}} \rbrace \) and \(u \ne \varepsilon \).

  • There exists v such that \( \phi _{ \circ } ( \beta , v , \alpha ) \). Then, \( \alpha = \beta v \), \( \ \beta \ne \varepsilon \) and \( v \in \lbrace {{0}}, {{1}} \rbrace \).

  • There exist uv and w such that \( \phi _{ \circ } ( u, \beta , w ) \wedge \phi _{ \circ } ( w, v, \alpha ) \). Then, \( \alpha = u \beta v \), \( \ u \ne \varepsilon \) and \( \beta , v \in \lbrace {{0}}, {{1}} \rbrace \).

  • There exist uv and w such that \( \phi _{ \circ } ( \beta , v , w ) \wedge \phi _{ \circ } ( u, w , \alpha ) \). This is not possible since \( \phi _{ \circ } ( u, w , \alpha ) \) implies \( w \in \lbrace {{0}}, {{1}} \rbrace \) while we cannot have v such that \( \phi _{ \circ } ( \beta , v , {{0}} ) \) or \( \phi _{ \circ } ( \beta , v , {{1}} ) \).

We thus see that \( \beta \) is a substring of \( \alpha \) in the actual sense.

We map \(\preceq \) to

$$\begin{aligned}&\phi _{ \preceq } (x, y ) \equiv \ \exists Y \; \Big [ \ {\textsf {e}} \notin Y \; \wedge \; x \in Y \; \wedge \; y\in Y \; \wedge \; \textsf {S}_{0 } y \notin Y \; \wedge \; \textsf {S}_{1 } y \notin Y \; \wedge \; \\&\quad \forall z \; [ \ \big ( z \ne {\textsf {e}} \; \wedge \; ( \ \textsf {S}_{0 } z \in Y \; \vee \; \textsf {S}_{1 } z \in Y \ ) \ \big ) \rightarrow z \in Y \ ] \; \wedge \; \\&\quad \forall w \; [ \ \textsf {S}_{0 } w \in Y \rightarrow \textsf {S}_{1 } w \notin Y \ ] \ \Big ] \ . \end{aligned}$$

This formula forces \(\preceq \) to be realized as the prefix relation on \( \lbrace {{0}}, {{1}} \rbrace ^+ \). Indeed, suppose \(\alpha , \beta \in \lbrace {{0}}, {{1}} \rbrace ^+ \) and \(\alpha \) is related to \(\beta \) as defined by \(\phi _{ \preceq } (x, y )\). The first line in \(\phi _{ \preceq } (x, y )\) tells us that there exists \(Y \subseteq \lbrace {{0}}, {{1}} \rbrace ^+ \) such that

$$\begin{aligned} \alpha , \beta \in Y \ \ \text{ and } \ \ \beta {{0}}, \beta {{1}} \notin Y \ . \end{aligned}$$

The second line in \(\phi _{ \preceq } (x, y )\) tells us that if \( \gamma \in Y \), then all the non-empty prefixes of \( \gamma \) are also in Y. Let

$$\begin{aligned} \lbrace 0, 1 \rbrace \ni \alpha _0 \prec \alpha _1 \prec \ldots \prec \alpha _k = \alpha \ \ \ \text{ and } \ \ \ \lbrace 0, 1 \rbrace \ni \beta _0 \prec \beta _1 \prec \ldots \prec \beta _m = \beta \end{aligned}$$

denote all the prefixes of \( \alpha \) and \( \beta \). The notation \( \gamma \prec \delta \) then means that \( \gamma \) is the longest proper prefix of \( \delta \). We show by induction that for each \( 0 \le j \le k\), there exists \( 0 \le i \le m \) such that \( \alpha _j = \beta _i \), which implies that \( \alpha \) is a prefix of \( \beta \). The third line in \(\phi _{ \preceq } (x, y )\) tells us that exactly one of 0 and 1 belongs to Y since we get for free that \( \forall w \; [ \ \textsf {S}_{1 } w \in Y \rightarrow \textsf {S}_{0 } w \notin Y \ ] \). Hence, \( \alpha _0 = \beta _0 \). Suppose now that for \(0 \le j < k \) there exists \(0 \le i \le m \) such that \( \alpha _j = \beta _i \). We observe that we must have that \( i < m \) since \( \alpha _j = \beta _i = \beta \) and \( \alpha _{ j+1} \in Y \) would contradict \( \beta {{0}}, \beta {{1}} \notin Y \). It remains to show that \( \alpha _{ j+1} = \beta _{ i+1} \). The third line in \(\phi _{ \preceq } (x, y )\) tells us that exactly one of \( \alpha _j {{0}} \) and \( \alpha _j {{1}} \) belongs to Y. Hence, \( \alpha _{ j+1} = \beta _{ i+1} \). Thus, by induction, each prefix of \( \alpha \) is a prefix of \( \beta \).

We thus see that this translation has the desired properties. This translation shows us how to define decidable models of the theories in question. \(\square \)

Lemma 5

\(\textsf { D} \! - \! \textsf { D}_1 \) is not essentially undecidable.

Proof

We modify the translation in Lemma 4 as follows: We map \(\circ \) to

$$\begin{aligned} \phi _{\circ } (x, y , z ) \equiv&\ \Big ( \ ( \ y \ne 0 \; \vee \; y \ne 1 \ ) \; \wedge \; z= x \ \Big ) \vee ( \ y = 0 \; \wedge \; z = \textsf {S}_{0 } x \ ) \vee ( \ y = 1 \;\\&\wedge \; z = \textsf {S}_{1 } x \ ) \ \ . \end{aligned}$$

This means that \(\circ \) is realized as the function

$$\begin{aligned} (x, y ) \mapsto {\left\{ \begin{array}{ll} x &{} \quad \text{ if } y \notin \lbrace {{0}}, {{1}} \rbrace \\ x {{0}} &{} \quad \text{ if } y = {{0}} \\ x {{1}} &{} \quad \text{ if } y = {{1}} \ . \end{array}\right. } \end{aligned}$$

We need this modification to ensure that the axiom \( {\textsf {D}}_{3} \equiv \ \forall x y \; [ \ x0 \ne y1 \ ] \) holds. \(\square \)

We observe that the simple translation of \(\circ \) we give in the preceding lemma does not work in the case of Lemma  4 since the axiom scheme \( \textsf {WBT}_{3} \) then fails. Indeed, for any non-empty bit string \( \alpha \) and any bit string \( \beta \ne 0, 1\), we have \( \phi _{\circ } ( {\overline{\alpha }}, {\overline{\beta }} , {\overline{\alpha }} ) \). This means that all substring different from 0 and 1 are substrings of \( \alpha \) w.r.t. \( \phi _{\circ } ( x, y, z ) \).

Lemma 6

\(\textsf { BTQ }\! - \! \textsf {BTQ}_1 \) is not essentially undecidable.

Proof

Let \(\varSigma \) denote the set of all sentences in first order logic with equality and the language \(\lbrace 0,1,+, \times , \le \rbrace \) true in the structure \( ( {\mathbb {R}}, 0,1, +, \times , \le )\). It was shown by Tarski [17] that \(\varSigma \) is decidable. We show that \(\textsf { BTQ } \! - \! \textsf {BTQ}_1 \) is interpretable in \(\varSigma \).

We now define a relative interpretation of \( \textsf { BTQ } \! - \! \textsf {BTQ}_1 \) in \( \varSigma \). We relativize quantification to \(\lambda (x) :\equiv x=x\) and use the following translation of symbols

$$\begin{aligned} 0 \mapsto \&\psi _{0} (x) :\equiv \ x=0 \\ 1 \mapsto \&\psi _{1} (x) :\equiv \ x= 1 \\ \circ \mapsto \&\psi _{\circ } (x,y,z) \end{aligned}$$

where \(\psi _{\circ } (x,y,z) \) is a formula which defines in \( ( {\mathbb {R}}, 0,1, +, \times , \le )\) the function \(h = h_3 \circ h_2 \circ h_1 : {\mathbb {R}}^2 \rightarrow {\mathbb {R}} \) where

$$\begin{aligned} h_1(r,u)&= {\left\{ \begin{array}{ll} -2+ \frac{2r}{1+ \sqrt{1+ 4r^{2}}} &{} \quad \text{ if } u= 0 \\ \frac{2r}{1+ \sqrt{1+ 4r^{2}}} &{} \quad \text{ if } u= 1 \\ 0 &{} \quad \text{ otherwise } \end{array}\right. } \\ h_2(r)&= {\left\{ \begin{array}{ll} \frac{1+r}{2} &{} \quad \text{ if } r \in (-3,-1) \cup (-1,1) \\ 0 &{} \quad \text{ otherwise } \end{array}\right. } \\ h_3(r)&= {\left\{ \begin{array}{ll} \frac{1}{r} - 1 &{} \quad \text{ if } r \in (0,1) \\ \frac{1}{r} + 1 &{} \quad \text{ if } r \in (-1, 0) \\ 0 &{} \quad \text{ otherwise } \ . \end{array}\right. } \end{aligned}$$

We first observe that the map \({\mathbb {R}} \rightarrow (-1,1)\) defined by

$$\begin{aligned} r \mapsto \frac{2r}{1+ \sqrt{1+ 4r^{2}}} \end{aligned}$$

is a bijection. The sections \(h(-,0)\) and \( h(-,1)\) are injective and have images \((- \infty , 0)\) and \( (0, \infty )\). This is because we have the following sequences of bijections:

$$\begin{aligned}&{\mathbb {R}} \xrightarrow [ h_1(-,0) ] {\sim } (-3, -1) \xrightarrow [ h_2 ] {\sim } (-1, 0) \xrightarrow [ h_3 ] {\sim } (- \infty , 0 ) \\&{\mathbb {R}} \xrightarrow [ h_1(-,1) ] {\sim } (-1, 1) \xrightarrow [ h_2 ] {\sim } (0, 1 ) \xrightarrow [ h_3 ] {\sim } (0, \infty ) \ . \end{aligned}$$

The fact that \(h(-,0)\) and \( h(-,1)\) are injective and have disjoint images implies that the translation of

$$\begin{aligned} \textsf {BTQ}_{2} \equiv \ \forall x y \; [ \ x \ne y \rightarrow ( \ x 0 \ne y 0 \wedge x 1 \ne y 1 \ ) \ ] \end{aligned}$$

and

$$\begin{aligned} \textsf {BTQ}_{3} \equiv \ \forall x y \; [ \ x0 \ne y1 \ ] \end{aligned}$$

are theorems of \(\varSigma \). The translation of

$$\begin{aligned} \textsf {BTQ}_{4} \equiv \ \forall x \; [ \ x =0 \vee x=1 \vee \exists y \; [ \ x=y0 \vee x= y1 \ ] \ ] \end{aligned}$$

is also a theorem of \(\varSigma \) since we have defined \( \psi _0 (x) \equiv x=0\) and since the union of the images of \(h(-,0)\) and \( h(-,1)\) is

$$\begin{aligned} (-\infty , 0) \cup (0, \infty ) = {\mathbb {R}} \setminus \lbrace 0 \rbrace \ . \end{aligned}$$

The translation shows how to define a decidable model of \(\textsf { BTQ} \! - \! \textsf {BTQ}_1 \). \(\square \)

One of the referees has observed that we get a simpler proof of the preceding lemma by considering the translation given in Lemma 4 and restricting the domain to the set of all non-empty strings. We can thus for example translate \( \circ \) as follows

$$\begin{aligned} (x, y ) \mapsto {\left\{ \begin{array}{ll} x {{0}} &{} \quad \text{ if } y = {{0}} \\ x {{1}} &{} \quad \text{ if } y = {{1}} \\ 0 &{} \quad \text{ otherwise. } \end{array}\right. } \end{aligned}$$

Lemma 7

\(\textsf { WBT} \! - \! \textsf {WBT}_2 \) and \(\textsf { WD} \! - \! \textsf {WD}_2 \) are not essentially undecidable.

Proof

We obtain a one element model \({\mathfrak {A}}\) of \(\textsf { WBT }\! - \! \textsf {WBT}_{2 }\) and \(\textsf { WD} \! - \! \textsf {WD}_2 \) as follows:

  • the universe is \(\lbrace 0 \rbrace \)

  • \(0^{ {\mathfrak {A}} } = 0\), \( \ 1^{ {\mathfrak {A}} } = 0\)

  • \(0 \circ ^{ {\mathfrak {A}} } 0 = 0\)

  • \( \preceq ^{ {\mathfrak {A}} } = \lbrace (0, 0 ) \rbrace \).

Since \(\circ ^{ {\mathfrak {A}} }\) is associative, \(\textsf { WBT}_1 \) and \(\textsf {WD}_{1 }\) hold in \({\mathfrak {A}}\). We observe that \( {\mathfrak {A}}\) satisfies

$$\begin{aligned} \forall x \; [ \ x \sqsubseteq _\mathsf{s} {\overline{\alpha }} \rightarrow x={\overline{\alpha }} \ ] \ \ \text{ and } \ \ \forall x \; [ \ x \preceq {\overline{\alpha }} \leftrightarrow \bigvee _{ \beta \preceq ^{ {\mathfrak {D}} } \alpha } x={\overline{\beta }} \ ] \end{aligned}$$

since there is only one element in the universe. Thus, \(\textsf { WBT}_{3 }\) and \(\textsf { WD}_{3 }\) hold in \({\mathfrak {A}}\). \(\square \)

Lemma 8

\({\textsf {BTQ}} \! - \! {\textsf {BTQ}}_2\) is not essentially undecidable.

Proof

We obtain a two element model \({\mathfrak {A}}\) of \({\textsf {BTQ}} \! - \! {\textsf {BTQ}}_2\) as follows:

  • the universe is \(\lbrace 0,1 \rbrace \)

  • \(0^{ {\mathfrak {A}} } = 0\), \( \ 1^{ {\mathfrak {A}} } = 1\)

  • \(\forall x,y \in \lbrace 0,1 \rbrace ( \ x \circ ^{ {\mathfrak {A}} } y = y \ ) \).

The operator \( \circ ^{ {\mathfrak {A}} } \) is clearly associative. The axiom \( \forall x y \; [ \ x0 \ne y1 \ ] \) holds since

$$\begin{aligned} x \circ ^{ {\mathfrak {A}} } 0 = 0 \ne 1 = y \circ ^{ {\mathfrak {A}} } 1 \ . \end{aligned}$$

The axiom \(\forall x \; [ \ x = 0 \vee x=1 \vee \exists y \; [ \ x=y0 \vee x=y1 \ ] \ ] \) holds since the universe is \( \lbrace 0^{ {\mathfrak {A}} } , 1^{ {\mathfrak {A}} } \rbrace \). \(\square \)

The following proof of the next Lemma was suggested by one of the referees . We show that \(\textsf {D} \! - \! \textsf {D}_2\) is not essentially undecidable by interpreting it in Presburger Arithmetic. Presburger Arithmetic refers to all sentences in first-order logic with equality and the language \( \lbrace 0,1, + , < \rbrace \) true in the structure \( ( {\mathbb {N}}, 0, 1, + , < )\). It is shown that Presburger Arithmetic is decidable in Chapter 3.4 of Smoryński [15]. The idea is to consider the free monoid generated by 0 and 1 modulo the equations \(00 = 0\) and \(11= 1\). The universe of this structure then consists of strings of the form \((01)^n\), \(1 (01)^n \), \( (01)^n 0 \), \(1 (01)^n 0 \), with n being a natural number. Since we do not have the empty string, we have \(n>0 \) when the string is of the form \((01)^n\). The concatenation operator on this set is described as follows

$$\begin{aligned} x \star y = {\left\{ \begin{array}{ll} xy &{} \quad \text{ if } x = ua, \ y= b v , \ a \ne b \text{ and } a,b \in \lbrace 0, 1 \rbrace \\ u a v &{} \quad \text{ if } x = ua , \ y = a v \text{ and } a \in \lbrace 0, 1 \rbrace \ . \end{array}\right. } \end{aligned}$$

Observe that u and v denote the empty string in the cases \(x, y \in \lbrace 0, 1 \rbrace \). It is clear that axiom \( {\textsf {D}}_{3} \equiv \ \forall xy \; [ \ x0 \ne x1 \ ] \) holds in this model. It also follows easily from the definition of \(\star \) that \((x \star y ) \star z = x \star (y \star z ) \) when y has length at least two. To see that \( \star \) is indeed associative, we observe that given distinct \(a, b \in \lbrace 0, 1 \rbrace \), we have

$$\begin{aligned} (x \star a ) \star z&= {\left\{ \begin{array}{ll} x a z &{} \quad \text{ if } x = ub , \ z = b v \\ x a v &{} \quad \text{ if } x = ub , \ z = a v \\ u a z &{} \quad \text{ if } x = ua , \ z = bv \\ u a v &{} \quad \text{ if } x = ua , \ z = av \end{array}\right. } \\&= x \star (a \star z ) \ . \end{aligned}$$

Thus, axiom \( {\textsf {D}}_{1} \equiv \ \forall xy z \; [ \ (xy)z = x (yz) \ ] \) holds.

We interpret \( \preceq \) as the prefix relation, i.e., \( x \preceq y \) if and only if \(x = y\) or there exists z such that \(y = x \star z \). Axiom \( {\textsf {D}}_{4} \equiv \ \forall x \; [ \ x \preceq 0 \leftrightarrow x = 0 \ ] \) holds since \(x \star z = 0\) implies \(x = z = 0 \). Axiom \( {\textsf {D}}_{5} \equiv \ \forall x y \; [ \ x \preceq y0 \leftrightarrow ( x = y0 \vee x \preceq y \ ] \) holds by the following reasoning:

  • Suppose \( x \preceq y \star 0 \) and \( y \star 0 = x \star z \) for some z. If y has 0 as a suffix, then \( y = x \star z \), which implies \( x \preceq y \). Assume now y has 1 as a suffix. We have that \(z = 0 \) or \(z = u10 \) where u is possibly empty. We first assume \(z = 0\). If x has 0 as a suffix, then \( y \star 0 = x \). If y and x both have 1 as suffices, then \(y = x\), which in turn implies \(x \preceq y \). Suppose now \( z = u10 \). Then \(y = x \star u1 \), which in turn implies \(x \preceq y \).

  • Suppose \( x = y \star 0 \ \vee \ y = x \ \vee \ \exists z \; [ \ y = x \star z \ ] \). Then, by associativity of \( \star \), we have \( x = y \star 0 \ \vee \ y \star 0 = x \star 0 \ \vee \ \exists z \; [ \ y \star 0 = x \star ( z \star 0 ) \ ] \). Hence \( x \preceq y \star 0 \).

By similar reasoning, axiom \( {\textsf {D}}_{6} \equiv \ \forall x y \; [ \ x \preceq y1 \leftrightarrow ( x = y1 \vee x \preceq y \ ] \) holds.

Now, to interpret \(\mathsf D \! - \! D_2\) in Presburger Arithmetic, we associate strings with natural numbers as follows

$$\begin{aligned} (01)^n \mapsto 4n , \ \ 1 (01)^n \mapsto 4n+1 , \ \ (01)^n 0 \mapsto 4n+2 , \ \ 1 (01)^n 0 \mapsto 4n+3 \ . \end{aligned}$$

It is then not difficult to see that \( \star \) is definable in Presburger Arithmetic. This in turn implies that the prefix relation is definable. We thus have the following result.

Lemma 9

\({\textsf {D}} \! - \! {\textsf {D}}_2\) is not essentially undecidable.

Lemma 10

\({\textsf {BTQ}} \! - \! {\textsf {BTQ}}_3\) and \({\textsf {D}} \! - \! {\textsf {D}}_3\) are not essentially undecidable.

Proof

Since Presburger Arithmetic is decidable, we obtain a decidable model of \({\textsf {BTQ}} \! - \! {\textsf {BTQ}}_3\) and \({\textsf {D}} \! - \! {\textsf {D}}_3\) as follows:

  • the universe is the set of natural numbers \({\mathbb {N}} \)

  • \(0^{ {\mathfrak {A}} } = 0\), \(1^{ {\mathfrak {A}} } = 0\)

  • \(\forall x,y \in {\mathbb {N}} \; \big ( \ x \circ ^{ {\mathfrak {A}} } y = x+y \ \big ) \)

  • \( x \preceq ^{ {\mathfrak {A}} } y \ \Leftrightarrow \ x=y \).

It is not difficult to see that axioms of \({\textsf {BTQ}} \! - \! {\textsf {BTQ}}_3\) hold. Axiom

$$\begin{aligned} {\textsf {D}}_{4} \equiv \forall x \; [ \ x \preceq 0 \leftrightarrow x = 0 \ ] \end{aligned}$$

holds since \( x \preceq ^{ {\mathfrak {A}} } 0^{ {\mathfrak {A}} }\) if and only if \( x= 0^{ {\mathfrak {A}} } \). The axioms

$$\begin{aligned}&{\textsf {D}}_{5} \equiv \forall x y \; [ \ x \preceq y 0 \leftrightarrow ( \ x = y0 \vee x \preceq y \ ) \ ] \ \text{ and } \\&{\textsf {D}}_{6} \equiv \forall x y \; [ \ x \preceq y 1 \leftrightarrow ( \ x = y1 \vee x \preceq y \ ) \ ] \end{aligned}$$

hold since \(x \preceq ^{ {\mathfrak {A}} } x \) and \( y \circ ^{ {\mathfrak {A}} } 1^{ {\mathfrak {A}} } = y = y \circ ^{ {\mathfrak {A}} } 0^{ {\mathfrak {A}} } \). \(\square \)

Lemma 11

\(\textsf { WD} \! - \! \textsf {WD}_3 \), \(\textsf { WBT }\! - \! \textsf {WBT}_{3 }\), \(\textsf { BTQ} \! - \! \textsf {BTQ}_4 \) and \(\textsf { D} \! - \! \textsf {D}_{4 }\) are not essentially undecidable.

Proof

We start by showing that \(\textsf { BTQ} \! - \! \textsf {BTQ}_4 \) has a decidable model. We consider the set \( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) \) of \(2 \times 2\) matrices with coefficients in \( {\mathbb {R}}_{ \ge 0}\) and which are such that the first entry is strictly positive. So, the elements of \( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) \) are of the form

$$\begin{aligned} \begin{pmatrix} x \in {\mathbb {R}}_{>0 } &{} y \in {\mathbb {R}}_{ \ge 0 }\\ z \in {\mathbb {R}}_{ \ge 0 } &{} w \in {\mathbb {R}}_{ \ge 0 } \end{pmatrix} \ . \end{aligned}$$

We consider the binary operation of matrix multiplication, denoted \( \times \). It is not difficult to verify that \( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) \) is closed under matrix multiplication. We can consider this structure as a first-order structure for the language \( \lbrace c_0, c_1, \times \rbrace \) where \(c_0\) and \(c_1\) are constant symbols for the matrices

$$\begin{aligned} \begin{pmatrix} 1 &{} 0 \\ 1 &{} 1 \end{pmatrix} \ \text{ and } \ \begin{pmatrix} 1 &{} 1 \\ 0 &{} 1 \end{pmatrix} \ \end{aligned}$$

respectively. Let \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \) denote the set of sentences true in this structure. We show that \(\textsf { BTQ} \! - \! \textsf {BTQ}_4 \) is interpretable in \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \) and that \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \) is decidable. Since relative interpretability preserves the property of being essentially undecidable, this means that \(\textsf { BTQ} \! - \! \textsf {BTQ}_4\) is not essentially undecidable.

We interpret \(\textsf { BTQ} \! - \! \textsf {BTQ}_4 \) in \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \) as follows:

  • We choose the domain \( \delta (x) \equiv x =x \).

  • We map 0 to \(\phi _{0} (x) = x = c_0\).

  • We map 1 to \(\phi _{1} (x) \equiv x = c_1\).

  • We map \(\circ \) to \(\phi _{\circ } (x,y,z ) \equiv x \times y = z \).

Since matrix multiplication is associative, the translation of

$$\begin{aligned} \textsf {BTQ}_{1} \equiv \forall x y z \; [ \ (xy) z = x (yz) \ ] \end{aligned}$$

is a theorem of \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \). To see that the translation of

$$\begin{aligned} \textsf {BTQ}_{2} \equiv \forall x y \; [ \ x \ne y \rightarrow ( \ x 0 \ne y 0 \wedge x 1 \ne y 1 \ ) \ ] \end{aligned}$$

is a theorem of \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \), we observe that

$$\begin{aligned} \begin{pmatrix} x &{} y \\ z &{} w \end{pmatrix} \times \begin{pmatrix} 1 &{} 0 \\ 1 &{} 1 \end{pmatrix} = \begin{pmatrix} a &{} b \\ c &{} d \end{pmatrix} \times \begin{pmatrix} 1 &{} 0 \\ 1 &{} 1 \end{pmatrix} \&\Rightarrow \ \begin{pmatrix} x+y &{} y \\ z+w &{} w \end{pmatrix} = \begin{pmatrix} a+b &{} b \\ c+d &{} d \end{pmatrix} \\&\Rightarrow \ x = a \wedge y = b \wedge z = c \wedge w = d \end{aligned}$$

and

$$\begin{aligned} \begin{pmatrix} x &{} y \\ z &{} w \end{pmatrix} \times \begin{pmatrix} 1 &{} 1 \\ 0 &{} 1 \end{pmatrix} = \begin{pmatrix} a &{} b \\ c &{} d \end{pmatrix} \times \begin{pmatrix} 1 &{} 1 \\ 0 &{} 1 \end{pmatrix} \&\Rightarrow \ \begin{pmatrix} x &{} x+y \\ z &{} z+w \end{pmatrix} = \begin{pmatrix} a &{} a+ b \\ c &{} c+d \end{pmatrix} \\ \\&\Rightarrow \ x = a \wedge y = b \wedge z = c \wedge w = d \ . \end{aligned}$$

To see that translation of

$$\begin{aligned} \textsf {BTQ}_{3} \equiv \forall x y \; [ \ x0 \ne y1 \ ] \end{aligned}$$

is a theorem of \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \), we observe that

$$\begin{aligned} \begin{pmatrix} x &{} y \\ z &{} w \end{pmatrix} \times \begin{pmatrix} 1 &{} 0 \\ 1 &{} 1 \end{pmatrix} = \begin{pmatrix} a &{} b \\ c &{} d \end{pmatrix} \times \begin{pmatrix} 1 &{} 1 \\ 0 &{} 1 \end{pmatrix} \ \Rightarrow&\ \begin{pmatrix} x+y &{} y \\ z+w &{} w \end{pmatrix} = \begin{pmatrix} a &{} a+ b \\ c &{} c+d \end{pmatrix} \\ \\ \Rightarrow&\ x+y = a \wedge y= a+b \ \\ \Rightarrow&\ x+ a +b = a \\ \Rightarrow&\ x+b = 0 \\ \Rightarrow&\ x=0 \ . \end{aligned}$$

But we cannot have \(x=0\) by how the set \( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) \) is defined. We then see that the translation of \( \textsf {BTQ}_{3}\) is a theorem of \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \)

To see that \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \) is decidable, we use Tarski‘s result that the set \( \textsf {Th} ( {\mathbb {R}} ) \) of sentences which are true in the ordered field of real numbers \( ( {\mathbb {R}}, 0, 1, + , \times , \le ) \) is decidable (see Tarski [17]). To see that this is the case, we associate each sentence \(\phi \) in the language of \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0 } ) ) \) with a sentence \( \phi ^{ *}\) in the language of \( \textsf {Th} ( {\mathbb {R}} ) \) such that

$$\begin{aligned} \phi \in \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \ \Leftrightarrow \ \phi ^{ * } \in \textsf {Th} ( {\mathbb {R}} ) . \end{aligned}$$

We start by associating \(c_0 \) and \(c_1\) with

$$\begin{aligned} \begin{pmatrix} 1 &{} 0 \\ 1 &{} 1 \end{pmatrix} \ \text{ and } \ \begin{pmatrix} 1 &{} 1 \\ 0 &{} 1 \end{pmatrix} \end{aligned}$$

respectively, and by associating each variable x with

$$\begin{aligned} \begin{pmatrix} x_1 &{} x_2 \\ x_3 &{} x_4 \end{pmatrix} \ . \end{aligned}$$

We recall that matrix multiplication is defined by

$$\begin{aligned} \begin{pmatrix} x &{} y \\ z &{} w \end{pmatrix} \times \begin{pmatrix} a &{} b \\ c &{} d \end{pmatrix} = \begin{pmatrix} xa + yc &{} xb + yd \\ za+ wc &{} zb + wd \end{pmatrix} \ . \end{aligned}$$

By following this definition, we associate each term s in the language of \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0 } ) ) \) with a matrix

$$\begin{aligned} \begin{pmatrix} p_{s , 1 } &{} p_{s, 2 } \\ p_{s, 3} &{} p_{s, 4 } \end{pmatrix} \end{aligned}$$

where each \( p_{s, i }\) is a term in the language of \( \textsf {Th} ( {\mathbb {R}} ) \). So, for example, the term \(x \times y\) is associated with the matrix

$$\begin{aligned} \begin{pmatrix} x_1 y_1 + x_2 y_3 &{} x_1 y_2 + x_2 y_4 \\ x_3 y_1 + x_4 y_3 &{} x_3 y_2 + x_4 y_4 \end{pmatrix} \ . \end{aligned}$$

Next we define \(\phi ^*\) by recursion on the structure of \(\phi \) as follows:

(1):

If \(\phi \equiv \ s=t\), then

$$\begin{aligned} \phi ^* \equiv \bigwedge _{ x \in S \cup T , 1 \le i \le 4 } x_i \ge 0 \ \wedge \ \bigwedge _{ y \in S \cup T } y_1 \ne 0 \ \wedge \ \bigwedge _{i=1}^{4} p_{ s, i } = p_{ t, i } \end{aligned}$$

where S and T denotes the set of variables occurring in s and t, respectively.

(2):

If \(\phi \equiv \ \lnot \alpha \), then \( \phi ^* \equiv \lnot \alpha ^* \)

(3):

If \(\phi \equiv \ \alpha \vee \beta \), then \( \phi ^* \equiv \alpha ^* \vee \beta ^* \)

(4):

If \(\phi \equiv \ \alpha \wedge \beta \), then \( \phi ^* \equiv \alpha ^* \wedge \beta ^* \)

(5):

If \(\phi \equiv \ \exists x \alpha \), then \( \phi ^* \equiv \exists x_1 x_2 x_3 x_ 4 \alpha ^* \)

(6):

If \(\phi \equiv \ \forall x \alpha \), then \( \phi ^* \equiv \forall x_1 x_2 x_3 x_ 4 \alpha ^* \).

The first big conjunct in clause (1) reflects the fact that matrices in \( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0 } ) \) have entries in \( {\mathbb {R}}_{ \ge 0} \). The second big conjunct in (1) reflects the fact that matrices in \( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0 } ) \) are such that the first entry is in \( {\mathbb {R}}_{ > 0} \). The last big conjunct in (1) states that the matrix associated with s equals the matrix associated with t. It is obvious that \( \phi \) is a theorem of \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \) if and only if \( \phi ^* \) is a theorem \( \textsf {Th} ( {\mathbb {R}} ) \). Thus, \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \) is decidable.

It is not difficult to see that the model of \(\textsf { BTQ} \! - \! \textsf {BTQ}_4 \) we have defined is also a model of \(\textsf { WD} \! - \! \textsf {WD}_3 \) and \(\textsf { WBT} \! - \! \textsf {WBT}_3 \). We extend it to a decidable model of \(\textsf { D }\! - \! \textsf {D}_4 \) by interpreting \(\preceq \) as \( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0 } ) \times \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0 } ) \). \(\square \)

Our proof of decidability of \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \) is actually a 4-dimensional interpretation of \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \) in \( \textsf {Th} ( {\mathbb {R}} ) \). This means that each object in the language of \( \textsf {Th} ( \textsf {M}_{2} ( {\mathbb {R}}_{ \ge 0} ) ) \) is associated with a quadruple of objects in the language of \( \textsf {Th} ( {\mathbb {R}} ) \). For more on this more general notion of interpretability, see Visser [20].

We can now state the main theorem of this section.

Theorem 8

\( \textsf {WD} \), \(\textsf {WBT} \) and \( \textsf {BTQ} \) are minimal essentially undecidable.

The only thing that is lacking to show that \( {\textsf {D}} \) is minimal essentially undecidable is to show that \( {\textsf {D}} \!- \! {\textsf {D}}_{5} \) and \( {\textsf {D}} \!- \! {\textsf {D}}_{6} \) are not essentially undecidable. A negative solution of the following problem would thus show that \( {\textsf {D}} \) is minimal essentially undecidable.

Open Problem 2

Show that neither \(\textsf { D } \! - \! \textsf {D}_5 \) nor \(\textsf { D } \! - \! \textsf {D}_6 \) is essentially undecidable.

We have not put much focus on the theories \( \textsf {BT} \) and \( {\textsf {C}} \). The proofs of Lemma 5 and Lemma 11 show that \( \textsf {C} \; - \; \textsf {C}_1 \), \( \textsf {C} \; - \; \textsf {C}_4 \) and \( \textsf { BT} \; - \; \textsf {BT}_4 \) are not essentially undecidable. Beyond that, the minimality of \( \textsf {BT} \) and \( {\textsf {C}} \) is an open problem.

Open Problem 3

Are \( \textsf {BT} \) and \( {\textsf {C}} \) minimal essentially undecidable?

In this section, we investigated whether our axiom sets are minimal w.r.t set inclusion and the property of being essentially undecidable. A different notion of minimality that we have been implicitly investigating is minimality w.r.t. interpretability. We have seen that \( \textsf {WD} \) is interpretable in all the essentially undecidable theories we have studied. It is however not minimal in the interpretability pre-order. In [7], Jeřábek shows that there is an essentially undecidable theory that is interpretable in \( {\textsf {R}} \) but that does not interpret \( {\textsf {R}} \). The theory Jeřábek gives is such that all partially recursive functions are representable. In Section 3 of [2], Yong Cheng uses results in [7] to give many examples of essentially undecidable theories that are interpretable in \( {\textsf {R}} \) but that do not interpret \( {\textsf {R}} \). One of the referees observed that the existence of theories strictly below \( {\textsf {R}} \) w.r.t. interpretability also follows from the work of Shoenfield [14].

Although there are many essentially undecidable theories below \( {\textsf {R}} \) w.r.t interpretability, to the best of our knowledge, it is not known whether there exists a minimal computably enumerable essentially undecidable theory, w.r.t. interpretability. For a more detailed discussion of this problem, we refer the reader to Yong Cheng [2]. It follows from the idea behind the proof of Theorem 4.7 of [2] that the interpretability degrees of computably enumerable essentially undecidable theories have infima. Hence, if there exists a minimal essentially undecidable theory, then that theory is also the minimum essentially undecidable theory.

7 Summary

We have formulated essentially undecidable theories \(\textsf {WD} \subset {\textsf {C}} \subset {\textsf {D}}\) and \(\textsf {WBT} \subset \textsf {BT} \subset \textsf {BTQ}\) (see Fig. 3). The theories \( \textsf {WD} \), \( \textsf {WBT} \) and \( {\textsf {R}} \) are mutually interpretable while the theories \( {\textsf {D}} \), \( \textsf {BTQ} \) and \( {\textsf {Q}} \) are mutually interpretable. The theories \(\textsf {WD} \), \(\textsf {WBT} \) and \( \textsf {BTQ}\) have minimal essentially undecidable axiomatizations, but it not clear whether the same is true of \( {\textsf {D}} \). Both \(\textsf {WD} \) and \({\textsf {D}} \) are purely universally axiomatised.

Fig. 3
figure 3

\(S \rightarrow T\) means S is interpretable in T but S does not interpret T. \(S \leftrightarrow T\) means S and T are mutually interpretable. \(S \dashrightarrow T\) means S is interpretable in T but it is unknown whether S interprets T