PROSPECTS FOR A NAÏVE THEORY OF CLASSES HARTRY FIELD, HARVEY LEDERMAN AND TORE FJETLAND ØGAARD 1. INTRODUCTION1 Let L be any first order language, and L+ be the result of adding to it a 1-place predicate 'Class', a 2-place predicate '∈', an abstraction operator { : }, where for any formula A, {x : A} is a term whose free variables are those that are free in A except for x; and also, if necessary, a new primitive conditional. The naıve theory of classes (over L) consists of the following principles: Abstraction Schema: (1) ∀u1, ...,∀un∀z[z ∈ {x : A(x;u1, ...,un)}⇔ A(z;u1, ...,un)]2 Class-Abstracts: (2) ∀u1, ...,∀unClass({x : A(x,u1, ...,un})) Extensionality Rule: (3) Class(a)∧Class(b)∧∀u(u ∈ a⇔ u ∈ b)  ∀z(a ∈ z⇔ b ∈ z) (The schematic variable A in (1) and (2) is assumed to have as substituends all formulas in the language L+.) The theory also takes universal instantiation and existential generalization to apply even to abstraction terms: informally speaking, it assumes that these are denoting terms. Given the Abstraction Schema, the conclusion of the Extensionality Rule is equivalent to the claim that a and b are intersubstitutable in all contexts. The Extensionality Rule thus guarantees that we can define identity between classes as coextensiveness without giving up on the substitutivity rule for identity. (In fact, there may be reasons to want a stronger form of Extensionality; we'll discuss that in due course.) As is well known, the naıve theory of classes is inconsistent in classical logic (when A⇒ B is defined as ¬A∨B). Indeed, Russell's Paradox shows the classical inconsistency of the Abstraction Schema alone, even without Extensionality. But there are in the literature a variety of non-classical logics in which the Abstraction Schema is demonstrably consistent: in some cases it is negation-consistent, i.e. it never Date: First version: April 12, 2014. This version: November 23, 2014. 1HF and HL contributed equally to the first draft of this paper, containing all of Sections 2-9 and Appendix B in essentially their present form. After reading the earlier draft, TFØ discovered the impossibility result of Section 10 and recognized that incorporating this result required taking Abstraction (not Comprehension) as primitive throughout the paper. HF and HL then jointly wrote Section 10 and rewrote Section 1. All three authors contributed equally to Appendix A. Thanks to an anonymous reviewer for pressing us to clarify a number of points of presentation and substance. 2Some may find it more natural to formulate the naıve theory of classes without a primitive abstraction operator, replacing the Abstraction Schema with a Comprehension Schema ∀u1, ...,∀un∃y[Class(y)∧∀x(x ∈ y⇔ A(x;u1, ...,un))] which is in a certain sense weaker. Since Abstraction clearly implies Comprehension, the consistency results for Abstraction in Sections 2-9 carry over immediately to Comprehension. The difference will only matter for the negative result in Section 10, so we will postpone detailed discussion of the difference until then. 1 PROSPECTS FOR A NAÏVE THEORY OF CLASSES 2 entails both a sentence and its negation, but in all cases it is at least Post-consistent, i.e. it doesn't entail everything. These theories, with Abstraction but not Extensionality, are not theories of "classes", since they do not necessarily allow us to identify all "classes" which have exactly the same members; for this reason, they are better called naıve theories of properties, and the abstraction operator and the '∈' used in the theories should be interpreted as a property-forming operator and the relation of instantiation respectively.3 The project of finding a logic for this naıve theory of properties-with Abstraction but not Extensionality-turns out to be essentially the same as the project of finding a logic compatible with the naıve theory of truth and satisfaction. The non-classical logics in the recent literature on naıve truth thus provide a number of options for strong logics of naıve properties. But what if we want a naıve theory of classes, with Extensionality as well as the Abstraction Schema? In a number of papers (1983; 1989; 2013) and a book (2006), Ross Brady has presented several closely related naıve theories of classes. The theories are very similar, but there are slight differences from one to another in the conditional that they employ. Brady's technique for proving Extensionality to be consistent is quite different from anything used in the standard constructions to show that Abstraction holds; we can no longer simply carry over proofs from the literature on naıve truth and satisfaction. So the question arises: to what extent can Brady's result be generalized to other logics that are known to be adequate for naıve properties (or, for naıve truth and satisfaction)? A difficulty in answering this question is that Brady's presentation of his proof is rather opaque. Our first goal in this paper is to present his result in a simpler, more accessible way; Sections 2 through 6 will be devoted to this task. This new presentation of Brady may or may not count as a "new proof" of his theorem, but it at least makes it easier to see how Brady's result can be generalized to apply to some other logics in the same vicinity as the ones Brady has considered. Section 5 already contains two slight generalizations 3Some of the discussion of "naıve classes" or "naıve sets" in the literature, e.g Gilmore 1967, White 1979 Grišin 1982, and Maddy 1983, is concerned at best with naıve properties in our sense, since they don't have a rule of extensionality. In the case of Gilmore and Maddy, in fact, we don't strictly have even naıve property theory, because there is no⇒ in the language strong enough to deliver the Abstraction Schema. These theories are nonetheless "naıve in spirit", and most naıve property theories work by extending Gilmore's construction to include an appropriate⇒. Maddy at one point (p. 134) suggests adding a primitive '=' to the language and giving a model-theoretic version of Extensionality in terms of it, but even then there is no rule of the language that expresses Extensionality. Moreover, when she considers this option, she restricts Abstraction so that '=' cannot appear in in the scope of { : }: so her suggestion gets "extensionality in spirit" only by giving up on "naıveté in spirit". (We should also note that while Maddy (1983: 134) suggests that the proof that her added = obeys intersubstitutivity is trivial, it certainly didn't seem so to us, as the proof of the Micro-Extensionality Theorem in Section 4 attests.) White 1979 shows that the addition of an axiom of extensionality turns a naıve property theory stated in terms of Łukasiewicz continuum valued logic inconsistent, but the naıve property theory was already ω-inconsistent, so perhaps this is unsurprising. In the case of Grišin, the addition of Extensionality is known to lead to (Post-) inconsistency (see Shirahata 1995 and especially Cantini 2003 for useful discussion); essentially that's because Grišin's logic validates permutation A→ (B→ C) ` B→ (A→ C). In fact, as is shown in Øgaard 2014 (Theorem 15), the rule A→ (B→C),B ` A→C already suffices for triviality in a naıve class theory. The logics we study below will thus not validate this permutation principle. (And, unlike Grišin's, the logics we study will have the full structural rules.) There is in addition a series of papers by Hinnion and others (e.g. Forti and Hinnion 1989, Hinnion 1994, Hinnion and Libert 2003; 2008) that show the consistency of various forms of Extensionality, but with highly restricted Abstraction: it is restricted to formulas in which neither negation, nor a conditional, nor even the abstraction operator itself occur. Because of these restrictions, these theories aren't naıve in our sense. Similarly, Brady 1971 and Brady and Routley 1989 restrict abstraction to the→-free language, and so give up on naıveté . PROSPECTS FOR A NAÏVE THEORY OF CLASSES 3 of Brady, demonstrating both how Brady's theorem applies to a logic which includes a non-contraposable conditional, and how it generalizes to "dynamic" variants of his construction. But the most important generalization comes in Section 9, when we show that Brady's result carries over neatly to certain logics which have a modal-like semantics that uses four-valued, as opposed to three-valued, worlds. (These will include some logics from Bacon 2013a, which turn out to be much closer to Brady's than they initially appear to be, and yield a naıve theory of classes by essentially Brady's argument.) This last generalization has some significant advantages over Brady's original constructions. Most notably, the resultant logic validates a weakening rule (indeed, a weakening axiom) for a non-contraposable conditional → from which the contraposable conditional ⇒ (whose biconditional is used in the laws above) is defined. But we argue that even these improved Brady-like logics are too weak for reasoning about classes. Worse yet, we conclude with an impossibility result which shows fairly decisively that one cannot hope to do significantly better. In the presence of Abstraction, Extensionality introduces new strength: whereas strong conditional logics can be shown to be consistent with naıve theories of truth and properties, the same cannot be said for the naıve theory of classes.4 2. THE GOAL Let L be any first order language, with primitive logical operators ¬, ∧, and ∀;5 for simplicity we'll assume that its only singular terms are variables. We'll also assume that it contains an identity predicate; and for convenience, that it contain a primitive sentence ⊥ to be understood as logically false. We use ∨ and ∃ as metalinguistic abbreviations with the usual definitions, and define > as ¬⊥. (Primitive sentences can be viewed as 0-place predicates, so this will require no addition to the formation rules below.) Let M be any 4Dana Scott (1961) showed that if standard ZF is formulated, as it usually is, using the axiom of Replacement ∀w1, . . . ,wn ∀a [(∀x ∈ a)(∃!y)φ(x,y,w1, . . . ,wn)⊃ (∃b)(∀y)(y ∈ b≡ (∃x ∈ a)φ(x,y,w1, . . . ,wn))], then this ZFREP minus Extensionality is interpretable in Z and hence has weaker consistency strength than standard ZF. This result might seem to show that Extensionality is responsible for adding important strength to standard set theory developed in classical logic. But ZF can equivalently be axiomatized using Collection ∀w1, . . . ,wn ∀a[(∀x ∈ a)(∃y)φ(x,y,w1, . . . ,wn)⊃ (∃b)(∀x ∈ a)(∃y ∈ b)φ(x,y,w1, . . . ,wn)] together with Separation, instead of Replacement, and Harvey Friedman (1973) has shown that any model of the axioms of this ZFCOL other than Extensionality can be transformed into a model of full ZF, including Extensionality. This suggests that Scott's result might be better interpreted as demonstrating the weakness of Replacement (as opposed to Collection) rather than as illustrating the power of Extensionality. (See Hamkins 2014 for helpful discussion.) In our non-classical context, where Abstraction is assumed, the situation is very different. We impose no class-theoretic axioms beyond Abstraction and Extensionality, and since Abstraction is known to be consistent in the logics we consider, the inconsistency result is due entirely to the strength of Extensionality. This is of particular interest since naıve Abstraction yields all the standard principles of ZFCOL minus Foundation and Extensionality: the needed sets can in each case be defined as {x : A(x)} for an appropriately chosen A plus parameters. So ZFCOL minus Foundation and minus Extensionality is consistent with naıve abstraction in the known logics for naıve properties. Extensionality tips the balance into inconsistency. 5L might include the term 'Set', and a predicate '∈set ' that is to be distinguished from the '∈' that is added in the move to L+. The classes to be introduced are then to be conceived, at least initially, as in all cases distinct from sets. (We might later consider an identification of sets with certain classes, e.g. the hereditarily bivalent ones, and an identification of 'x ∈set y' with 'Set(y)∧ x ∈ y'.) PROSPECTS FOR A NAÏVE THEORY OF CLASSES 4 classical normal model for L in which ⊥ is false, where 'normal' means that the extension in M of the identity predicate is {〈o,o〉 : o is in the domain of M}. Let L+ be the result of adding a new 1-place predicate 'Class', a new two-place predicate ∈, a term-forming operator { : }, a binary operator→, and for convenience, a set N of primitive names with the same cardinality as the domain DM of M. The goal will be to extend the classical M to a non-classical model for L+, non-classical in that (though the model is constructed using classical set theory) it validates only the principles of a sublogic of classical logic. It is essential that the sublogics have a primitive conditional: the conditional defined from ¬ and ∨ in these logics will fail to satisfy either Reflexivity (A→ A) or Modus Ponens; because of this, Abstraction and Extensionality stated with the defined conditional will either fail or be too weak to deserve the name. The challenge is to see whether we can give the new conditional a reasonably strong logic, but one which is still consistent with both Abstraction and Extensionality. To meet this challenge, we will consider a family of different methods of constructing, for each classical model M of the old L, a new non-classical model for L+; at least until Section 8, the non-classical models (and resulting logics) will differ only in the treatment of the conditionals (and hence biconditionals) that they employ. (Our focus will be on nonclassical sublogics that have all the classical structural rules. Most will restrict excluded middle while retaining disjunctive syllogism, but we will later consider some that restrict disjunctive syllogism. They will otherwise be standard for the connectives ¬, ∧ and ∨, and for the quantifiers: in particular, double negation elimination and all the deMorgan laws hold.) But in every case, the new non-classical model will have the original M effectively as a submodel; M will model the urelements over which the classes are built.6 (The construction actually works also from the not-quite classical model that has empty domain; this yields pure class theory, i.e. with no urelements.) So the idea is to show that whatever the classical reality, there is a non-classical extension of it with naıve classes. (Many of the classes in the models we'll construct will behave classically; the non-classicality arises only for "pathological" classes such as the class of all non-self-membered classes.) In more detail: The formation rules for L+ are what one would expect: The 0-terms are the variables and the primitive names; For each n, the atomic n-formulas are the result of (for some k) applying a k-place predicate (whether in the ground language or '∈' or 'Class') to k n-terms; For each n, we build up n-formulas from atomic n-formulas using ¬, ∧, → and ∀ (together with a variable) in the usual way; The (n+1)-terms are the variables and the primitive names together with anything of form {x : A}where x is a variable and A an n-formula. (So if m > n, all n-terms are m-terms and all n-formulas are m-formulas.) A term is anything that is an n-term for some n, and similarly for formulas. (Intuitively, an n-formula is a formula in which the largest number of nestings of abstracts is at most n.) Free occurrence of a variable in a term or formula is defined inductively in the obvious way (the free occurrences of variables of the term {x : A(x)} are the free occurrences of variables other than x in A). A term or formula with no free occurrences of variables is 6It follows that for any sentence A of the ground language, if we "translate" it into L+ by restricting all its existential quantifiers by the condition '¬Class(x)∧ ...' and all its universal quantifiers by 'Class(x)∨ ...' , then the translation has the same value in M+ that the original had in M. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 5 called closed. An abstract is a term (not necessarily closed) that is not a primitive name or variable. We will construct our models in two stages. First (Sections 3-5) we will construct a preliminary model M+ whose domain is DM ∪{closed abstracts of L+}. (Each closed abstract can be regarded as naming itself, and each member of DM will be regarded as named by a member of N; given this, we can treat quantification as substitutional.) Later (Section 6) we'll move to a "contracted" model M+/ ≈ whose domain is DM ∪{ equivalence classes of closed abstracts of L+}, under a suitable equivalence relation ≈. It is only in the contracted models that it is appropriate to think of what we are adding to DM as classes; until we do the contraction, we should think of the new models as containing the things in the old model together with class-representatives, where each class will have many representatives. (So the predicate 'Class', which applies to the new objects, is a bit misleading in the case of the uncontracted model.) To repeat, there will be a family of different constructions of models M+, leading to different logics for→; and for each M+ there will be a corresponding M+/≈. The model M+ will be such that the predicates of the original language L are classical and have the same extension as in the original model. (So M+ is in a sense a non-classical extension of M.) In particular, in M+ the identity predicate of L is only a predicate of identity-restricted-to-DM; so to avoid confusion, we'll write the identity predicate of L as '=L'. We'll eventually want to define identity in L+ by x = y iff (x =L y)∨ [Class(x)∧Class(y)∧∀u(u ∈ x⇔ u ∈ y)], where ⇔ is as defined below. This obviously coincides with '=L' in the domain of M; later on (Section ??) we'll show that it satisfies the principles of identity appropriate to the non-classical logic. But until we contract M+, this won't behave as an identity predicate in our model, so to avoid the danger of confusion we prefer to avoid talk of class-identity until we get to M+/≈. The goal is that whatever the starting classical model, the new model will "validate" (in a sense to be explained) the Abstraction Schema and the Extensionality Rule. Once we've defined identity as above, this last will give us a = b  ∀z(a ∈ z⇔ b ∈ z). which, together with Abstraction implies the substitutivity of identity. Here A⇒ B is short for (A→ B)∧ (¬B→¬A) (and A⇔ B for (A⇒ B)∧ (B⇒ A)), so that (given the redundancy of double negation and minimal laws for conjunction) it is guaranteed to be contraposable, i.e. A⇒ B is equivalent to ¬B⇒¬A. We could have assumed that the primitive → is itself contraposable, in which case ⇒ would coincide with it and we could state Abstraction and Extensionality using the corresponding biconditional ↔. But it is more general not to make that assumption, and to use⇒ and⇔ defined as above in formulating Abstraction and Extensionality. (Brady himself uses only a contraposable conditional in his constructions, so his '→' is our '⇒'. In this respect our constructions will be more general than Brady's, but they are so closely modeled after his that their conditionals are reasonably called "non-contraposable Brady conditionals".) What about a conditional form of extensionality? As we'll see, one of Brady's constructions delivers (in our⇒ notation)  ∀x∀y[Class(x)∧Class(y)∧∀u(u ∈ x⇔ u ∈ y)⇒∀z(x ∈ z⇔ y ∈ z)], but what are arguably the more satisfactory ones do not. However, all of them yield  ∀x∀y[Class(x)∧Class(y)∧∀u(u ∈ x⇔ u ∈ y)→∀z(x ∈ z⇔ y ∈ z)]. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 6 This last axiom is one benefit of taking the non-contraposable→ as primitive: it gives us an axiom form of the substitutivity of identity, and not just the weaker rule form (we'll discuss this further in Section 6). Still, we suspect that getting this stronger axiom form depends on particular features of the models we'll consider, and we'd be willing to settle for the rule form, if doing so allowed better laws for the conditionals.7 Another more substantial reason for taking a non-contraposable conditional as primitive will emerge in Sections 8 and 9. 3. STATIC AND DYNAMIC MICRO-CONSTRUCTIONS Brady uses a three-valued modal-like semantics for his proofs that Extensionality is consistent with Abstraction. We'll work in that framework until Section 8, when we'll start to move toward a 4-valued generalization. We call the values 0, 1/2 and 1. The cardinality of the set WM of worlds depends on the cardinality |M| of the ground model M, but this won't matter for our purposes. One world @M of WM is designated, in that an inference is M-valid if and only if: if the premises have value 1 at @M , so does the conclusion. An inference is valid if and only if it is M-valid for all ground models M. The value of a conditional at a given world is determined, at least in part, by the values of its antecedent and consequent at other worlds. In particular, for each world w, facts about the other worlds determine a function vw (a prevaluation) that maps each conditional sentence A→ B into one of the values in {0,1/2,1}.8 In this section and the next, we will leave the prevaluation vw associated with world w a black box. In this section the task is to show how, given the prevaluation vw at a world w and the underlying ground model M, we determine the values of every sentence of L+ at w; in particular the values of sentences containing "∈". To this end, we adapt the inductive procedure of Gilmore (1967; 1974) and Kripke (1975). The valuation procedure (which we will call the "micro-construction") works by first assigning a value |A|w,σ in {0,1/2,1} to each sentence A at each ordinal level σ ; we then show that as σ gets bigger we eventually reach an ordinal Ψw after which the values of sentences don't change, and those "fixed point values" |A|w,Ψw will be regarded as the values |A|w (without an ordinal subscript) of the sentences at w. In the next section, we show that this procedure gives a value-theoretic result that we call "Micro-Extensionality", which will be the basis of our later results on the consistency of Extensionality given Brady's method of constructing prevaluations. In Section 5 we will finally turn to Brady's method of constructing prevaluations (we'll call this the "macroconstruction"), and show how the Micro-Extensionality Theorem can be used to establish the consistency of Extensionality with Abstraction in Brady's conditional logics. Since in this section and the next, our focus is on an arbitrary world whose prevaluation is given; we'll use v for the prevaluation at the world in question. Since at each world the value of every sentence is determined by the associated prevaluation together with the ground model, we will sometimes write |A|v,M , or even |A|v (since we'll be holding the 7It is natural to put extensionality conditionally: "If a and b are classes with the same members then they are members of the same classes." But this uses the ordinary English 'if....then', which is not well modeled by the non-contraposable Brady '→'. A more complicated language with two primitive conditionals might be required to yield an appropriate conditional formulation. 8Every object in the domain of M+ will be the denotation (in M+) of either a new name in N or of a closed class abstract. Were it not for this, we'd need to complicate the description of the various vw: each would map, into {0,1/2,1}, ordered pairs whose first member is a conditional formula A→ B and whose second member is an assignment s of objects to variables, with the condition that if s and s∗ agree in all variables free in A→ B then vw(〈A→ B,s〉) = vw(〈A→ B,s∗〉). PROSPECTS FOR A NAÏVE THEORY OF CLASSES 7 classical "ground model" M fixed), instead of |A|w,M .Thus, we sometimes also speak of the micro-construction over v. Until Section 8, we will stick to micro-constructions that generate the Strong Kleene logic for the connectives ¬, ∧ and ∀ (and ∨ and ∃, which are defined from the others in the usual way), and use only minimal fixed points of the micro-construction. Even with this restriction, there is more than one possible way to adapt Gilmore and Kripke. The simplest-which Brady uses-we call the static micro-construction. "Static" here means that the values of conditionals don't change during the construction: they are simply the values given by the prevaluation v. (Within each static micro-construction, the conditionals behave essentially like atomic formulas.) This static construction goes as follows: For any ordinal σ : (1) If p is an atomic k-place predicate of the ground language and t1, ..., tk are closed terms, |p(t1, ..., tk)|v,σ is 1 if 〈t1, ..., tk〉 is in the extension of p in the ground model M, and 0 otherwise; (2) |Class(t1)|v,σ is 1 if t1 is a closed abstract, 0 if it is a name in N; (3) If t1 and t2 are closed terms and t2 isn't a class abstract, |t1 ∈ t2|v,σ is 0; (4) |A→ B|v,σ is just v(A→ B); (5) |¬B|v,σ is 1−|B|v,σ ; (6) |A∧B|v,σ is min{|A|v,σ , |B|v,σ}; (7) |∀xA|v,σ is min{|A(t/x)|v,σ : t is a closed term of L+}; (8) If t1 is a closed term and t2 is {x : A(x)}, |t1 ∈ t2|v,σ is (a) 1 iff (∃ρ < σ)(∀τ in the interval [ρ,σ))(|A(t/x)|v,τ = 1) (b) 0 iff (∃ρ < σ)(∀τ in the interval [ρ,σ))(|A(t/x)|v,τ = 0) (c) 1/2 otherwise. This inductive definition yields a value for every sentence at every σ , with the main induction on σ and a sub-induction on the complexity of the sentence. The important feature of this procedure is that it is "monotonic in the information order": if a sentence gets value 1 or 0 at any σ , it gets that same value at all larger τ; the only transitions in value as σ increases are from 1/2 to 0 and from 1/2 to 1. Slightly more formally, Definition 3.1. Let |A|v,σ 6K |A|v,τ mean that if |A|v,σ is 1 then so is |A|v,τ , and similarly if |A|v,σ is 0 then so is |A|v,τ . Then the key monotonicity lemma states: (RM): If σ < τ then for all sentences A, |A|v,σ 6K |A|v,τ . (The details of the argument for (RM) are available in many places, e.g. Kripke 1975.)9 9We haven't added a truth or satisfaction predicate to L+, but could easily have done so, provided the ground language is rich enough to encode syntax. We could ensure the naıveté of True or Sat, relative to a particular coding of syntax, within the same Gilmore-Kripke micro-construction as we used for classes. Focusing just on 'True' for simplicity, we'd use the following valuation rules: If t is not the Gödel number of an L+ sentence, then |True(t)|v,σ = 0; If t is the Gödel number of the L+ sentence A, then |True(t)|v,σ =  1 iff (∃ρ < σ)(∀τ in the interval [ρ,σ))(|A|v,τ = 1) 0 iff (∃ρ < σ)(∀τ in the interval [ρ,σ))(|A|v,τ = 0) 1/2 otherwise. The simultaneous construction of valuations for truth and membership clearly won't affect the key monotonicity lemma RM in either case. So the construction would give us a fixed point where both truth and class membership are naıve. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 8 Given (RM), two things follow. First, the rule for ∈ simplifies: (8∗) If t2 is {x : Ax}, |t1 ∈ t2|v,σ is (a) 1 iff (∃ρ < σ)(|A(t/x)|v,ρ = 1) (b) 0 iff (∃ρ < σ)(|A(t/x)|v,ρ = 0) (c) 1/2 otherwise. Second, and of crucial importance, cardinality considerations ensure that there is a point Ψv past which σ can never change, yielding "final values" for each sentence at each world. Letting |A|v abbreviate |A|v,Ψv , this tells us that for any closed t and any A with no variables beyond x free, (FP): |t ∈ {x : A}|v = |A(t/x)|v. By contrast to the static construction, dynamic micro-constructions allow for certain changes in the values of conditionals as the micro-construction proceeds. In the dynamic constructions the function v still plays a role in determining the value of conditionals, but unlike in the static construction, it is not the whole story; the value of a conditional is now in part determined by the values of its antecedent and consequent. Of course, one can only allow limited forms of changes during the micro-construction if monotonicity is to be preserved (and to give up monotonicity would be to give up the central idea of the construction). But here's one useful example of a dynamic construction (and the only one we'll consider in detail in the present three-valued setting): we keep everything the same except that we replace the valuation rule for→ by |A→ B|v,σ =  1 iff v(A→ B) = 1 0 iff v(A→ B) = 0, or [v(A→ B) = 1/2 and |A|v,σ = 1 and |B|v,σ = 0] 1/2 otherwise; that is, iff v(A→ B) = 1/2 and (|A|v,σ < 1 or |B|v,σ > 0). This modification in the rules for the conditional doesn't prevent us from arguing inductively that once a sentence has value 1 it retains that value throughout the Kripke-Gilmore construction, and the same for 0. Given this, the fixed point argument goes through as before. That is, (FP) holds for the dynamic construction as well as the static. Let a formula of form t ∈{x :C(x;u1, ...,uk)} and its corresponding formula C(t;u1, ...,uk) be basic equivalents. Call a prevaluation v transparent if for any sentences A and B, and any A∗ and B∗ obtainable from A and B respectively by sequences of substitutions of basic equivalents, v(A→ B) = v(A∗→ B∗). Then we have Intersubstitutivity Corollary: If a sentence A is obtainable from a sentence B by a sequence of substitutions of basic equivalents, and v is transparent, then A and B have the same value at the fixed point over v. (This is immediate from (FP) and the valuation rules for the static and dynamic microconstructions. Without the transparency assumption about v, all we could conclude from (FP) is that this holds when the only substitutions are outside the scope of an→.) Now call a prevaluation v reflexive if v(C→C) = 1 for every sentence C. Corollary on Abstraction: If v is transparent and reflexive, it gives value 1 to each instance of the Abstraction schema. (Reflexivity together with transparency lead by the Intersubstitutivity Corollary to v(t ∈{x : A}→ A(t/x)) = v(A(t/x)→ t ∈ {x : A}) = 1 for each t, and so |t ∈ {x : A}↔ A(t/x)|v = 1. Since quantification is treated substitutionally we get Abstraction immediately.) PROSPECTS FOR A NAÏVE THEORY OF CLASSES 9 As we'll see, there are plenty of transparent and reflexive prevaluations. A trivial example is the prevaluation that assigns value 1 to every conditional. This trivial prevaluation, in fact, even validates Extensionality, in both rule and conditional forms. So if the task were merely to validate both Abstraction and Extensionality, we'd be done. But of course we also want the conditional to obey reasonable laws, for example Modus Ponens, which this valuation fails to deliver. Before describing Brady's treatment of the conditional (which we'll call the "macroconstruction"), we prove an important theorem about the micro-constructions. This "MicroExtensionality Theorem" is really the heart of Brady's extensionality result. 4. THE MICRO-EXTENSIONALITY THEOREM Let a and b be closed abstracts. If v is a prevaluation, call it • 〈a,b〉-congruent if for all formulas C(x) and D(x) with no variables other than x free, v(C(a)→ D(a)) = v(C(b)→ D(b)); • 〈a,b〉-extensional if for every closed L+ term t, |t ∈ a|v = |t ∈ b|v • strongly 〈a,b〉-congruent if |a ∈ t|v = |b ∈ t|v for every closed L+ term t. For transparent v, we could equivalently say: • strongly 〈a,b〉-congruent if for all formulas C(x) with no variables other than x free, |C(a)|v = |C(b)|v. So in the case of transparent v, strong 〈a,b〉-congruence entails ordinary 〈a,b〉-congruence, at least in the case of the static micro-construction. Theorem (Micro-Extensionality Theorem). If v is transparent, 〈a,b〉-congruent and 〈a,b〉extensional, it is strongly 〈a,b〉-congruent. This theorem holds for both static and dynamic micro-constructions. To establish it, it's convenient to reformulate it. Let Σa,b,v be the set of formulas A(x) with no free variables other than x such that for the final values |A(a)|v and |A(b)|v, |A(a)|v 6= |A(b)|v. (|A(a)|v 6= |A(b)|v would not be possible unless x were free in A(x), so we might as well have said that x and only x is free in A(x).) Then a further reformulation of the claim that v is strongly 〈a,b〉-congruent is that Σa,b,v = /0. Given this way of stating the claim, the micro-extensionality theorem can be put as: for any prevaluation v that is transparent and 〈a,b〉-congruent, (1) If Σa,b,v 6= /0 then v is not 〈a,b〉-extensional. (For the remainder of this section, we will assume that v is a transparent, 〈a,b〉-congruent prevaluation.) With a few more definitions, we'll be able to simplify this statement even further. For each sentence B, let μv(B) be the first level of the Kripke construction at which B assumes its final value in the construction based on the prevaluation v. (So if the final value of B is 1/2 in that construction, μv(B) is 0.) We'll call this the level of B (relative to v). For any σ , let Σa,b,v,σ be the set of formulas A(x) in Σa,b,v such that at least one of A(a) and A(b) has level 6 σ and also has value in {0,1} at the fixed point of the microconstruction. (The second requirement ensures that, at σ , A(a) and A(b) already have different values even if one of them does not yet have its final value.) Obviously if |A(a)|v 6= |A(b)|v then one of them has value 1 or 0 at some σ , so we can rephrase (1) as (1∗) If ∃σ(Σa,b,v,σ 6= /0) then v is not 〈a,b〉-extensional. We break the proof of this claim into two lemmas. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 10 Lemma 4.1. ∀σ [ if Σa,b,v,σ 6= /0 then Σa,b,v,σ contains formulas of form t(x) ∈ x]. Proof. Assuming that ∃σ(Σa,b,v,σ 6= /0), let δa,b,v be the smallest ordinal σ such that Σa,b,v,σ 6= /0. When σ < σ∗, Σa,b,v,σ ⊆ Σa,b,v,σ∗ by definition; so the lemma will be established if we establish the instance where σ is δa,b,v. (For the remainder of the proof, a,b,v will remain fixed, so we'll suppress mention of them hereafter; thus |A| without any subscripts will mean the value of A at the fixed point of the micro-construction over v.) We establish this claim by establishing its contraposition, which we prove by induction on complexity. That is, we establish that: if no formula of form t(x) ∈ x is in Σδ then Σδ = /0, making use of the fact that for all ρ < δ , Σρ = /0. (That is, for any ρ < δ and any B(x), if |B(a)|ρ ∈ {0,1} then |B(b)|ρ = |B(a)|ρ , and similarly with a and b reversed.) Atomic formulas B(x) with at most x free are either: (i) Formulas of the ground language; (ii) Formulas of form Class(t(x)); (iii) Formulas of form t(x) ∈ n where n is a name in N (iv) Formulas of form t(x) ∈ {y : B(x,y)} (v) Formulas of form t(x) ∈ x. (We're allowing that t(x) not contain x free, and that B(x,y) not contain x free and/or not contain y free; it's just that no variables other than those displayed can be free in these expressions.) No formulas of form (i) can be in any Σσ : a and b are abstracts, so when x is free in B(x) for ground-language atomic B, |B(a)| = |B(b)| = 0 (and when x isn't free in B(x), B(a) and B(b) are the same sentence). Similarly for case (ii): the only terms in which a and b occur are terms for classes, so |Class(t(a))|= |Class(t(b))|= 1. Similarly for case (iii): if n ∈ N, then |t ∈ n|= 0 for any closed term t. As for (iv), suppose t(x)∈{y : B(x,y)} is in Σδ ; then at least one of |t(a)∈{y : B(a,y)}|δ and |t(b) ∈ {y : B(b,y)}|δ is in {0,1}, and we can suppose WLOG that the first is. But if |t(a) ∈ {y : B(a,y)}|δ = 1 then there are ρ < δ such that |B(a, t(a))|ρ = 1 (and hence |B(a, t(a))|= 1). By choice of δ , this requires that |B(b, t(b))|= 1. But then |t(a) ∈ {y : B(a,y)}|= |t(b) ∈ {y : B(b,y)}|δ = 1, which contradicts the supposition that t(x)∈ {y : B(x,y)} is in Σδ . The analogous argument holds for |t(a) ∈ {y : B(a,y)}|δ = 0; so no formula of form (iv) is in Σδ . Finally, in case (v), by hypothesis no formulas of this form are in Σδ . So, putting the cases together, no atomic formulas are in Σδ . But then the result holds for non-atomic formulas too: for it's clear that (a) if B(x) isn't in Σδ , ¬B(x) can't be either; (b) if neither B(x) nor C(x) is in Σδ , B(x)∧C(x) can't be; (c) if for all closed terms t, B(t,x) isn't in Σδ , ∀yB(y,x) can't be. What about the conditional? Here we appeal to the 〈a,b〉-congruence of v. On the static micro-construction, that directly yields (dstatic) B(x)→C(x) can't be in Σδ . But even on the dynamic, the 〈a,b〉-congruence of v yields that a difference between |B(a) → C(a)|δ and |B(b) → C(b)|δ requires a difference either between |B(a)|δ and |B(b)|δ or between |C(a)|δ and |C(b)|δ , so we have (d) if B(x) and C(x) aren't in Σδ , then B(x)→C(x) can't be.  PROSPECTS FOR A NAÏVE THEORY OF CLASSES 11 The proof in fact shows that if Σa,b,v 6= /0 then the only atomic members of Σa,b,v,δa,b,v are of form t(x) ∈ x. Lemma 4.2. Let δa,b,v be as in the proof of of the preceding Lemma. Then if Σa,b,v,δa,b,v contains a formula of form t(x)∈ x then v is not 〈a,b〉-extensional (and in particular, either |t(a) ∈ a|v 6= |t(a) ∈ b|v or |t(b) ∈ a|v 6= |t(b) ∈ b|v). Proof. (Once again, we suppress mention of a,b,v in the proof.) If t(x) ∈ x is in Σδ , then at least one of |t(a) ∈ a|δ and |t(b) ∈ b|δ is in {0,1}, and we can suppose WLOG that the first is. So either (i) |t(a) ∈ a|δ = 1 and |t(b) ∈ b|< 1 or (ii) |t(a) ∈ a|δ = 0 and |t(b) ∈ b|> 0. Since a is a closed abstract, we can write it as {y : A(y)}. In case (i), the first conjunct implies that |A(t(a))|ρ = 1 for some ρ < δ ; by choice of δ , Σρ must have been empty, so |A(t(b))|= |A(t(a)|= 1 and hence |t(b) ∈ a|= 1. But this and the second conjunct implies that |t(b) ∈ a| 6= |t(b) ∈ b|, so we have a violation of 〈a,b〉-extensionality, as required. Case (ii) is similar. (Of course if |t(b) ∈ b|δ rather than |t(a) ∈ a|δ was in {0,1}, it would have been t(a) rather than t(b) that provided the counterinstance to 〈a,b〉-extensionality.)  Given Lemmas 4.1 and 4.2, the Micro-extensionality Theorem is immediate: 〈a,b〉extensionality rules out Σδ ,a,b,v containing any formula of form t(x) ∈ x (Lemma 4.2), and that entails that Σδ ,a,b,v is empty (by Lemma 4.1). At this point we might be tempted to follow a suggestion of Maddy's mentioned in note 3: introduce a new primitive = into the language, not entering into the micro-construction, and extend the fixed point valuation to include it by the condition that |a = b|v = 1 if and only if either a,b ∈ N and a = b in the original model M, or a,b are closed abstracts, and ∀x(|x ∈ a|v = |x ∈ b|v). (There are several choices for the 0 clause.) The MicroExtensionality Theorem demonstrates that the = defined in this way will validate every instance of the schematic rule a= b,φ(a)  φ(b).10 But this trick won't give us a genuinely naıve theory of classes: since this manner of introducing = into the language does not allow for its appearance in formulas that occur in the term-forming operator, full Abstraction (and even Comprehension, see note 1) is lost. To have naıveté together with Extensionality will take something more. 5. BRADY'S MACRO-CONSTRUCTION(S), AND EXTENSIONALITY Now, finally, we introduce the treatments of the conditional in Brady 1983 and 2006, along with some minor variants.11 In each of these constructions, the space of "worlds" 10That rule is a rather weak form of substitutivity, and gives rise to transitivity only in the very weak form a = b,b = c  a = c, but the absence of a useful conditional in the Strong Kleene logic makes these the strongest forms of substitutivity and transitivity available there. 11Brady 1989 is a dialetheic theory, so we discuss it only later, in fn. 29. In a recent paper (2013), Brady uses "metavaluations" to provide some new meta-logical results concerning logics which support naıve class theory; his results divide into two parts, one concerning what he calls "M1" logics, and the other concerning "M2" logics. In M1 logics, no negations of conditionals are valid (that is, for every A and B, 6` ¬(A→ B)); this renders them of little interest for the present project. (To mention just one reason: a satisfactory logic for reasoning about classes should allow us to say that it is not the case that everything PROSPECTS FOR A NAÏVE THEORY OF CLASSES 12 is well-ordered: we can label the worlds by an initial segment of the ordinals. For every ordinal α , each "macro-construction" assigns to the world labeled by α a "prevaluation" vα , and we will write |A|α for the value of A at world α , that is, the value of A at the minimal fixed point of the micro-construction over vα . At any world α , the prevaluation vα assigns to each conditional a value determined entirely by the values of its antecedent and consequent at worlds prior to it in the well-ordering (i.e., labeled by prior ordinals). So in both static and dynamic constructions, whether a conditional gets value 1 at a world α is determined entirely by the values of its antecedent and consequent at prior worlds. (At stage 0 all conditionals get value 1 in both static and dynamic constructions.) In the static construction, the same is true for value 0; in the dynamic, whether it gets value 0 at α is partly determined by the values of antecedent and consequent at prior worlds, but may also be determined in part by the values of the antecedent and consequent at α itself. In the static constructions, as we'll see, the values of conditionals decrease or remain the same as the ordinals increase. Brady himself uses static constructions, and a contraposable conditional that we'll write as⇒. In all cases his 1-clause is: (1⇒) |A⇒ B|α = vα(A⇒ B) = 1 iff (∀β < α)(|A|β 6 |B|β ). For the 0-clause (which we'll write using vα so as to allow consideration of dynamic variants) Brady 1983 uses (0⇒A) vα(A⇒ B) = 0 iff (∃β < α)(|A|β = 1 and |B|β = 0). Brady 2006, by contrast, takes⇒ to be bivalent; there he uses the 0-clause (0⇒B) |A⇒ B|α = vα(A⇒ B) = 0 if (∃β < α)(|A|β > |B|β ), i.e. iff |A⇒ B|α 6= 1. (In this case there's no room to distinguish the dynamic construction from the static.) The details of the 0-clause will make no difference to the proof that the Extensionality Rule is sound on the models: all that matters is that the right hand side be incompatible with the right hand side of the 1 clause and have the form (∃β < α)Θ(β ) where Θ(β ) does not contain α free. As we mentioned earlier, it is more general to introduce a non-contraposable →, and define ⇒ from it, so that A⇒ B means (A→ B)∧ (¬B→ ¬A). For this we use as the 1-clause (1→) |A→ B|α = vα(A→ B) = 1 iff (∀β < α)(if |A|β = 1 then |B|β = 1). (1→) implies (1⇒), given the definition of⇒. As we'll see, the extensionality proof will also go through in this more general setting. in the universal class belongs to the empty class, that is, it should include ` ¬∀x(x ∈ {y : >} → x ∈ {y : ⊥}).) The 2013 paper also appears to have a number of errors in the proofs: for instance, Brady claims as one of the main logical achievements of the paper that his construction for M1 logics allows the extension of his consistency proof to logics which include the "special permutation" axiom (A→ (B→ (C→ D)))→ (B→ (A→ (C→ D))) along with Conjunctive Syllogism. But this is in fact impossible, since it is shown in Øgaard 2014 (Corollary 1) that if one adds even the weaker rule form of special permutation A→ (B→ (C→ D)) ` (B→ (A→ (C→ D))) to any logic with conjunctive syllogism (satisfying very minimal requirements that Brady's do satisfy), then one can derive the special contraction rule A→ (A→ (A→ B)) ` A→ (A→ B). As Øgaard remarks, that rule (like ordinary contraction) is enough to trivialize even naıve truth theory, which has no analog of Extensionality. While Brady offers new meta-logical results for both M1 and M2 logics, the M2 logics he studies are essentially the ones from his earlier work. The main difference in the new models is that the conditionals are taken to have starting value 1 if and only if they are theorems of a pre-specified deductive system. This modification means that, like some of our "dynamic" constructions, they avoid certain undesirable laws, although Brady does not state which exactly these new non-laws are. For this reason, and since the errors in the paper make it difficult to know what can be shown using Brady's new technique, we do not consider it further. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 13 What about the 0-clause for →? Once again, for the extensionality rule it makes no difference, beyond the constraints mentioned in connection with ⇒. Here are two possibilities: (0→A) vα(A→ B) = 0 iff (∃β < α)(|A|β = 1 and |B|β = 0). (0→B) vα(A→ B) = 0 iff vα(A→ B) 6= 1, i.e. iff (∃β < α)(|A|β = 1 and |B|β < 1). With⇒ defined as above, these rules induce the above rules for⇒. A third possibility is (0→C) vα(A→ B) is never 0, i.e. vα(A→ B) = 1/2 iff (∃β < α)(|A|β = 1 and |B|β < 1). This wouldn't be very interesting in connection with the static micro-construction, but in connection with the dynamic construction we outlined earlier it yields: |A→ B|α =  1 iff (∀β < α)(if |A|β = 1 then |B|β = 1) 0 iff (∃β < α)(|A|β = 1 and |B|β < 1) and |A|α = 1 and |B|α = 0. In this case, |A⇒ B|α is 0 if and only if (∃β < α)(|A|β > |B|β ) and |A|α = 1 and |B|α = 0. We'll confine our consideration of dynamic micro-constructions to (0→C). (A dynamic construction with (0→B) would coincide with the static, and with (0→A), the dynamic clause mentioned earlier wouldn't produce interestingly different results.) The crucial fact about all these Brady-like constructions is that whatever the 0-clause (provided it meets the constraints above), we have (∗) if α < β then for all A and B, vβ (A→ B)6 vα(A→ B). (6 is the normal numerical order, as opposed to the information order 6K used in the micro-constructions. In the case of the static constructions this means that |A→ B|β 6 |A→ B|α , but in the dynamic with (0→C) it is not ruled out that A→ B have value 0 at one stage and 1/2 at a later stage.) (∗) implies that we eventually reach a fixed point where increasing α makes no difference to the values of sentences. We take this fixed point ΩM to be the previously mentioned @, and we define a sentence to be M-valid if it takes value 1 at ΩM , and an inference to be M-valid if it preserves 1 there. Validity for sentences is then defined as taking value 1 at ΩM for all ground models M; similarly, an inference is valid if it preserves value 1 at ΩM for all ground models M. The ordinal ΩM associated with this fixed point value may differ for different ground models M but in what follows, we'll often speak as if M has been fixed, and refer to the fixed point simply as Ω. At this Ω, the 1-clause for→ yields: (BFP1) vΩ(A→ B) = 1 iff (∀β )(if |A|β = 1 then |B|β = 1). Since vΩ is one of the vβ , we can immediately conclude Corollary. If vΩ(A→ B) = 1 and |A|Ω = 1 then |B|Ω = 1. Even in the dynamic, |A→ B|α = 1 iff vα(A→ B) = 1, so we can replace the first conjunct of the antecedent by |A→ B|Ω = 1. Thus given our definition of validity we have: Modus Ponens: A,A→ B  B. We also get a fixed point result for 0, though it depends on the 0-clause we use. For the three listed we have (BFP0A) vΩ(A→ B) = 0 iff (∃β )(|A|β = 1 and |B|β = 0); (BFP0B) vΩ(A→ B) = 0 iff vΩ(A→ B) 6= 1, i.e. iff (∃β )(|A|β = 1 and |B|β < 1); PROSPECTS FOR A NAÏVE THEORY OF CLASSES 14 (BFP0C) vΩ(A→ B) 6= 0. Since vΩ is one of the vβ , a corollary of (BFP0A) is that if |A|Ω = 1 and |B|Ω = 0 then vΩ(A→B)= 0 and hence |A→B|Ω = 0. (BFP0B) gives the stronger result, that if |A|Ω = 1 and |B|Ω < 1 then vΩ(A→ B) = 0 and hence |A→ B|Ω = 0. (BFP0C) obviously yields no result guaranteeing vΩ(A→ B) = 0; however, given that it will be used only with the dynamic construction, here too we have that if |A|Ω = 1 and |B|Ω = 0 then (vΩ(A→ B) 6= 1 and hence) |A→ B|Ω = 0. So each of the constructions (provided that 0-clause C is used only with the dynamic) validates not only Modus Ponens but also Contra-Modus-Ponens: A,¬B  ¬(A→ B). Note that both the (0→A) of Brady 1983 and the (0→B) of Brady 2006 validate the rather odd law ¬(> → ¬(A → B)): since all conditionals have value 1 at the starting valuation of the macro-construction, their negations are guaranteed to have value 0, while > has value 1 there; so>→¬(A→ B) will have value 0 from stage 1 onward. (0→C) with the dynamic construction avoids this odd consequence: the "bad" starting valuation merely forces vα(>→ ¬(A→ B)) to have value 1/2 when α is at least 1, and the dynamic clause leaves it there unless A→ B itself gets value 1. So the logic resulting from the dynamic has at least this advantage over the static. The reader who would like a clearer sense of the mechanics of these constructions should take a look at the attached footnote, where we justify the validity of one important law.12 Taking stock: we have given a common two-part fixed point construction governing a variety of different conditionals (both contraposable and non-contraposable; and with different 0 clauses, some static and some dynamic). The first part is a micro-construction focused on the membership relation; the second is a macro-construction for the conditional. The two-part construction turns an arbitrary classical base model for the ground language into a 3-valued modal model for the enlarged language. (So far we're dealing with "uncontracted" models, in which distinct closed abstracts denote different things even if the model declares them coextensive.) We define validity in terms of preservation of the value 1 at the "base world" ΩM (for every formula relative to an assignment of values to the free variables), in every model M. In Section 4 we proved the important micro-extensionality theorem for the dynamic and static micro-constructions. From this, we get a unified proof of Brady's core result for all of these constructions: Theorem 5.1 (Brady Extensionality Theorem). In all of the constructions, we have the rule (1) Class(a)∧Class(b)∧∀u(u ∈ a⇔ u ∈ b)  ∀z(a ∈ z⇔ b ∈ z); 12Consider the rule Prefixing for the contraposable conditional⇒: A⇒ B  (C⇒ A)⇒ (C⇒ B). (The proof of the analogous rule for the non-contraposable is analogous.) If |A⇒ B|Ω = 1, then vΩ(A⇒ B) = 1 and hence by (BFP1), for all α |A|α 6 |B|α . We want to show that for all α we also have that |C⇒ A|α 6 |C⇒ B|α ; that is, (i) if |C⇒ A|α = 1 then |C⇒ B|α = 1, and (ii) if |C⇒ B|α = 0 then |C⇒ A|α = 0. The antecedent of (i) requires that for all β < α , |C|β 6 |A|β , hence since |A|β 6 |B|β for any β , including those which are smaller than α , it's clear that for all β < α , |C|β 6 |B|β , which gives us that |C⇒ B|α = 1 as required. The antecedent of (ii) can never hold for a static conditional with 0-clause (C). For a static with 0-clause (A) it requires that for some earlier β , |C|β = 1 and |B|β = 0; but since |B|β > |A|β for all such β , we also have that |C|β = 1 and |A|β = 0, guaranteeing that |C⇒ A|α = 0, as required. For a static with 0-clause (B) the proof exactly analogous. Making the conditional dynamic in any of the ways contemplated won't change anything, since the assumption of the rule still gives us that vΩ(A⇒ B) = 1, and that guarantees that the value of B is at least that of A at every β .) PROSPECTS FOR A NAÏVE THEORY OF CLASSES 15 indeed, we have the following non-contraposable conditional form: (2)  ∀x∀y[Class(x)∧Class(y)∧∀u(u ∈ x⇔ u ∈ y)→∀z(x ∈ z⇔ y ∈ z)]. And the version with (0→B) even yields  ∀x∀y[Class(x)∧Class(y)∧∀u(u ∈ x⇔ u ∈ y)⇒∀z(x ∈ z⇔ y ∈ z)].(3) Proof. To prove (2), we need that if a and b are closed abstracts, then for all α , if |∀u(u ∈ a⇔ u ∈ b)|α = 1 then |∀z(x ∈ z⇔ y ∈ z)|α = 1. And for each α , the antecedent holds iff for all u and all β < α , |u ∈ a|vβ = |u ∈ b|vβ , i.e. iff all such vβ are 〈a,b〉-extensional; whereas the consequent holds iff for all z and all β < α , |a ∈ z|vβ = |b ∈ z|vβ , i.e. iff all such vβ are strongly 〈a,b〉-congruent. So what we need is (*) for all α , if all vβ with β < α are 〈a,b〉-extensional then they are all strongly 〈a,b〉-congruent. It is clear by induction that each vβ is transparent. So Micro-Extensionality guarantees that for each β and each a and b, if vβ is 〈a,b〉-congruent and 〈a,b〉-extensional then vβ is strongly 〈a,b〉-congruent. So on the assumption that (i) all vβ with β < α are 〈a,b〉extensional, we can conclude that (ii) if all such vβ are 〈a,b〉-congruent then they are all strongly 〈a,b〉-congruent. And it is immediate from the Brady construction that (iii) for each β , if (∀γ < β )(vγ is strongly 〈a,b〉-congruent) then vβ is 〈a,b〉-congruent. Putting (ii) and (iii) together, we have, on assumption (i), that for each β < α, if (∀γ < β )(vγ is strongly 〈a,b〉-congruent) then vβ is strongly 〈a,b〉-congruent; so by induction, all such vβ are strongly 〈a,b〉-congruent, as desired. That proves (2), and (1) follows from it. And with rule (0→B), (3) reduces to (2), since with that rule the antecedent Class(x)∧Class(y)∧∀u(u ∈ x⇔ u ∈ y) and the consequent ∀z(x ∈ z⇔ y ∈ z) can only take on the classical values 0 or 1.  It's worth noting that in any of these constructions, we have a converse of extensionality, even in⇒ form:  ∀x∀y[Class(x)∧Class(y)∧∀z(x ∈ z⇔ y ∈ z)⇒∀u(u ∈ x⇔ u ∈ y)];(4) for any u, simply let z be {w : u ∈ w}, and apply the Intersubstitutivity Corollary (as guaranteed by the transparency of all v, plus FP). So with the bivalent 0 clause extensionality holds with⇔, and with the others it holds with an→ in one direction and both an→ and an⇒ in the reverse. 6. IDENTITY For the Brady construction, it remains only to introduce identity, show that it obeys reasonable laws, and make explicit the familiar technique of turning the models we've constructed into "normal" models where the identity relation is standard. In the logics with non-bivalent conditionals, the identity predicate will not be bivalent either, so it will be worth making explicit how the normalization technique works there. Most of what follows will be independent of the 0 clause for conditionals. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 16 As anticipated in Section 2, we define x = y as13 (x =L y)∨ [Class(x)∧Class(y)∧∀u(u ∈ x⇔ u ∈ y)]. Note that by this definition and the properties of the Brady construction, if α < β and |x = y|α < 1 then |x = y|β < 1. Observation. For x and y in the domain of M, = coincides with =L. More precisely: if either t1 or t2 is a name for a member of the domain of the ground model, then for any ordinal α , |t1 = t2|α is the same as |t1 =L t2|α (which is the same for all α). The proof is trivial, given that |Class(t)|α is 0 whenever t denotes an object in the ground model. It is also clear that if for any α |t1 =L t2|α is 1, then t1 and t2 denote the same object of the domain of the ground model and so for every β , |t1 ∈ z⇔ t2 ∈ z|β = 1. Given this and the Observation, the strong form of extensionality ((2) of Theorem 5.1) becomes  ∀x∀y[x = y→∀z(x ∈ z⇔ y ∈ z)]. And since for any formula A(u) with at most u free, there is a set {u : A(u)}, this immediately yields a strong form of substitutivity Substitutivity:  ∀x∀y[x = y→ [A(x/u)⇔ A(y/u)]], (The A may of course contain free variables other than u, in which case the validity of the Substitutivity claim as written is tantamount to the validity of its universal closure.) Note that this form of substitutivity is much stronger than the rule form x = y,A(x/u)  A(y/u).14 It's also trivial that identity is reflexive and symmetric (in a strong⇔ form): Reflexivity:  ∀x(x = x). Symmetry:  ∀x∀y(x = y⇔ y = x). And we have two forms of transitivity: Transitivity: (1):  ∀x∀y∀z((x = y∧ y = z)→ x = z) (2):  ∀x∀y∀z[x = y→ (y = z⇔ x = z)]. 13An alternative definition is: (x =L y)∨ [Class(x)∧Class(y)∧∀z(x ∈ z⇔ y ∈ z)]. Call this =ALT . It's easy to see that for any α , |x = y|α is 1 if and only if |x =ALT y|α is 1, and that if |x = y|α is 0 then so is |x =ALT y|α ; but the converse to the latter fails except when one uses the bivalent 0 clause B. We think the definition in the text more intuitive, but the choice between the two won't matter in what follows. (It might seem that we could use the simpler =ALT∗: ∀z(x ∈ z⇔ y ∈ z). But when x and y are distinct objects in the ground model, |x =ALT∗ y|0 = 1, which (since |x =L y|0 = 0) prevents |x =ALT∗ y→ x =L y|Ω from being 1 for such x and y. So if '=ALT∗' were used for '=', we wouldn't have the desired |x = y⇔ x =L y|Ω = 1 for x and y in the ground model.) 14Some papers in the literature (e.g. Bacon 2013b, Øgaard 2014) consider another strengthening of substitutivity, BadSubst: A(x/u)  x = y→ A(y/u), and derive various paradoxes from the addition of other principles to this. But we think (BadSubst) unacceptable in any context where the bivalence of identity is not assumed. This is clearly so if the language is rich enough to formulate a "determinately operator" D: then an instance of (BadSubst) is D(x = x)  x = y→ D(x = y), which given  D(x = x) yields  x = y→ D(x = y), i.e. that if x and y are identical they are determinately so. (Evans's (1978) argument against indeterminate identity was of basically this form, though he used a slightly different principle, which follows from this one if→ is contraposable.) So we don't see these paradoxes as presenting new challenges for naıve class theory; in any case, none of the approaches to Extensionality contemplated in this paper would validate (BadSubst). By contrast to these earlier results, we think the negative result of the final section is important because it involves principles which we can't see an independent motivation for denying. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 17 Proof. Form (2) follows from Substitutivity, taking A(u) to be 'u = z' (and using the symmetry of⇔). For (1), it suffices that for any β , if |x = y|β = |y = z|β = 1 then |x = z|β = 1. And that's immediate: the antecedent entails that for all γ < β and all u, |u ∈ x|γ = |u ∈ y|γ and |u ∈ y|γ = |u ∈ z|γ ; and that entails that for all γ < β and all u, |u ∈ x|γ = |u ∈ z|γ , and hence that |x = z|β = 1.  (We find it unlikely that the very strong forms of Substitutivity and Transitivity will hold for logics substantially different from those in this paper, unless they use two primitive conditionals (i.e. unless the non-contraposable→ in these laws is not one from which the contraposable⇒ is defined).15) In the semantics with the bivalent → (i.e. with 0-clause (B)), ⇒ is equivalent to →, so Substitutivity and both forms of Transitivity each hold in the stronger form in which the main conditionals are ⇒. With bivalent →, we also get excluded middle for identity statements.16 It remains only to show that we can contract the model M+ constructed in the previous section to a normal model M+/≈, one where For any o1,o2 in the domain D+/≈ of M+/≈, 〈o1,o2〉 satisfies 'x = y' if and only if o1 = o2. The method is basically the one familiar from classical logic, but requires slight care since we must proceed world by world (or, macro-stage by macro-stage). The elements of D+/≈ are to be equivalence classes of elements of D+, using the equivalence relation t1 ≈ t ′1 if and only if |t1 = t ′1|Ω = 1, where Ω is the Brady fixed point; that is, if and only if for all α , |t1 = t ′1|α = 1. Then for each α , we take the 3-valued extension E∗α of '∈' at the α th stage of the new model M∗ to be given by E∗α([t1], [t2]) = |t1 ∈ t2|α . This stipulation is only possible if it makes no difference which representatives of the equivalence classes we use, that is, if whenever t1 ≈ t ′1 and t2 ≈ t ′2, |t1 ∈ t2|α = |t ′1 ∈ t ′2|α ; but this follows from t1 = t ′1, t2 = t ′ 2  t1 ∈ t2⇔ t ′1 ∈ t ′2 which follows from two instances of the substitutivity rule (we don't need to use the axiom form). 15We do think that any adequate theory will have rule forms at least as strong as Substitutivity Rule: x = y  A(x/u)⇔ A(y/u), (and the generalization that puts any sequence of universal quantifiers over variables other than x and y before the conclusion), and its corollary Transitivity Rule: x = y  ∀z(y = z⇔ x = z). And preferably, it will have basically the axiom forms in the text, but as we say there, this will perhaps require that → be replaced with a second primitive conditional (which might be similar to a "Ramsey-Adams conditional" as discussed in the next section). 16What about a form of Substitutivity analogous to form (1) of Transitivity, that is, (Strong-Sub)  ∀x∀y[x = y∧A(x/u)→ A(y/u)] ? That fails even in the bivalent case. For instance, for any x and y, |x = y|0 is 1, so this form of Substitutivity fails for any A(u) for which there are any x and y for which |A(x)|0 = 1 and |A(y)|0 < 1! PROSPECTS FOR A NAÏVE THEORY OF CLASSES 18 7. THE RULE OF WEAKENING In addition to delivering the naıve theory of classes, Brady's logics also have some other very desirable features. The 1983 construction delivers the logic TWQd while the 2006 one delivers the stronger T JQd (for a statement of some important laws and rules of these logics, see Appendix A). But the logics have one feature we think highly undesirable, especially in the context of a naıve theory of classes: they fail to deliver the "rule of ⇒weakening" B  A⇒ B, or even the "rule of→ -weakening" B  A→ B. Now, you might say, who wants that? We shouldn't be able to infer from "I'll eat dinner tonight" to "If I die 1 minute from now then I'll eat dinner tonight". But although we take this "Ramsey-Adams phenomenon" seriously, there are three reasons why using it to defend Brady's logic for naıve class theory seems to us misguided. First, the reasons the weakening rule fails for Brady don't seem to have anything to do with the Ramsey-Adams phenomenon: the rule fails for Brady in cases which don't exhibit the characteristic features of the Ramsey-Adams phenomenon, and, conversely, Brady validates instances of rule-weakening in paradigm examples of the Ramsey-Adams phenomenon. Here's a typical example in which the weakening rule fails in Brady's logics. Although ¬(>→⊥) is valid (as is evident from the validity of Contra-Modus-Ponens), >→¬(>→⊥) isn't; indeed, its negation is. The reason is that at v0, every conditional has value 1, so ¬(>→⊥) has the "erroneous" value 0 at this undesignated world. This value soon gets "corrected": from v1 on, >→⊥ is corrected to having value 0, so ¬(>→⊥) is corrected to having value 1. But the original "error" leads to permanent errors in the values of many conditionals containing ¬(>→ ⊥), such as >→ ¬(>→ ⊥): since at v0 > has value 1 and ¬(>→⊥) doesn't, >→¬(>→⊥) has the "erroneous" value 0 at v1. And the rules of the Brady construction prevent the value of >→ ¬(>→ ⊥) from recovering: once a conditional gets value 0 it is doomed to stay there, it can never recover from the initial error. (All this holds for Brady's⇒ as well.) As we said, this example (and the structural features of the model which generate it) seems to have nothing whatever to do with the Ramsey-Adams phenomenon. More generally: the Ramsey-Adams phenomenon motivates allowing failures of rule-weakening for some sentences, but it does not motivate allowing cases where A∧B and 2 (A→B), never mind the case that we have here, where  A∧B and yet  ¬(A→ B) (if we take A to be > and B to be ¬(>→⊥)). (Certainly the above example from ordinary English is unlike this: "I will die 1 minute from now and I'll eat dinner tonight" is unassertable, let alone an obvious logical truth.) The kind of extreme failure of rule-weakening exhibited in Brady's logics simply cannot be explained by reference to the Ramsey-Adams phenomenon. In the opposite direction, the standard Ramsey-Adams examples such as the one above about eating dinner involve sentences in the ground language, not in the extended vocabulary. But nothing in Brady's constructions suggest that Weakening will fail for such sentences. (Indeed at least in the version of the constructions presented here, rule-weakening holds when we restrict attention to consequents which have vocabulary drawn exclusively PROSPECTS FOR A NAÏVE THEORY OF CLASSES 19 from the ground language.) So not only does Brady admit more failures of the weakening rule than are motivated by appeal to the Ramsey-Adams phenomenon, he also doesn't give us enough failures of the rule to remove the problematic examples. The second reason why we shouldn't use the Ramsey-Adams phenomenon to defend the failure of rule-weakening in Brady's logics is that one important application of conditionals is for restricted quantification: we explain "All A are B" as ''∀x(if Ax then Bx)". And for this, we need a conditional that does obey rule-weakening, since otherwise we can't infer "All A are B" from "everything is B". A Ramsey-Adams conditional may be useful for other purposes, but can't be used for restricted quantification: in a classical framework we need something more like the material conditional for that, and in a non-classical framework we need something that reduces to an analog of the material conditional in classical contexts.17 Brady himself acknowledges this: in a more recent joint paper on restricted quantification (Beall et al. 2006), he and his co-authors propose a second conditional for restricted quantification that does yield rule weakening. Unfortunately, this second conditional hasn't even been shown to validate naıve Comprehension or Abstraction, let alone Extensionality. But the point remains: in the context of restricted quantification, rule-weakening is a clear and widely accepted desideratum. Our third and final reason for wanting the weakening rule in the context of naıve class theory is that failures of that rule rob Extensionality of much of its intuitive force. Consider again the two sentences> and¬(>→⊥). Let a be {x :>} and b be {x :¬(>→⊥)}. Since > and ¬(>→⊥) have value 1 at the Brady fixed-point, clearly ∀x(x ∈ a) and ∀x(x ∈ b) do too: a and b are universal classes. But because > → ¬(> → ⊥) is not designated, neither is ∀x(x ∈ a→ x ∈ b), so the Brady theory will not declare these two universal sets coextensive. (Indeed, since the negation of >→ ¬(>→⊥) is designated, it will actually declare the two universal sets non-coextensive!) Thus in the absence of a weakening rule, Extensionality isn't nearly as strong as we'd have intuitively expected; it seems almost not to deserve the name. 8. POSITIVE BRADY LOGIC AND POSITIVE BACON LOGIC We take the observations of the previous section, especially the second and third, to show that if a useful naıve theory of classes is possible it will have to be in a logic that among other things validates a rule of Weakening. (Of course, Weakening alone isn't enough to be useful: we noted before that we can get Abstraction and Extensionality trivially, by declaring all conditionals valid, and it's clear that the same tactic would give us Weakening. But that's obviously not very interesting-to mention just one pathology, it would yield a drastic failure of Modus Ponens.) The literature contains several proposals for naıve theories of truth/satisfaction/properties in logics that do contain a rule of Weakening: for instance the revision constructions of Field 2008, or the fixed point construction for the "fibers" in Field 2014. But Brady's proof 17It might be thought that we need not only rule weakening, but a conditional form of it, either B→ (A→B) or an analog with⇒; for we want not only to be able to infer "All A are B" from "everything is B", but to assert If everything is B then all A are B. Perhaps, but we don't take it as obvious that the explicit 'if...then' in the displayed sentence is the same as the conditional used for restricted quantification. For example, one might want this second conditional to respect the Ramsey-Adams phenomenon. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 20 can't be adapted to these other constructions, at least without substantial modification of those constructions.18 But can we modify Brady's construction in such a way as to get Weakening for one or both of the conditionals → and ⇒? One strategy that might initially appear promising, but isn't, is to use the basic Brady construction but with a different starting point. (By the "starting point", we mean the v0 of the construction.) In the constructions developed by Brady himself that we've considered,19 v0 assigns value 1 to every conditional. But what if we modified the construction to use some other transparent prevaluation h as v0? That is, what if we used h to generate a Brady progression, as follows: vα(A→ B) =  1 iff h(A→ B) = 1 and (∀β < α)( if |A|β < 1 then |B|β = 1) 0 iff h(A→ B) = 0 and (∃β < α)(|A|β = 1 and |B|β < 1) (That's for 0 clause (0→A); modifications of the other 0 clauses would be analogous.) We might try using some kind of "super-macro construction" (revision-theoretic or fixed point) to generate a suitable h. But this approach isn't promising even as a way to get a satisfactory logic with Weakening, let alone one that gives rise to Extensionality. For we will have a failure of Weakening if there is a conditional A→ B that the fixed point assigns value 0 but h assigns a non-zero value; for then at the fixed point, ¬(A→ B) will get value 1 and >→ ¬(A→ B) won't. So to get Weakening, we need to know at the start which conditionals must get value 0. But if we knew this, there wouldn't be much reason to run the Brady construction at all. And even if we did know this, and still felt like using the Brady construction, getting Extensionality by the obvious generalization of the Brady proof would require an h which is 〈a,b〉-congruent for every pair 〈a,b〉-for which the final prevaluation is 〈a,b〉-extensional. There's no obvious procedure for finding a starting h that guarantees either Weakening or Extensionality, never mind both. One thing that does work-we don't find it appealing as it stands, but it will be a springboard to a better suggestion in the next section-is to eliminate the involutive negation ¬ from the language, at least as applied to formulas not in the ground language. (We'd then 18The following example is a counterexample to extensionality in both constructions: a = {x : ⊥} and b = {x :>⇒⊥}; c = {x : ∀y(x∈ y⇔ b∈ y)}. In both constructions, it's clear that ∀x(x∈ a⇔ x∈ b) comes out valid; so the extensionality rule would require ∀y(a∈ y⇔ b∈ y) to come out valid, which in turn requires a∈ c⇔ b∈ c to come out valid. But it doesn't. For instance, in the revision construction of Field 2008, |b ∈ c|β is 1 whenever β > 0; which means that for α > 1, |a ∈ c⇔ b ∈ c|α is 1 if and only if there is an interval prior to α throughout which |a ∈ c|β is 1. But by choice of c, this just means that for α > 1, |a ∈ c|α is 1 only if there is an interval prior to α throughout which |a ∈ c|β is 1. So unless |a ∈ c|α is 1 for α = 1, then by induction it isn't 1 for any bigger α either. But it isn't 1 for α = 1. To see this, let d be {x : (¬∃y)(y ∈ x)}. Then |a ∈ d|0 = 1; but since |> ⇒ ⊥|0 = 1/2, |b ∈ d|0 = 1/2, with the result that |a ∈ c|1 < 1. The reader may verify that the same example (replacing ⇒ above with the . or I of that paper) is a counterexample to the rule form of extensionality for the "fibers" of Field 2014. Theorem 10.1 will cast further light on this: both logics have attractive features that turn out to preclude Extensionality (in the presence of Abstraction). Thus the "substantial modification of the constructions" mentioned in the main text would have to result in a significant change of the logic, if the logic was to be used for naıve classes. (Naıve property theory works fine for both logics as they stand.) 19Brady 2013 does consider an alternative v0, but not with the intent of achieving Weakening. (Indeed in the case of his M2 logics, our result in Section 10 will show that whatever the corrected version of his construction validates, if it preserves Abstraction and Extensionality, it will not yield Weakening, since his conditionals are contraposable and validate Contra-Modus Ponens.) PROSPECTS FOR A NAÏVE THEORY OF CLASSES 21 take∨ and ∃ as well as∧ and ∀ as primitive.) While it would be possible to keep¬, but with formation rules that allow it to apply only to formulas of the ground language, it is more convenient to drop it entirely, and for each ground predicate p introduce a dual predicate p∗. (⊥∗ is our new version of>.) Call the resulting ground language Lpos, and the extended language L+,pos. We take the logic of the ground language to include the LEM-like axiom p(x1, ...,xk)∨ p∗(x1, ...,xk) and the explosion-like rule p(x1, ...,xk), p∗(x1, ...,xk) ⊥. This isn't really less expressive than keeping ¬ throughout the ground language, because the deMorgan laws of the language with ¬ would allow one to drive it in to atomic predicates and replace negations of those by their duals. We don't want to use a similar trick for the extended language; the point was to get rid of involutive negation there. But since we have a primitive absurdity constant⊥, we'll still be able to define various "negation-like" operations applicable to the full language, using the conditional: e.g. A→⊥, A→ (A→⊥), and so on. We'll say more about these in a moment. But first, we need to say something about a choice point for the conditional in the new setting. Since we have no involutive negation, we can no longer define ⇒ from →. It would be possible to allow it as an additional primitive, governed by the same valuation rule as before, but it will be important in what follows that we not do so; the logic we'll be calling "positive Brady logic" uses the rule for the→ conditional, not the⇒.20 The difference between ⇒ and → matters even in the negation-free setting, because there will still be sentences that get value 1/2; and if⇒ were in the language, the difference between 1/2 and 0 at some worlds would make a difference for which sentences received value 1 at other worlds, including the Brady fixed point Ω. For instance, let a abbreviate {x : x ∈ x}. Then |a ∈ a|v is 1/2 whatever the v, given that we're using minimal fixed points in the micro-construction. So |a ∈ a|v and |⊥|v are both less than 1 at all v, but |a ∈ a⇒ a ∈ a|Ω = 1 while |a ∈ a⇒ ⊥|Ω < 1. On the other hand, with → but not ⇒ or ¬ in the language, this phenomenon cannot arise: the distinction between a sentence A having value 1/2 or 0 is then idle in that it makes no difference as to whether compounds containing A are designated. (That's so for any of the 0-clauses, and for static or dynamic, which shows that these distinctions don't matter in the context of positive logic.) To prove this last claim about the language with only →, we'll introduce a 2-valued model for the negationand ⇒-free language, and show that the 3-valued and 2-valued static models assign the same sentences value 1. The 3-valued model, which we'll denote with a superscripted 3, is exactly as before, except with the restricted language. The 2valued model, denoted by a superscripted 2, uses only two values, 0 and 1. The rules for all connectives except ∈ remain unchanged; in the two-valued micro-construction, 0 behaves as 1/2 did before: we start all sentences a ∈ b at value 0, and sentences can pass from 0 to 1, but not in the other direction. (With the obvious modifications, the required monotonicity lemma (RM) goes through as usual.) In the macro-construction, the 1-clause for→ is as before; the 0-clause won't matter to the proof. We then have: Theorem 8.1. For all sentences A of L+,pos and all α and σ , |A|3α,σ = 1 iff |A|2α,σ = 1. Proof. The proof is by induction on pairs α and σ , ordered lexicographically, i.e. with 〈α,σ〉 < 〈β ,τ〉 if and only if either α < β , or α = β and σ < τ . It suffices to show for each α and σ that 20It might seem more in the spirit of Brady to take⇒ instead of (or together with)→ as primitive. But since we don't think the purely positive logic (whether developed with ⇒, or →) is promising, and since our main interest is in laying the ground-work for the four-valued model in the next section, we won't pursue this approach here. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 22 (*) |B|3α,σ = 1 iff |B|2α,σ = 1 whenever B is either of form C→D or of form a ∈ b where b is a class abstract; for then an obvious sub-induction on complexity yields that this is so for all sentences A of any complexity. Suppose that (*) holds for all pairs prior to 〈α,σ〉 in the lexicographic ordering. Then if σ > 0, it clearly holds for sentences of form C → D: for by the induction hypothesis this holds for the earlier 〈α,0〉; and so |C → D|3α,σ = |C → D|3α,0 = |C → D|2 α,0 = |C→ D|2α,σ . Moreover (still assuming σ > 0) (*) also holds for sentences of form a ∈ {x : C(x)}; for |a ∈ {x : C(x)}|3α,σ is 1 if and only if for some ρ < σ , |C(a)|3α,ρ is 1, which by the induction hypothesis holds iff for some ρ < σ , |C(a)|2α,ρ is 1, and hence if and only if |a ∈ {x : C(x)}|2α,σ is 1. So we need only consider pairs of form 〈α,0〉. But in that case, |a ∈ {x : C(x)}|3 α,0 and |a ∈ {x : C(x)}|2 α,0 are 1/2 and 0 respectively, so both differ from 1. And the induction hypothesis tells us that for all β < α , |C|3 β ,Ψ = 1 iff |C| 2 β ,Ψ = 1 and |D| 3 β ,Ψ = 1 iff |D| 2 β ,Ψ = 1; so given the rules for the non-contraposable→, |C→D|3 α,0 = 1 iff |C→D|2α,0 = 1.  The Brady fixed point result of course still holds for positive Brady logic: the restricted language, and the two-valued model, don't affect the proof. But positive Brady logic does have a new phenomenon: for every sentence in the language, not just for conditionals, values are non-increasing as the construction proceeds: (∗∗) If α1 < α2 then |A|α2 6 |A|α1 21 (where again this is the real 6, not the information ordering 6K). And that means that in addition to the negation-free laws that are validated by the Brady construction with involutive negation, we get not only the Weakening Rule but even the stronger conditional form: (WeakeningAx):  B→ (A→ B). For if any B has value 1 at an α , it has value 1 at all β < α , and so A→ B has value 1 at α . But what about Abstraction and Extensionality? Since we do not have ⇔ in the language we can't use our earlier formulations of these laws. But since we're working with a 2-valued model we don't need anything other than ↔; the reason is that in the restricted value space, ↔ has the crucial property that |A↔ B|Ω = 1 if and only if for all α |A|α = |B|α . Given the right to left of this "if and only if", Abstraction follows directly from the fixed point theorem. And given both directions of it, Extensionality reduces to the claim that when a and b are classes and for all α ∀x(|x ∈ a|α = |x ∈ b|α) then for all α ∀z(|a ∈ z|α = |b ∈ z|α); and that is proved in precisely the same way as before. (The claim is simply that 〈a,b〉-extensionality at every macro-stage of the Brady construction entails strong 〈a,b〉-congruence at every stage, and that follows by an obvious induction using Micro-Extensionality and the fact that the initial macro-stage is (weakly) 〈a,b〉-congruent whatever the a and b.) So this approach does yield a logic with Weakening, Modus Ponens, and our two classtheoretic axioms. (Indeed it yields the logic known as TJK+; see Appendix A.) But having 21Because of this, we can think of the Brady semantics for the positive logic as assigning to every sentence an ordinal value: |||A||| is the first α for which |A|α is 0 if there is such an α , and otherwise |||A||| is the fixed point ordinal ΩM of the negation-free construction for ground model M. On this semantics, ΩM is the designated value. (Only sentences with single bar value 1 at the fixed point get triple bar value ΩM , because no sentence A can first get (single-bar) value 0 at the fixed point ordinal: if A first got value 0 at ΩM , >→ A would first get value 0 at ΩM +1, which is impossible given that ΩM is a fixed point.) PROSPECTS FOR A NAÏVE THEORY OF CLASSES 23 no involutive negation is a high price to pay. And while there are other negation-like operations, in the language, we don't think they're enough to fill the gap. Consider the obvious candidates, the operations definable from ⊥ and iterated conditionals. It's clear that we can iterate the operation 'A→': for any A and B, define A→(n) B inductively in the obvious way: A→(0) B is just B A→(n+1) B is A→ (A→(n) B). And, when the ground language includes arithmetic, we can use the truth predicate to "infinitely disjoin" them:22 putting it loosely, A→(ω) B is ∃n[True(〈A→(n) B〉)]. This is loose, since the preceding didn't introduce the numerical superscripts as quantifiable variables; a more accurate statement23 would be: A→(ω) B is ∃n [n is a natural number and ∃x[True(x) and x is the nth member of the sequence whose 0th member is B and for all k whose (k+1)st member is obtained from the kth by prefixing 'A→ ']]. We can now continue: e.g., A →(ω+n+1) B is A → (A →(ω+n) B); and we can define A→(ω+ω) B analogously to how we defined A→(ω) B. When we come to limit ordinals of higher cofinality it becomes progressively more complicated, but we can continue a long way through the countable ordinals. (Doing it rigorously involves using a system of ordinal notations.) And insofar as we can do it, we have that if μ1 < μ2 then for all α , if |A→(μ1) B|α = 1 then |A→(μ2) B|α = 1. So, letting ¬NIμ A be A→(μ) ⊥, we have a transfinite sequence of progressively weaker "non-involutive negation operators". (¬NI0 is the maximally strong "negation": applied to anything at all, it yields an equivalent of ⊥.24 There is no way to define a weakest member of the sequence: roughly speaking, this is because the first ordinal for which there is no notation is a limit ordinal.25) But none of these "negation" operators behaves very much like we'd want a negation operator to behave: for instance, for each μ , ¬NIμ¬NIμ> is equivalent to ¬NIμ¬NIμ⊥ (both have value 1 at stages α < μ , 0 for later stages). Bacon 2013a (in the first five sections of his paper, before he introduces involutive negation) gives a construction which, though presented in a superficially quite different manner (see our Appendix B for exposition and discussion), is very similar to the one we've just described for positive Brady logic. In fact, for the positive fragment Bacon's construction, like the positive Brady we've developed, delivers the logic T JK+, as described in Appendix A. The only real differences between the constructions are: (i) that instead of going to the fixed point, Bacon goes through only the finite stages of the Brady construction; 22On adding a truth predicate, see above, note 9. 23Still not put perfectly precisely, but we assume clear enough. 24It is easy to verify that if (∃β < μ)(|A|β = 0) then |¬NIμ A|α is 1 for every α , and hence ¬NIμ A is equivalent to >; whereas if (∀β < μ)(|A|β = 1) then |¬NIμ A|α is 1 for α < μ and 0 for α > μ . (Or in the terminology of footnote 21: If |||A|||< μ then |||¬NIμ A|||= Ω, and if |||A||| ≥ μ then |||¬NIμ A|||= μ . As μ increases, there are more sentences A for which |||¬NIμ A||| = Ω, and those that don't get strictly higher value than for smaller μ .) So every sentence in the language is equivalent either to > or to some ¬NIμ>, with ⊥ equivalent to ¬NI0> but also to ¬NI0⊥. None of the ¬NIμ lead to paradox because ¬NIμ (¬NIμ>) is equivalent to ¬NIμ>: its value is 1 for α < μ , 0 for α 6 μ (i.e. its triple bar value is μ). 25This isn't quite accurate, because the notations needn't be restricted to standard Church-Kleene notations; they can be taken to include ones that use a non-bivalent membership predicate, and talk of "the first" F can be problematic in a non-bivalent context. (See Field 2008, Ch. 22, for a treatment of related issues in the context of naıve truth and satisfaction.) PROSPECTS FOR A NAÏVE THEORY OF CLASSES 24 (ii) accordingly, he defines validity, not as preservation of value 1 at the fixed point, but preservation of the property of having value 1 at all the finite stages. So although there's no finite stage where we have that if |A|n and |A→ B|n are both 1 then |B|n = 1, the definition of validity in (ii) still yields modus ponens. And of course the Weakening Axiom is still valid. Bacon's construction thus preserves some of the nice features of the Brady positive construction as we've developed it. But we aren't sure whether Bacon's construction has any advantages over the Brady positive logic construction, and there is what strikes us as a serious disadvantage: it leads to a dramatic failure of the existential quantifier form of reasoning by cases (MR2 in Appendix A). The sentence ¬NIω>, i.e. >→(ω) ⊥, has value 1 at each finite n, so on Bacon's definition it is a logical truth (or anyway, a consequence of arithmetic). That is odd in itself, but especially so since on unpacking the definition, it is an existential quantification of the form ∃n[n is a natural number and F(n)], where each F(n) is equivalent, given number theory, to >→(n) ⊥. Each of the >→(n) ⊥ is inconsistent: each leads to absurdity by repeated applications of modus ponens. This is the promised drastic failure of reasoning by cases: an existential quantification counts for Bacon as a logical truth, even though each of its instances is logically inconsistent. As we said, we know of no advantages of the Bacon positive logic over the Brady that could compensate for this. Nonetheless it is worth noting that the proof of Extensionality for positive Brady logic carries over to Bacon's construction: if a and b are class abstracts and for all n, |∀u(u ∈ a↔ u ∈ b)|n = 1, then an induction using the Micro-Extensionality Theorem shows that for all n, |∀z(a ∈ z↔ b ∈ z)|n = 1. 9. 4-VALUED BRADY AND BACON LOGICS In the final section of his paper Bacon gives a trick for introducing an involutive negation into his construction, while still preserving the Weakening Rule, indeed even the Weakening Axiom for the non-contraposable conditional→. Bacon himself aborts the construction of the conditional valuation after all finite stages, so that his new proposal still suffers from the failure of reasoning by cases just discussed (in fact the proposal as he states it doesn't even quite do what he claims it does: he claims it delivers Explosion and Reflexivity, but as we show in Appendix B, it can do so only if it denies that certain sentences express propositions, and restricts the application of the laws to sentences which do express propositions). But Bacon's trick can also be used for Brady's positive logic, where we continue the construction through to the Brady fixed point, and define designatedness in terms of the value at that fixed point. We'll develop Bacon's trick in this second setting. The trick is to assign each sentence A an ordered pair 〈|A|+, |A|−〉 of values in {0,1} at each world; intuitively, we can think of |A|+ as a value for A and |A|− as a value for its negation. We assume a classical model of the ground language, which takes sentences into {〈1,0〉,〈0,1〉}. In the Kripke micro-construction, as usual we take as given a prevaluation v that assigns values v+(A→ B) and v−(A→ B) for any sentences A and B. Using this v, we assign values as follows: (1) |¬A|+v,σ = |A|−v,σ and |¬A|−v,σ = |A|+v,σ (2) |A∧B|+v,σ = min{|A|+v,σ , |B|+v,σ} (3) |A∧B|−v,σ = max{|A|−v,σ , |B|−v,σ} (4) |∀xA(x)|+v,σ = min{|A(t/x)|+v,σ : t is a closed term of L+} (5) |∀xA(x)|−v,σ = max{|A(t/x)|−v,σ : t is a closed term of L+} PROSPECTS FOR A NAÏVE THEORY OF CLASSES 25 (6) If t1 is a closed term and t2 is a name in the ground language, |t ∈ t2|v,σ = 〈0,1〉 (7) If t1 is a closed term and t2 is {x : A(x)}, |t1 ∈ t2|+v,σ is 1 iff (∃ρ < σ)(∀τ in the interval [ρ,σ))(|A(t/x)|+v,τ = 1) (8) If t1 is a closed term and t2 is {x : A(x)}, |t1 ∈ t2|−v,σ is 1 iff (∃ρ < σ)(∀τ in the interval [ρ,σ))(|A(t/x)|−v,τ = 1) What about the conditional? The analogy isn't perfect, but we can think of the +-term in the valuation of the conditional as similar to the 1-clause in the three-valued case. So just as the static and dynamic constructions shared a 1-clause, they'll share the rule for |A→ B|+, namely, |A→ B|+v,σ = v+(A→ B). But just as the static and dynamic constructions differed on the 0 clause in the three-valued case, they'll differ here on the relationship between v− and |A→ B|−v,σ . In the static construction, the relationship is just identity: (S) |A→ B|−v,σ = v−(A→ B), whereas dynamic constructions may vary the relationship in one of several ways: (D−i ) |A→ B|−v,σ = 1 iff v−(A→ B) = 1 and |A|+v,σ = 1; or (D−ii ) |A→ B|−v,σ = 1 iff v−(A→ B) = 1 and |B|−v,σ = 1; or (D−iii) |A→ B|−v,σ = 1 iff v−(A→ B) = 1 and |A|+v,σ = 1 and |B|−v,σ = 1; or simply (D−iv) |A→ B|−v,σ = 1 iff |A|+v,σ = 1 and |B|−v,σ = 1. (With (D−iv), we don't really need v − at all; or we can think of (D−iv) as a special case of (D−iii), where v − always assigns value 1 to everything.) Whichever of these clauses for |A→ B|−v,σ we choose, neither the positive nor the negative component of the value of a sentence ever goes from 1 to 0 as the micro-stage σ increases; so there must be a point Ψv after which no sentence changes value, that is, a fixed point in the micro-construction of the valuation for ∈, which satisfies (FP). We now do a Brady macro-construction of vα , where at each macro-stage α we determine vα(A→ B) from the values of A and B at the minimal fixed points of the microconstructions at prior stages. As before, we simply write |A|α for the value of A at the minimal fixed point of the micro-construction over vα . In the case of v+α (A→ B)-which is just |A→ B|+α even on the dynamic, so we'll write it that way-we set: |A→ B|+α = 1 iff (∀β < α)(|A|+β 6 |B| + β ) That's for a non-contraposable conditional; for a contraposable it would be |A⇒ B|+α = 1 iff (∀β < α)(|A|+β 6 |B| + β and |B|− β 6 |A|− β ). What about the v−α ? As noted earlier the choice is irrelevant if we use version (D − iv) of the dynamic conditional. For the static, and for the dynamic proposals (D−i )-(D − iii), the most natural choice is (I): v−α (A→ B) = 1 iff (∀β < α)(|A|+β = |B| − β = 1); then for⇒ we have the same condition. (If (I) is chosen, then the |A→ B|−α clauses given in (D−i )-(D − iii) can be simplified, using the observation of the next paragraph: that +-values of sentences can never pass from 0 to 1 as the macro-construction proceeds. For instance, given that observation and (I), (D−i ) is equivalent to (I−i ): |A→ B|−α = 1 iff |A|+α = 1 and (∀β < α)(|B| − β = 1).) PROSPECTS FOR A NAÏVE THEORY OF CLASSES 26 So at the initial (α = 0) stage, both v+ and v− assign every conditional 1 (though in the dynamic versions this doesn't prevent some |A→B|−v0 from being 0). An easy induction on α , with a subinduction on complexity, shows that as α increases, neither the + nor the − value of any sentence can ever go from 0 to 1. This then leads to a fixed point ΩM as in the original Brady construction; every sentence gets one of the values 〈1,0〉, 〈0,1〉, 〈0,0〉 and 〈1,1〉 at the fixed point.26 How is validity to be defined? The best choice is to take it as preservation of the property of having +-value 1 at the fixed point (that is, the property of having value either 〈1,0〉 or 〈1,1〉 there). This however means that a sentence and its negation can both be designated; the logic will be paraconsistent, in that the existence of such "dialetheia" will not lead to explosion. For instance, let b be the abstract {x : x ∈ x→ x /∈ x}. By FP, b ∈ b is equivalent to b ∈ b→ b /∈ b; so letting X abbreviate the sentence b ∈ b, we have that X is equivalent to X → ¬X . On any of the treatments of the negative component of conditionals we've discussed, an easy induction gives that at any α , X has value 〈1,1〉.27 We could avoid dialetheia by taking only the value 〈1,0〉 as designated. But this would have a very serious cost: if X is the sentence above, X → X has value 〈1,1〉 at every α and thus at the fixed point ΩM , whatever the M, so it would not come out designated; the Reflexivity Law for conditionals, A→ A, would fail. (This choice between going dialetheic and restricting the Reflexivity Law arises in Bacon's logics too, though he seems to think otherwise; see Appendix B.) A proposal restricting Reflexivity in this way is not only intrinsically unattractive, it is a nonstarter for naıve class theory: for if Reflexivity fails, then Abstraction must also fail. (Extensionality will too, though slightly less obviously.) There is a way to avoid both dialetheia and restrictions on Reflexivity: modify the clause for the negative values of conditionals from (I) to (II): v−α (A→ B) = 1 iff (∀β < α)(|A|+β = |B| − β = 1 and |A|− β = |B|+ β = 0); or what is the same thing given the non-increasing nature of the values, • If α = 0, v−α (A→ B) = 1 • If α > 0, v−α (A→ B) = 1 iff (∀β < α)(|A|+β = |B| − β = 1) and |A|−v0 = |B| + v0 = 0. However, this too has a high price: when B is itself a conditional, |B|+v0 is never 0, so for α > 0, v−α (A→ B) can never be 1, and thus we no longer get the law Contra-Modus-Ponens: A,¬B  ¬(A→ B). For instance, > and ¬(>→⊥) don't together imply ¬(>→ (>→⊥)) on this account.28 Since the cost of giving up either Reflexivity or Contra-Modus-Ponens would be high, we conclude that the best version of the construction in this section admits dialetheia. (It is not only dialetheic, it requires restrictions on excluded middle as well, e.g. for the Russell paradox.)29 The resulting static construction validates a dialetheic version of TJK (without 26In the "triple bar" representation of previous footnotes, every sentence is assigned a pair of ordinals, each 6ΩM , summarizing the values at every stage of the macro-construction. 27|X |+α = 1 whenever (∀β < α)(|X |−β = 1); on the static, |X | − α = 1 iff (∀β < α)(|X |+β = 1), and on any of the dynamic, |X |−α = 1 iff |X |+α = 1. 28We think even dialetheists should accept Contra-Modus-Ponens: for instance, if A is a dialetheia, we think A→ ¬A and A→ A should be too. But we needn't decide this: the point of switching to (II) was to avoid dialetheia, so the proponent of this switch won't try to use dialetheia to explain away the resultant failure of Contra-Modus Ponens. 29An alternative which does validate excluded middle without restriction can be derived from Brady 1989. There, Brady gives a construction for a dialetheic logic, which differs from his other constructions in three PROSPECTS FOR A NAÏVE THEORY OF CLASSES 27 explosion, but with involutive negation) which we call DTJK, and which is described as usual in Appendix A. In the earlier three-valued construction, the dynamic construction improved the static primarily by invalidating some undesirable laws of the static. Here, the most obvious effect of the dynamic is adding new laws: for instance, no sentence of the form ¬(A→ B)→⊥ is valid in the static (since |A→ B|−0 is never 0), but ¬(⊥→ B)→⊥ is valid in all of the dynamic constructions except for (D−ii ), and ¬(A→>)→⊥ is valid in all of the dynamic constructions except for (D−i ). What about Extensionality? Unlike the case of positive logic, the ↔ form of the Extensionality Rule does not hold in any of the 4-valued constructions. For let c be {x : X} and d be {x : >}, where X is as above and > has value 〈1,0〉 at every world. Then |∀x(x ∈ c↔ x ∈ d|+ ΩM = 1. But if e is {x : x /∈ x} then |c ∈ e|ΩM = 〈1,1〉, while |d ∈ e|ΩM = 〈0,1〉, so |c ∈ e→ d ∈ e| + ΩM is 0. But this counterexample does not carry over to the ⇔ form of the extensionality rule, since |∀x(x ∈ c⇔ x ∈ d|+ ΩM is 0, not 1 (given that |¬(x ∈ c)→¬(x ∈ d)|+ ΩM is 0 for any x). Indeed, the ⇔ form of the Extensionality Rule is valid in the construction (whatever the v− rule), as is even the axiom  ∀x∀y[Class(x)∧Class(y)∧∀u(u ∈ x⇔ u ∈ y)→∀z(x ∈ z⇔ y ∈ z)]. To see this, observe first that, for any α , |A⇔B|+α = 1 only if, for all β <α , |A|β = |B|β . So if |∀u(u∈ a⇔ u∈ b)|+α = 1 for classes a and b, then vβ for β < α is 〈a,b〉-extensional, and we can show inductively, as before, that each vβ for β < α is strongly 〈a,b〉-congruent; and hence |∀z(x ∈ z⇔ y ∈ z)|+α = 1. In fact, it's not just Extensionality which holds: Abstraction clearly holds as well in both its↔ form and also in its stronger⇔ form. The construction yields a naıve theory of classes, whatever the v− rule. This is somewhat good news for dialetheists (at least, for those willing to restrict excluded middle as well). As we noted in Section 7, anyone who wants to use a conditional to define universal restricted quantification has reason to want that conditional to obey a Rule of Weakening: without it, there is no way to get the obviously desirable rule RQ-Weakening-Rule: Everything is B  All A are B. The importance of this rule has been recognized even by some of those (whether dialetheists or not) who stress the importance of relevance conditionals for which the Rule of Weakening fails: they also posit non-relevance conditionals for which the rule holds, to handle restricted quantification. (See for instance Beall et al. 2006.) So the good news is that dialetheists can combine the Weakening Rule with Abstraction and Extensionality.30 In fact, as with the positive logic, we can strengthen the Weakening Rule to an axiom, and hence with restricted quantification we get respects. First, all conditionals start by taking value 1/2; second, he uses the following clauses: |A⇒ B|α = 1 iff (∀β < α)(if |A|β > 1/2, |B|β = 1) and |A⇒ B|α = 0 iff (∃β < α)(|A|β > 1/2 and |B|β = 0), which, as our notation suggests, yields contraposition. Third, and finally he takes the designated values to be both 1/2 and 1. The construction delivers the logic DJ plus Contra-Modus Ponens. TFØ notes that in this case we can define a non-contraposable conditional 7→ which has both Weakening and no restrictions on excluded middle, by letting A 7→ B = (A∧λ )⇒ B, where λ is the Russell sentence {x : ¬x ∈ x} ∈ {x : ¬x ∈ x}. Here, the non-contraposable conditional is defined from the contraposable as opposed to the other way round, but the logic of each is in some respects as good as what we have in the main text. A main loss is that the new conditional does not satisfy Suffixing or Prefixing (see next Section). 30A similar point holds for dialetheists who think we don't need a conditional to express universal restricted quantification but can take that as a primitive quantifier (∀x 3 Ax)Bx (read as "every A is B"). (When Ax has the form x ∈ t we will abbreviate (∀x 3 Ax)Bx as (∀x ∈ t)Bx; similarly for /∈.) For the present construction could be redone in those terms, showing that we can get Abstraction and Extensionality in the forms PROSPECTS FOR A NAÏVE THEORY OF CLASSES 28 RQ-Weakening-Axiom:  Everything is B→ all A are B. As yet, we have no equally good result on Extensionality for those who would avoid dialetheia and simply restrict excluded middle. We ourselves aren't entirely enthusiastic about dialetheic theories. One reason is that they inevitably lead to what we regard as a Sophie's Choice regarding restricted quantification: between giving up one or the other of the attractive principles Negative Conjunctive RQ-Weakening: ¬∃x(A(x)∧¬B(x))  All A are B; RQ-Modus Ponens: All A are B, A(c)  B(c). For the two together yield ¬∃x(A(x)∧¬B(x)),A(c)  B(c), and applying this to sentences A and B we get ¬(A∧¬B),A  B and hence ¬A,A  B which leads immediately to triviality in the presence of a dialetheia.31 Still, even setting this general worry about dialetheism aside, there is an additional reason why the construction in this section does not yield a satisfactory logic for a naıve theory of classes; and considering this problem will lead us to a much more general problem, which shows that any dialetheic naıve theory of classes will be unsatisfactory. The local problem is that the construction doesn't really avoid the problem for Brady's 3-valued logics that was discussed at the end of Section 7: it won't yield the result that all universal classes are coextensive, and hence won't yield what one would have thought would be a consequence of Extensionality, viz. Intuitive Extensionality for Universal Classes: ∀x(x ∈ a)∧∀x(x ∈ b)  ∀y(a ∈ y⇔ b ∈ y) Recall the example: a is {x :>}, b is {x : ¬(>→⊥)}. The logic has (*):  ¬(>→⊥) Abstraction: ∀u1...,∀un[(∀z ∈ {x : A(x;u1, ...,un)})A(z;u1, ...,un))∧ (∀z 3 A(z;u1, ...,un))(z ∈ {x : A(x;u1, ...,un)})∧ ∀z /∈ {x : A(x;u1, ...,un)})¬A(z;u1, ...,un))∧ (∀z 3 ¬A(z;u1, ...,un))(z /∈ {x : A(x;u1, ...,un)})] Extensionality Rule: Class(a)∧Class(b)∧ (∀u ∈ a)(u ∈ b)∧ (∀u ∈ b)(u ∈ a) ∧(∀u /∈ a)(u /∈ b)∧ (∀u /∈ b)(u /∈ a)  (∀z 3 (a ∈ z))(b ∈ z)∧ (∀z 3 (b ∈ z))(a ∈ z)∧ (∀z 3 (a /∈ z))(b /∈ z)∧ (∀z 3 (b /∈ z))(a /∈ z) while also having RQ-Weakening: ∀xBx  (∀x 3 Ax)Bx. 31We could have run the same argument using Negative RQ-Weakening, i.e. ¬∃xA(x)  All A are B, instead of Negative Conjunctive RQ-Weakening. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 29 (as presumably any satisfactory logic does), so it yields both (∀y)(y ∈ a) and (∀y)(y ∈ b). So a and b are on any reasonable criterion universal classes, and any interesting version of naıve class theory will declare them identical. We saw that Brady's theory didn't declare them identical, because it didn't even declare them co-extensive, and we blamed that on the absence of (Rule-)Weakening; but in fact, even the theory of this section, which has Weakening for→, fails to declare these universal classes coextensive. That would require the validity of (**): >⇔¬(>→⊥), which fails in the theory (since (>→ ⊥)→ ⊥ is invalid in it). Similarly, the failure of (**) shows that the right hand side of Intuitive Extensionality for Universal Classes fails for a and b, as can be seen by instantiating z with {y : ∀x(x ∈ y⇔>}. Of course we'd have (**) if we had Weakening for the conditional⇒ used in the Extentionality rule, but the construction of this section doesn't deliver that.32 In fact, there is a moral general argument here, which shows that no naıve dialetheic theory with involutive negation that validates modus ponens (and has the full structural rules) can satisfy a sufficiently strong form of the Extensionality Rule (where a dialetheic theory is one that postulates dialetheia, not merely allows them). The argument has three stages. First stage: in the presence of involutive negation, we need a contraposable ⇒ in the formulation of the Extensionality Rule, because with involutive negation, Extensionality with a non-contraposable→ simply can't hold. For suppose there are A and B such that (i)  A↔ B, but (ii) 6 ¬A↔¬B. Pick an arbitrary closed term t, and let c be {x : ¬(t ∈ x)}. Then  ¬A↔ ¬B is equivalent to  ¬(t ∈ {y : A})↔ ¬(t ∈ {y : B}), which in turn is equivalent to  {y : A} ∈ c↔ {y : B} ∈ c. So if (ii) then 6 {y : A} ∈ c↔ {y : B} ∈ c. But if (i) holds then also  ∀x(x ∈ {y : A} ↔ x ∈ {y : B}), and we have a violation of Extensionality for→. Second stage: no dialetheic theory (assuming it validates modus ponens) can validate even the following restricted version of Weakening for⇒: Special Weakening: If  A then >⇒ A. For suppose  B and  ¬B. Special Weakening applied to the first would yield  >⇒ B, and hence by contraposition  ¬B⇒ ⊥, which with modus ponens yields ⊥ and hence triviality. These two stages together show that in a dialetheic context with involutive negation (and modus ponens), we can't have even Special Weakening for the conditional used in the formulation of the Extensionality Rule. Now for the final stage, which is a generalization of the argument using (*) and (**): without Special Weakening for the conditional used in the formulation of Extensionality, Extensionality is too weak to deserve the name. To see this, suppose that B is a sentence for which Special Weakening fails: that is,  B but not  > ⇒ B. Since  B, the set b 32Perhaps its worth noting that the a and b for which the Intuitive Extentionality claim fails are ones for which the defining formula are not intuitively dialetheia; so there is no escape from the argument by confining the Intuitive Extensionality claim to classes not defined from dialetheia. (Such an attempted escape would in any case require a huge retreat. Someone who is dialetheic about the Russell class will think that it both is and isn't universal; if they want Extensionality, they should want the Russell class to both be and not be identical to {x : >}, and that there both aren't and are classes that contain {x : >} but don't contain the Russell class (or conversely). That involves accepting the Intuitive Extensionality claim as stated. A dialetheist who exempts classes defined from dialetheia from "counting" against Extensionality and related goals could no more claim success at a naıve theory of classes than could an advocate of restrictions on excluded middle who exempted from its purview classes defined by non-bivalent formulas.) PROSPECTS FOR A NAÏVE THEORY OF CLASSES 30 defined as {x : B} is a universal class. Let a be {x : >}; it too is a universal class. But they can only be declared coextensive if  > ⇔ B, which by hypothesis we don't have. Similarly by taking z to be {y : ∀x(x ∈ y⇔>}, we see that the right hand side of Intuitive Extensionality for Universal Classes fails. So we conclude that not only the particular dialetheic logic of this Section, but any dialetheic logic with modus ponens, must fail to yield a genuinely extensional naıve theory of classes. 10. TRIVIALITY But can we do better in some other way? The following theorem shows that, effectively, we cannot: Theorem 10.1. Suppose the logic of→ validates the axiom (**) (>→⊥)→⊥ and delivers the following axioms and rules: Modus Ponens A,A→ B ` B Weakening B ` A→ B Prefixing A→ B ` (C→ A)→ (C→ B) Suffixing A→ B ` (B→C)→ (A→C) Quantifier Axiom ` ∀xA(x)→ A(a) ⊥D f ` ⊥→ A Generalization A ` ∀xA ∧-Intro A,B ` A∧B Then the naıve theory of classes stated using→ is Post-inconsistent. To show this it will be convenient to introduce some abbreviations. First, we observe that every logic in this paper contains the prefixing and suffixing rules; in any such logic, the following three rules can be derived: Transitivity A→ B,B→C ` A→C Left Elimination A→ (B→C),D→ B ` A→ (D→C) Right Elimination A→ (B→C),C→ D ` A→ (B→ D). For readability, in what follows we'll abbreviate =ALT∗, introduced in n. 13, by ≈ (read as "is indiscernible from"), so that: a≈ b =d f ∀z(a ∈ z↔ b ∈ z) The crucial observation is that for≈ (though not for = as we've defined it), the suppositions of the theorem guarantee that the following quasi-substitutivity principle holds: Quasi-Substitutivity: A(a) ` a≈ b→ (>→ A(b)). This is reminiscent of the substitutivity principles we rejected in n. 14, but in addition to involving ≈ instead of =, the new principle has a '>→ ' that isn't in the principles we rejected; the following proof of Quasi-Substitutivity would not go through without it.33 33The principle also holds for =ALT when as we've been assuming identity in the ground language is classical (i.e. determinate), though it wouldn't hold in a more general setting. However, even there the modified principle Class(a),Class(b),A(a) ` a =ALT b→ (>→ A(b)) would hold; and since the main proof below will only use a,b such that ` Class(a)∧Class(b), this modified principle would suffice for the proof. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 31 Proof. (1) A(a) assumption (2) >→ A(a) 1, Weakening (3) a≈ b→ a ∈ {x : A(x)}↔ b ∈ {x : A(x)} Definition of ≈, Quantifier Axiom (4) a≈ b→ (A(a)→ A(b)) 3, Abstraction, Left and Right Elimination (5) a≈ b→ (>→ A(b)) 2, 4, Left Elimination  For any formula A, let pA =d f {x : A}, where x is any variable not free in A. Then it is clear that extensionality (together with Abstraction and Generalization) guarantees the following meta-rule: Extensionality Meta-Rule: If ` A↔ B then ` pA ≈ pB Finally, we define the Hinnion class h and the Hinnion sentence κ by: h=d f {y : py∈y ≈ p⊥} κ =d f h ∈ h The Hinnion class can be thought of as a variant of the Curry class, in which the relation ≈ between (naıve) classes is used in place of the conditional. The proof of the theorem is then as follows: Proof. (1) κ ↔ (p κ ≈ p⊥) Def. of h, Abstraction and Reflexivity (2) pκ ≈ {x : pκ ≈ p⊥} 1, Extensionality Rule (3) (pκ ≈ p⊥)→ (>→ {x : p⊥ ≈ p⊥} ≈ p⊥) 2, Quasi-Substitutivity (4) p⊥ ≈ p⊥ ⊥Df, Abstraction and Reflexivity (5) p⊥ ∈ {x : p⊥ ≈ p⊥} 4, Abstraction (6) ({x : p⊥ ≈ p⊥} ≈ p⊥)→ (>→ p⊥ ∈ p⊥) 5, Quasi-Substitutivity (7) p⊥ ∈ p⊥→⊥ Def. of p⊥+ Abstraction (8) ({x : p⊥ ≈ p⊥} ≈ p⊥)→ (>→⊥) 6, 7, Right Elimination (9) (>→⊥)→⊥ Assumption (**) (10) ({x : p⊥ ≈ p⊥} ≈ p⊥)→⊥ 8, 9, Transitivity (11) (pκ ≈ p⊥)→ (>→⊥) 3, 10, Right Elimination (12) (pκ ≈ p⊥)→⊥ 9, 11, Transitivity (13) κ →⊥ 1, 12, Transitivity (14) κ ↔⊥ 13, ⊥D f (15) pκ ≈ p⊥ 14, Extensionality Meta-Rule (16) ⊥ 12, 15, Modus Ponens (17) ⊥→ A Df⊥ (18) A 16, 17, Modus Ponens  How bad is this negative result? We argued extensively in Sections 7 and 9 that Weakening for the conditional used in the Extensionality law was non-negotiable. Modus Ponens, Prefixing, Suffixing and ⊥D f seem to us to have a similarly incontestable status. One might attempt to respond to the result by giving up on (**); indeed, that is the course followed in the Positive Brady and Positive Bacon logics of Section 8, where the Extensionality Rule used → rather than ⇒ and →-Weakening did hold. (The logic of Section 9 didn't have (**), but of course it didn't have Weakening for⇒ either.) But (**) follows from (∗) ` ¬(>→⊥) PROSPECTS FOR A NAÏVE THEORY OF CLASSES 32 by Weakening from a contraposable→. So if the Weakening Rule is assumed to hold for the conditional used in the Extensionality Rule, there are only two ways we could give up (**): (i) use a non-contraposable conditional in the Extensionality Rule or (ii) reject (*). We pointed out near the end of Section 9 that (i) is not viable in the presence of involutive negation. But (ii) is also seriously unattractive: not only does it require rejecting ContraModus-Ponens, it requires either rejecting all negated conditionals or else rejecting one of the rules ¬(A→ B) ` ¬(A→⊥) and ¬(A→⊥) ` ¬(>→⊥) since these obviously suffice for getting from any given negated conditional to ¬(>→⊥). So we conclude that the strength of Extensionality can be bought only at the price of naıveté, as we have been understanding it.34 There is, however, a different kind of response to the result, which is to offer an alternative understanding of naıveté. Perhaps the most obvious approach along these lines would be to try replacing the Abstraction Schema and Class-Abstracts from the Introduction by the following: Comprehension Schema: ∀u1, ...,∀un∃y[Class(y)∧∀x(x ∈ y⇔ A(x;u1, ...,un))] Obviously the Abstraction Schema and Class-Abstracts together entail Comprehension; but it's not obvious that the move from Comprehension to Abstraction could be achieved conservatively-even in the presence of Extensionality. With or without Extensionality, there is of course no problem in conservatively introducing 0-level abstracts, that is, abstracts of form {x : A(x;u1, ...,un)} where A(x;u1, ...,un) itself contains no abstracts. But the full Abstraction Schema we've been using was not limited in this way.35 There are two different ways to understand the Comprehension Schema, which yield different verdicts on whether Abstraction is or is not obtainable by what is understood as a conservative extension of Comprehension. The first ("indefinite extensibility") option takes the Schema to apply not just to the language in which it is stated (the language L+− that includes 'Class', '∈' and → but not the abstraction operator), but to any expansion of that language. In this case, the schema applies to the language with 0-level abstracts, and we can proceed to introduce 1-level abstracts in the same way we introduced 0-level abstracts. Since Comprehension applies to the language with 1-level abstracts too, we can introduce 2-level abstracts; and so on. In this way we get the Abstraction Schema conservatively from the Comprehension Schema so understood, without any form of Extensionality. 34It's worth noting that the proof targets not merely Extensionality, but the Extensionality Meta-Rule: it thus shows that even theories of naıve properties that accept the assumptions of Theorem 10.1 had better not individuate properties so coarsely that if two properties are defined by sentences that are provably coextensive in the full theory of properties, they are identical. For that would keep the indiscernibility of identicals from being valid. (For instance, in the theory of Field 2008, let a be the property {x : ⊥} and b the property {x : >→⊥}; then a and b are properties defined by logically equivalent sentences, but they shouldn't be declared identical because b definitely instantiates the property {x : ∀y(x ∈ y↔ b ∈ y} whereas a doesn't; see note 18.) But that doesn't seem at all devastating for naıve properties, for there are independent reasons for holding that properties should be individuated more finely than this; whereas the result cuts to the heart of the naıve theory of classes. 35We should note, however, that the argument against dialetheism at the end of Section 9 would remain even if Abstraction were weakened to Comprehension: for all of the abstracts employed in the argument were based on formulas that did not themselves contain abstracts, and would thus be available in the theory from Comprehension alone by conservative extension. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 33 A second, "tepid" understanding-which is perhaps the more usual one-is to take the Schema as applying only to its own language L+−. In that case, we could still advance beyond 0-level abstracts if our introduction of abstracts at each stage was definitional, for then we would in effect be introducing them into the object language. However, performing this extension conservatively requires Extensionality; moreover, it requires a stronger form of Extensionality than the rule form we've focused on, and indeed a stronger form than any we've shown to be consistent with Comprehension. So if the naıve theory of classes were formulated using Comprehension instead of Abstraction, and Comprehension interpreted in the "tepid" way, then it remains an open question whether the theory is consistent with a conditional logic including the "mandatory" principles we've been discussing. This leaves us with two open questions, a mathematical one and a philosophical one. The mathematical question concerns whether Comprehension and Extensionality are consistent in a reasonably strong logic (ideally, one of the attractive logics known to be consistent with the naıve theory of truth).36 The philosophical question concerns whether the theory with only Comprehension, and not Abstraction, can justly be called a naıve theory of classes. To put it another way: in accepting Comprehension, it seems plausible that the naıve theorist should accept the schema as applied to reasonable extensions of the language; but does the addition of { : } count as "reasonable" in the relevant sense? Clearly, if Comprehension and Extensionality are inconsistent in any logic with Weakening for⇒ (for example), the philosophical question would be moot. But if they do turn out to be consistent, there may be interesting questions about the appropriate understanding of "naıveté" in this context, questions we haven't attempted to address here. Even if Comprehension plus Extensionality are inconsistent in any reasonable logic, this result would not show that the logics of naıve truth and properties on offer in the literature are unsuited to the development of any extensional set theory at all. A naıve theory of classes is much stronger than what is required for a set theory which is consonant with the philosophical motivations for preserving naıveté for truth or properties. In fact, it's much stronger even than what's needed to use these non-classical logics to give a theory which is more satisfactory than standard classical theories which admit quantification over classes as well as over sets. For now, all we can say is this: If naıveté requires Abstraction, then the prospects for the naıve theory of classes are dim indeed. If Comprehension is enough for naıveté, however, there is still some hope. We've shown that a dialetheic theory will have trouble getting Comprehension and Extensionality in a reasonable logic, but we haven't shown that other logics will face analogous difficulties. Still, we have shown that if Comprehension and Extensionality are consistent, they must be shown to be so by a model construction which does not also validate Abstraction. So at the very least proving these principles consistent will require new methods, which are fundamentally different from Brady's, and from any of the techniques which are now standard in the literature on naıve truth. REFERENCES Bacon, A., "A new conditional for naive truth theory," Notre Dame Journal of Formal Logic, vol. 54 (2013), pp. 87–104. 3, 23, 38 Bacon, A., "Paradoxes of logical equivalence and identity," Topoi, (2013), pp. 1–10. 16 Beall, J., R. T. Brady, A. P. Hazen, G. Priest, and G. Restall, "Relevant restricted quantification," Journal of Philosophical Logic, vol. 35 (2006), pp. 587–598. 19, 27 36See Forti and Hinnion 1989 for an example where a form of Abstraction restricted to positive formulas is inconsistent with Extensionality, but the similarly restricted form of Comprehension is in fact consistent with it. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 34 Brady, R. T., "The consistency of the axioms of abstraction and extensionality in a threevalued logic.," Notre Dame Journal of Formal Logic, vol. 12 (1971), pp. 447–453. 2 Brady, R. T., "The simple consistency of a set theory based on the logic csq," Notre Dame Journal of Formal Logic, vol. 24 (1983), pp. 431–449. 2, 11, 12, 14, 18, 37 Brady, R. T., "The non-triviality of dialectical set theory," pp. 437–470 in Paraconsistent Logic: Essays on the Inconsistent, edited by Graham Priest and Richard Routley and Jean Norman, Munich: Philosophia Verlag, 2, 11, 26 Brady, R. T., Universal Logic, Stanford, California: CSLI Publications,, 2006. 2, 11, 12, 14, 18 Brady, R. T., "The simple consistency of naive set theory using metavaluations," Journal of Philosophical Logic, (2013), pp. 1–21. 2, 11, 20 Brady, R. T., and R. Routley, "The non-triviality of extensional dialectical set theory," pp. 415–437 in Paraconsistent Logic: Essays on the Inconsistent, edited by Graham Priest and Richard Routley and Jean Norman, Philosophia Verlag, 2 Cantini, A., "The undecidability of grisin's set theory," Studia logica, vol. 74 (2003), pp. 345–368. 2 Evans, G., "Can there be vague objects?," Analysis, vol. 38 (1978), p. 208. 16 Field, H. H., Saving truth from paradox, Oxford University Press Oxford, 2008. 19, 20, 23, 32 Field, H. H., "Naive truth and restricted quantification: Saving truth a whole lot better," The Review of Symbolic Logic, vol. 7 (2014), pp. 147–191. 19, 20 Forti, M., and R. Hinnion, "The consistency problem for positive comprehension principles," The Journal of symbolic logic, vol. 54 (1989), pp. 1401–1418. 2, 33 Friedman, H., "The consistency of classical set theory relative to a set theory with intuitionistic logic," The Journal of Symbolic Logic, vol. 38 (1973), pp. 315–319. 3 Gilmore, P. C., "The consistency of a positive set theory," Technical Report RC-1754, IBM Research Report, 2, 6 Gilmore, P. C., "The consistency of partial set theory without extensionality," in Axiomatic set theory, volume 13, pp. 147–153. American Mathematical Society, 6 Grišin, V., "Predicate and set-theoretic calculi based on logic without contractions," Mathematics of the USSR-Izvestiya, vol. 18 (1982), p. 41. 2 Hamkins, J., "Is there any research on set theory without extensionality axiom?," http://mathoverflow.net/questions/168287/is-there-any-research-on-set-theorywithout-extensionality-axiom, 3 Hinnion, R., "Naive set theory with extensionality in partial logic and in paradoxical logic," Notre Dame Journal of Formal Logic, vol. 35 (1994), pp. 15–40. 2 Hinnion, R., and T. Libert, "Positive abstraction and extensionality," The Journal of Symbolic Logic, vol. 68 (2003), pp. 828–836. 2 Hinnion, R., and T. Libert, "Topological models for extensional partial set theory," Notre Dame journal of formal logic, vol. 49 (2008), pp. 39–53. 2 Kripke, S. A., "Outline of a theory of truth," Journal of Philosophy, vol. 72 (1975), pp. 690–716. 6, 7 Maddy, P., "Proper classes," The Journal of Symbolic Logic, vol. 48 (1983), pp. 113–139. 2 Øgaard, T. F., "Paths to triviality," 2, 16 Scott, D., "More on the axiom of extensionality," Essays of the Foundations of Mathematics, (1961), pp. 115–139. 3 PROSPECTS FOR A NAÏVE THEORY OF CLASSES 35 Shirahata, M., "Linear set theory with strict comprehension," in Proc. 6th Annual Logic Conference, Singapore, pp. 223–245, 2 White, R., "The consistency of the axiom of comprehension in the infinite-valued predicate logic of łukasiewicz," Journal of Philosophical Logic, vol. 8 (1979), pp. 509–534. 2 APPENDIX A. AXIOMS AND RULES In the following array, we state rules and axioms for an arbitrary conditional→ (which except in the case of Ext1-5 is not assumed to be the "non-contraposable" conditional of the main text). The table which follows the array is not comprehensive; we have merely listed some important and familiar laws and rules validated by the various constructions in the paper. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 36 Ax1 A→ A reflexivity Ax2 A→ A∨B and B→ A∨B Ax3 A∧B→ A and A∧B→ B Ax4 ¬¬A↔ A double negation Ax5 A∧ (B∨C)→ (A∧B)∨ (A∧C) Ax6 (A→ B)∧ (A→C)→ (A→ B∧C) strong lattice ∧ Ax7 (A→C)∧ (B→C)→ (A∨B→C) strong lattice ∨ Ax8 (A→¬B)→ (B→¬A) contraposition axiom Ax9 ¬(A∧B)↔ (¬A∨¬B) deMorgan 1 Ax10 ¬(A∨B)↔ (¬A∧¬B) deMorgan 2 Ax11 (A→ B)→ ((B→C)→ (A→C)) suffixing axiom Ax12 (A→ B)→ ((C→ A)→ (C→ B)) prefixing axiom Ax13 (A→ B)∧ (B→C)→ (A→C) conjunctive syllogism Ax14 A→ (B→ A) weakening Ax15 (A→ B)∨¬(A→ B) conditional LEM R1 A,B ` A∧B adjunction R2 A,A→ B ` B modus ponens R3 A,¬B ` ¬(A→ B) Contra-Modus-Ponens R4 A,¬A ` B explosion MR1 A `C B `C A∨B `C reasoning by cases Q1 ∀xA→ A(x/a) a free for x Q2 ∀x(A∨B)→ A∨∀xB x 6∈ FV{A} Q3 ∀x(A→ B)→ (A→∀xB) x 6∈ FV{A} Q4 A(x/a)→∃xA a free for x Q5 A∧∃xB→∃x(A∧B) x 6∈ FV{A} Q6 ∀x(B→ A)→ (∃xB→ A) x 6∈ FV{A} RQ Γ ` A(x/y) Γ ` ∀xA y 6∈ FV (Γ∪{∀xA}) MR2 A(x/y) ` B ∃xA ` B y 6∈ FV{∃xA,B} Ext1 ∀x(x ∈ a⇔ x ∈ b) ` ∀x(a ∈ x⇔ b ∈ x) Ext2 ∀x(x ∈ a⇔ x ∈ b)→∀x(a ∈ x⇔ b ∈ x) Ext3 ∀x(x ∈ a⇔ x ∈ b)⇒∀x(a ∈ x⇔ b ∈ x) Ext4 ∀x(x ∈ a↔ x ∈ b) ` ∀x(a ∈ x↔ b ∈ x) Ext5 ∀x(x ∈ a↔ x ∈ b)→∀x(a ∈ x↔ b ∈ x) Some differences between our constructions are then as follows: PROSPECTS FOR A NAÏVE THEORY OF CLASSES 37 Section Axioms Name of Logic Brady 1983 (1983) ((0⇒A), Section 5) Logic For→: Ax1–Ax7, Ax9–Ax12, R1–R4, Q1–Q6, RQ, MR1–MR2 For⇒: Ax1–Ax12, R1–R4, Q1–Q6, RQ, MR1–MR2 TWQd+R3+R4 (Note: Brady uses "TNdQ" for TWQd+R3) Classes For⇒: Comprehension, Abstraction, Ext1, Ext2 Brady 2006 ((0⇒B), Section 5) Logic For→: Ax1–Ax7, Ax9–Ax13, Ax15 R1–R4, Q1–Q6, RQ, MR1–MR2 For⇒: Ax1–Ax13, Ax15 R1–R4, Q1–Q6, RQ, MR1–MR2 TJQd+R3+R4 Classes For⇒: Comprehension, Abstraction, Ext1, Ext2, Ext3 Positive Bacon (Only finite macro-stages; Logic For→: Ax1–Ax3, Ax5-Ax7, Ax11–Ax14, R1-R2, Q1–Q6, RQ, MR1 TJK+ Section 8) Classes For→: Comprehension, Abstraction, Ext4, Ext5 Positive Brady (Section 8) Logic For→: Ax1–Ax3, Ax5-Ax7, Ax11–Ax14, R1–R2, Q1–Q6, RQ, MR1–MR2 TJK+ + MR2 Classes For→: Comprehension, Abstraction, Ext4, Ext 5 4-valued Bacon (Only finite macro-stages; Section 9) Logic For→: Ax1–Ax7, Ax9–Ax14, R1–R3, Q1–Q6, RQ, MR1 For⇒: Ax1–Ax13, R1–R3, Q1–6, MR1 DTJK-MR2 Classes For⇒: Comprehension, Abstraction, Ext1, Ext2 4-valued Brady (Static version: Section 9) Logic For→: Ax1–Ax7, Ax9–Ax14, R1–R3, Q1–Q6, RQ, MR1–MR2 For⇒: Ax1–Ax13, R1–R3, Q1-6, MR1–MR2 DTJK Classes For⇒: Comprehension, Abstraction, Ext1, Ext2 We haven't included the variant of Brady with the 0⇒C clause (and dynamic microconstruction): as far as the laws in our list go it is like Brady 1983, and distinguished from that logic primarily by invalidating certain obviously undesirable laws such as ¬(> → ¬(A→ B)). In the 4-valued constructions, the dynamic variants lead to extra laws: for instance, Bacon in effect uses our (D−ii), which delivers the law ¬(A→>)→⊥ (as do (D−iii) and (D−iv) ); and the dynamic variants other than Bacon's yield ¬(⊥→ B)→⊥. Finally, we note a difference between our 4-valued BB-logic and Bacon's claims about his construction. On page 101 (12 in his list of axioms) Bacon claims that when negation is added his own "four-valued" construction delivers R4 (Explosion), together with Reflexivity. But as we show in the next Appendix, this claim must either be restricted or rejected entirely: the logic can achieve Explosion plus Reflexivity only by denying that certain paradoxical sentences express propositions, and restricting these laws to those sentences which express propositions. PROSPECTS FOR A NAÏVE THEORY OF CLASSES 38 APPENDIX B. BACON'S CONSTRUCTION The construction in Bacon 2013a appears on its face to be quite different from the one given here; the point of this Appendix is to show that they are in fact the same (so that Bacon's appeal to the Banach Fixed Point Theorem isn't really needed to give his result), and also to show that in his 4-valued construction he does not avoid the choice between accepting dialetheia and restricting Reflexivity or Contra-Modus-Ponens. B.1. Bacon's Positive Construction. We begin with the construction in the first five sections of his paper, which deal with the positive logic (i.e. the logic without involutive negation). Here are some of his definitions: Definition B.1. A function f : N→{0,1} flatlines if and only if for some n∈N, f (m) = 0 for each m > n. He takes W (the set of worlds) to be { f : N→ {0,1}| f flatlines}, and f 6 g iff for each n ∈ N, f (n)6 g(n). These "flatlining functions" are simply the characteristic functions of finite subsets of the natural numbers. His 6 corresponds to the relation of ⊆ on these subsets. For the positive logic, Bacon takes the propositions to be the non-empty downward closed subsets of W ; so the strongest proposition isn't /0, but rather { /0}. We can think of the proposition expressed by a sentence as the set of worlds at which it has value 1; so every sentence, even ⊥, has value 1 at the minimal world /0; it's the "inconsistent world". (So this minimal world is completely uninformative: perhaps it would be more natural to simply drop it, but to stay close to Bacon we will keep it.) We evaluate conjunction and disjunction by the lattice theoretic operations of meet (u) and join (t), defined as: ( f ug)(n) = min( f (n),g(n)), and ( f tg)(n) = max( f (n),g(n)). The quantifiers are evaluated analogously. This generates the usual conditions for the values of sentences at worlds. (Even the inconsistent world /0, since there is as yet no negation in the language.) To handle the conditional, Bacon proceeds as follows: Definition B.2. The rank of an element f ∈W , r( f ), is the smallest n such that f (m) = 0 for all m > n. (Equivalently, the rank of a finite subset f of N is the smallest n ∈ f such that all members of f are less than n.) Definition B.3. w∗ := { w(n) if n < r(w)−1 0 otherwise. Again, if we think of finite subsets of N, /0∗ = /0, and otherwise w∗ is what results from w by deleting its largest element. Bacon then defines the ternary accessibility relation R by: Definition B.4. Rwyz if and only if z6 w∗ and z6 y. In the set representation, that's z⊆ w∗∩ y. As usual in ternary relation semantics, he takes |A→ B|w = 1 iff ∀y∀z(if Rwyz and |A|y = 1 then |B|z = 1). So with this choice of R, we have in the set representation that PROSPECTS FOR A NAÏVE THEORY OF CLASSES 39 |A→ B|w = 1 iff ∀y∀z(if z⊆ w∗∩ y and |A|y = 1 then |B|z = 1); or equivalently, |A→ B|w = 1 iff ∀z[if ∃y(z⊆ w∗∩ y and |A|y = 1) then |B|z = 1]; which since the propositions are downward closed amounts to ($) |A→ B|w = 1 iff ∀z[if z⊆ w∗ and |A|z = 1 then |B|z = 1]. (This simplification of the ternary relation semantics is due to the particular choice of R.) To understand the implications of this, consider first the case where w is /0 or a singleton. Then w∗ is /0, and since every B gets value 1 at /0, the right-hand side of ($) always holds: thus at each singleton world, all conditionals get value 1. (Unlike the case of /0,⊥ and false ground-language sentences get value 0 at singletons.) So there is no difference in values from one singleton world to another. (None for conditionals; and so by induction, none for any other sentences either.) Thus the value of every sentence at a singleton world is just the value of that sentence at level 0 of the positive-logic Brady construction in Section 8. Similarly, when w is a doubleton world {m1,m2} where m1 < m2, ($) yields (since the case where z is /0 is trivial): |A→ B|{m1,m2} = 1 if and only if: if |A|{m1} = 1 then |B|{m1} = 1. But we saw above that for singleton worlds {m1}, values are the same whatever the m1; this then tells us that for doubletons too, the members don't matter, all that matters is that there be two members. A conditional gets value 1 at a doubleton if and only if, if the antecedent gets value 1 at stage 0 of the positive-logic Brady construction, so does the conclusion. And that's just the condition for it to have value 1 at level 1 of the positive-logic Brady construction. An obvious induction yields that when w is any finite set of positive cardinality n, the value of a conditional at w is just its value at n−1 in the positive logic Brady construction. So as we said in Section 8, validity in Bacon's positive logic is just preservation of value 1 at all finite levels of the positive-logic Brady construction. B.2. Bacon's Four-Valued Construction. We turn now to Bacon's introduction of involutive negation. It follows the general plan that we used to extend Brady's logic (no surprise, since we modeled that after Bacon's proposal): a 4-valued semantics obtained by assigning a positive and negative value (each in {0,1}) to each conditional, and extending this valuation to other sentences as in Section 9. The positive clause for the conditional is the one for the positive logic, except that here it's stated using + values: |A→ B|+w = 1 iff ∀y∀z( if z⊆ w∗∩ y and |A|+y = 1 then |B|+z = 1), which as above reduces to ($+) |A→ B|+w = 1 iff ∀z[ if z⊆ w∗ and |A|+z = 1 then |B|+z = 1]. As for the negative, Bacon says that there's a certain amount of choice, but he works with this one: |A→ B|−w = 1 iff ∃y∃z[w∗ 6 y and w6 z and |A|+y = 1 and |B|−z = 1], which simplifies to |A→ B|−w = 1 iff |A|+w∗ = 1 and |B|−w = 1; this is easily seen to be equivalent to Version (I) of the 4-valued Brady using dynamic clause (Dii) (though note again that Bacon goes only through the finite stages of this construction). Bacon takes validity to be the preservation of designatedness, but offers three choices for what that is. His preferred choice (his (iii), on pg. 100) is to take a sentence A to be PROSPECTS FOR A NAÏVE THEORY OF CLASSES 40 designated iff (a) for all w |A|+w is 1 and (b) for some w |A|−w is 0. Contrary to what Bacon claims, however, this has the consequence that A→ A is not generally valid: the sentence X of Section 9 illustrates this problem since, as we noted there, X → X has value 〈1,1〉 at every stage. Another of Bacon's proposals ((ii) on p. 100) is to take A to be designated iff for all w |A|+w is 1; this leads to a dialetheic logic similar to that in Section 9 (but which, because it doesn't proceed to the fixed point, has the additional oddity concerning reasoning by cases discussed in Section 8). The final possibility that he mentions (his (i)) involves taking a sentence A to be designated iff (a) for all w |A|+w is 1 and (b∗) for all w other than the inconsistent w, |A|−w is 0; this has the oddity that no negation of a conditional is ever valid, so we have an even worse failure of Contra-Modus-Ponens than the one contemplated in Section 9. These problems with Bacon's construction are masked by one particular oddity in his presentation. For some reason that he doesn't explain, Bacon builds into his proposal (iii) that a sentence doesn't count as expressing a proposition if it gets value 〈1,1〉 at every world. Similarly, he builds into his proposal (i) (the last one discussed here) that a sentence doesn't count as expressing a proposition if there are any worlds other than /0 where it has value 〈1,1〉. So on both of these proposals the sentence X from Section 9 is claimed not to express a proposition. Bacon doesn't tell us what to do with sentences which don't express propositions as far as the logic is concerned, but one might attempt to re-cast Bacon's proposal as one in which some sentences don't express propositions, and the laws are intended to hold only for those sentences which do. But this "fix" goes against the aim of the construction Bacon is engaged in, of achieving a logic for naıve truth: it deals with the paradoxes by taking a "no-proposition" view of some of them. (What's more, in the case of proposal (i) the "fix" wouldn't help with the problem about Contra-Modus-Ponens described at the end of the previous paragraph.) We conclude that the dialetheic version of Bacon's construction is the best; and that the construction of Section 9 gives a somewhat improved version of this, without the oddity about reasoning by cases. In any case, a nice thing about Bacon's logic (which he may not have known) is that it is one in which the naıve theory of classes, including extensionality, is consistent. (This virtue is preserved in the "4-valued Brady logic" of Section 9.) And unlike Brady's logics based on the 3-valued space {0,1/2,1}, it has the Rule of Weakening, indeed the Axiom of Weakening; we take this to be important at least if the conditional is to be of use for restricted quantification. But as we say in the text, further improvement would be needed before declaring this a satisfactory naıve theory of classes.