Skip to main content
Log in

Compositionality Solves Carnap’s Problem

  • Original Article
  • Published:
Erkenntnis Aims and scope Submit manuscript

Abstract

The standard relation of logical consequence allows for non-standard interpretations of logical constants, as was shown early on by Carnap. But then how can we learn the interpretations of logical constants, if not from the rules which govern their use? Answers in the literature have mostly consisted in devising clever rule formats going beyond the familiar what follows from what. A more conservative answer is possible. We may be able to learn the correct interpretations from the standard rules, because the space of possible interpretations is a priori restricted by universal semantic principles. We show that this is indeed the case. The principles are familiar from modern formal semantics: compositionality, supplemented, for quantifiers, with topic-neutrality.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. Koslow (2010) also discusses Carnap’s problem in the light of Church’s criticism, but does not put forward compositionality as a solution.

  2. There is debate about whether it is also a sufficient condition; see e.g. Bonnay (2008).

  3. Mustn’t we also assume that the learner knows which expressions are logical and which are not? This can be a problematic distinction in more complex languages. But for the simple logical languages under discussion in this paper, there is a test our hypothetical learner could in principle perform, again relying only on the given consequence relation: Consider a valid inference in which expression e occurs. Does it remain valid under all replacements of e by another expression of the same category (or better, under all appropriate reinterpretations of e)? If Yes (for all such inferences), e is not a logical expression; if No, it is. For (much) more about this methodology, see Bonnay and Westerståhl (2012).

  4. This is the current terminology, e.g. in Humberstone (2011); in Carnap (1943) they are called interpretations. We use “interpretation” too in later sections, though in a slightly different sense, since our interpretations are by definition compositional.

  5. Which is why we don’t give the proofs here.

  6. See Antonelli (2013) for an adequate framework to carry out this restriction.

  7. We could also treat W simply as a parameter, and let a (global) interpretation be a functor I which to each W assigns a (local) interpretation \(I^W\) as defined here. The same results would hold for global interpretations.

  8. Assignments are usually called valuations in possible worlds semantics, but we already used that term in Sect. 4 for mappings from sentences to truth values. There too, we could have avoided the non-triviality assumption by treating propositional letters as variables, but we wanted to stay as close as possible to Carnap’s original setting.

  9. Thus, a model in the usual sense, for the basic modal language with operators \(\lnot ,\wedge ,\vee ,\rightarrow ,\Box \), has the form (Wf) if we, like Carnap, think of necessity as truth in all worlds, and the form (WRf), where \(R\subseteq W^2\), in Kripke semantics. Then we have, for \(w\in W\),

    $$ (W,R,f),w\,\vDash \,\varphi \,\,{\hbox{iff}}\,\,w \in \llbracket \varphi \rrbracket ^{I_{\mathrm{n}}}_f $$

    The normal interpretation of \(\Box \) is, in the Carnap setting,

    $$ I_{\mathrm{n}}(\Box )(X) = \left\{ \begin{array}{ll} W &{} \quad {\hbox{if}} \quad X=W \\ \emptyset &{} \quad {\hbox{if}} \quad X\ne W \end{array} \right. $$

    for \(X \subseteq W\), whereas in the Kripke setting it is

    $$ I_{\mathrm{n}}(\Box )(X) = \{w\in W: R(w) \subseteq X\} $$

    where \(R(w) = \{w^{\prime }: wRw^{\prime }\}\). Here, however, we are only concerned with the interpretations of \(\lnot ,\wedge ,\vee ,\rightarrow \), and then it doesn’t matter which setting is chosen.

  10. We thank an anonymous referee for drawing our attention to this fact, and for encouraging us to include the proof of (5) in the paper.

  11. Antonelli’s result concerns non-standard models in which \(\forall \) is interpreted by a (not necessarily principal) filter. Since such non-standard models falsify some first-order validities, it is natural to ask which further restriction on the interpretation of \(\forall \) would be such that it guarantees that valid formulas are exactly those deemed valid by first-order logic, and that it is forced in any model of the validities of first-order logic.

  12. If persistence had been a requirement on assignments, this particular f might not be allowed; see the remarks at the end of Sect. 6.

References

  • Antonelli, A. (2013). On the general interpretation of quantifiers. Review of Symbolic Logic, 6, 637–658.

    Article  Google Scholar 

  • Bencivenga, E. (1986). Free logics. In D. Gabbay & F. Guenthner (Eds.), Handbook of philosophical logic, Vol. III: Alternatives to classical logic (pp. 373–426). Dordrecht: Reidel.

    Chapter  Google Scholar 

  • Bonnay, D. (2008). Logicality and invariance. Bulletin of Symbolic Logic, 14(1), 29–68.

    Article  Google Scholar 

  • Bonnay, D., & Westerståhl, D. (2012). Consequence mining: Constants versus consequence relations. Journal of Philosophical Logic, 41(4), 671–709.

    Article  Google Scholar 

  • Carnap, R. (1943). Formalization of logic. Vol. 2 of Studies in semantics. Cambridge, MA: Harvard University Press.

  • Church, A. (1944). Review of Carnap 1943. Philosophical Review, 53, 493–498.

    Article  Google Scholar 

  • Cori, R., & Lascar, D. (2000). Mathematical logic. Oxford: Oxford University Press.

    Google Scholar 

  • Garson, J. W. (2013). What logics mean. Cambridge: Cambridge University Press.

    Book  Google Scholar 

  • Hacking, I. (1979). What is logic? Journal of Philosophy, 76, 285–319.

    Article  Google Scholar 

  • Hjortland, O. T. (2014). Speech acts, categoricity and the meaning of logical connectives. Notre Dame Journal of Formal Logic, 55(4), 445–467.

    Article  Google Scholar 

  • Holliday, W. (2015). Possibility frames and forcing for modal logic. Manuscript.

  • Humberstone, L. (1981). From worlds to possibilities. Journal of Philosophical Logic, 10(3), 313–339.

    Article  Google Scholar 

  • Humberstone, L. (2011). The connectives. Cambridge, MA: MIT Press.

    Google Scholar 

  • Koslow, A. (2010). Carnap’s problem: What is it like to be a normal interpretation of classical logic? Abstracta, 6(1), 117–135.

    Google Scholar 

  • Mostowski, A. (1957). On a generalization of quantifiers. Fundamenta Mathematicae, 44, 12–35.

    Google Scholar 

  • Murzi, J., & Hjortland, O. (2009). Inferentialism and the categoricity problem: Reply to Raatikainen. Analysis, 69(3), 480–488.

    Article  Google Scholar 

  • Pagin, P., & Westerståhl, D. (2010). Compositionality I (definitions and variants) and II (arguments and problems). Philosophy Compass, 5(3), 250–282.

    Article  Google Scholar 

  • Rumfitt, I. (2000). ‘Yes’ and ‘no’. Mind, 109, 781–823.

    Article  Google Scholar 

  • Shoesmith, D. J., & Smiley, T. J. (1978). Multiple-conclusion logic. Cambridge: Cambridge University Press.

    Book  Google Scholar 

  • Smiley, T. (1996). Rejection. Analysis, 56(1), 1–9.

    Article  Google Scholar 

  • Steinberger, F. (2011). Why conclusions should remain single. Journal of Philosophical Logic, 44, 333–355.

    Article  Google Scholar 

  • van Lambalgen, M. (1992). Independence, randomness and the axiom of choice. Journal of Symbolic Logic, 57, 1274–1304.

    Article  Google Scholar 

  • Westerståhl, D. (1996). Self-commuting quantifiers. Journal of Symbolic Logic, 61, 212–224.

    Article  Google Scholar 

Download references

Acknowledgments

We are grateful to an anonymous referee for comments that helped us improve an earlier version of this paper, and we thank Aldo Antonelli, Johan van Benthem, Serge Bozon, Paul Egré, Fredrik Engström, Wes Holliday, Ed Keenan, Sebastiano Moruzzi, Peter Pagin, Stanley Peters, and Shane Steinert-Threlkeld for remarks and suggestions at the various occasions when one of us presented our results about Carnap’s Problem. Both authors wish to acknowledge support from the ANR-10-LABX-0087 IEC and ANR-10-IDEX-0001-02 PSL Grants. The second author thanks the Center for the Study of Language and Information (CSLI) at Stanford University for hospitality and intellectual stimulation when he was a visiting Research Fellow there for the spring quarter 2015, during which the final version of the paper was written.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dag Westerståhl.

Appendix

Appendix

Our first aim in this appendix is to prove (3) and (4) above. Claim (3) states that an interpretation Q for \(\forall \) is consistent with \(\vDash _L\) iff Q is a principal filter, closed under the interpretation of terms. As announced, we consider a language with predicate variables. Given a set M, we shall say that a set Q of subsets of M is a commutative filter if and only if it is a filter and is such that for all relations R on M,

$$ \{ a \in M | \,R(a) \in Q \} \in Q\,\,{\hbox{iff}}\,\,\{a \in M |\,R^{-1}(a) \in Q \} \in Q $$

where R(a) is \(\{ b \in M | \,aRb \}\). We shall prove that the following are equivalent:

  1. (a)

    \({\mathcal {M}}, Q\) is consistent with \(\vDash _L\),

  2. (b)

    Q is a commutative filter on M and, for any term \(t(x_1,\ldots ,x_n)\),

    \({\mathcal {M}},Q\,\vDash\,\forall x Px \rightarrow \forall x_1\ldots \forall x_n Pt(x_1,\ldots ,x_n)\)

  3. (c)

    Q is a principal filter on M which is closed under the interpretation of terms in \({\mathcal {M}}\).

(3) is the equivalence between (a) and (c). In a different context, the proof that commutative filters are principal can be found in Lambalgen (1992), and the connection with the interpretation of the universal quantifier is already made in Westerståhl (1996). We provide here a somewhat shorter proof for that part. It is inspired by the filter model given by Antonelli (2013), Theorem 6.10, to prove that some valid first-order formulas are not valid in some models where the universal quantifier is interpreted by a filter (of course, in those models, the filter is not principal).Footnote 11 And we provide proofs of the other parts.

Proof

(a) implies (b). The fact that Q is a filter is easily read off from some familiar first-order validities. Closure under finite intersection follows from the fact that \((\forall x \varphi \wedge \forall x \psi ) \rightarrow \forall x (\varphi \wedge \psi )\) is valid. The validity of \(\forall x(\varphi \wedge \psi ) \rightarrow \forall x\varphi \) ensures closure under supersets. The empty set does not belong to Q because of \(\forall x\varphi \rightarrow \lnot \forall x \lnot \varphi \). Moreover, Q is commutative because \(\forall x \forall y \varphi \rightarrow \forall y \forall x \varphi \) is valid. Finally, \({\mathcal {M}}, Q\,\vDash\,\forall x Px \rightarrow \forall x_1\ldots \forall x_n Pt(x_1,\ldots ,x_n)\) simply because the formula is valid in first-order logic.

(b) implies (c). First, let Q be a non-principal filter; we show that Q is not commutative. Take any X in Q and consider (using Zorn’s Lemma) a maximal set \(Q^{\prime }\) of subsets of X belonging to Q and totally ordered by inclusion. We define a parallel indexed family \(\{ \Delta _Y \}_{Y \in Q^{\prime }}\) by setting \(\Delta _Y = X - Y\) for \(Y \in Q^{\prime }\). We shall furthermore consider the sets \(Z = \bigcup _{Y \in Q^{\prime }} \Delta _Y\) and \(Z^{\prime } = X - Z\). The intuition is the following. \(Q^{\prime }\) can be thought of as dropping more and more elements out of X. The indexed family \(\{ \Delta _Y \}_{Y \in Q^{\prime }}\) collects the elements which have been dropped at each stage. Z is then the set of elements which have been dropped at some stage and \(Z^{\prime }\) is the set of those that are never dropped. We now define a relation \(R \subseteq M \times M\) for which commutativity will fail, by

$$ R = \left(\bigcup_{Y \in Q^{\prime }} (\Delta _Y \times Y)\right) \cup (Z^{\prime } \times X) $$

First, note that \(\{ a \in M |\, R(a) \in Q \} = Z \cup Z ^{\prime } =X\). It is then sufficient to show that \(\{ a \in M | \,R^{-1} (a) \in Q \} =Z^{\prime }\), because \(Z^{\prime }\) does not belong to Q. If it did, \(Z^{\prime }\) would be by construction the smallest element of \(Q^{\prime }\). Since Q is not principal, \(Q^{\prime }\) cannot have a smallest element. To see this, let C be such a smallest element. Since Q is not principal, there is D in Q such that \(C \cap D\) is a proper subset of C, but then, since \(Q^{\prime }\) is assumed to be maximal, \(C \cap D\), which belongs to Q, also belongs to \(Q^{\prime }\), contradicting the fact that C is the smallest element of \(Q^{\prime }\). Thus, it remains only to be shown that \(\{a \in M | \,R^{-1} (a)\in Q\}=Z^{\prime }\). We reason by cases:

  • If a is not in X, \(R^{-1} (a)= \emptyset \), which is not in Q.

  • If a is in Z, \(R^{-1} (a)= (\bigcup _{a \in Y} \Delta _Y )\cup Z^{\prime }\). Since a is in Z, there is a B in \(Q^{\prime }\) such that a is not in B. Moreover, \((\bigcup _{a \in Y} \Delta _Y) \cap B = \emptyset \), since \(a \in Y\) implies that B is strictly included in Y, \(Q^{\prime }\) being totally ordered by inclusion. As a consequence, \(R^{-1} (a) \cap B=Z^{\prime }\). Since B is in Q and \(Z^{\prime }\) is not, closure under intersection of Q implies that \(R^{-1} (a)\) is itself not in Q.

  • If a is in \(Z^{\prime }\), \(R^{-1} (a)=Z \cup Z^{\prime }=X\), which is in Q.

This proves that Q is not commutative. Hence, by contraposition, if Q is commutative, Q is principal. It remains to be shown that Q is closed under the interpretation of terms in \({\mathcal {M}}\). Let Q be generated by A, let \(t(x_1,\ldots ,x_n)\) be a term, and let \(a_1,\ldots ,a_n \in A\). We need to show that \(||t(x_1,\ldots ,x_n)||^{\mathcal {M}}(a_1,\ldots ,a_n) \in A\). First, interpret P by A. Then \({\mathcal {M}}, Q\,\vDash\,\forall x Px\), and therefore \({\mathcal {M}},Q\,\vDash\,\forall x_1\ldots \forall x_n Pt(x_1,\ldots ,x_n)\) by hypothesis. By the satisfaction clause for \(\forall \), it follows that

$$ \{\langle a_1,\ldots ,a_n \rangle \,| \,||t||^{\mathcal {M}}(a_1,\ldots ,a_n) \in A\} \supseteq A^n $$

as desired.

(c) implies (a). Let Q be a principal filter generated by a subset A of M. Let H be a Hilbert proof system for \(\vDash _L\) as for example the one given by Cori and Lascar (2000), p. 194; predicate symbols and predicate variables being treated on a par. It suffices to establish that if \(\varphi \) is deducible from a set \(\Gamma \) of formulas without free individual variables in H, then \({\mathcal {M}}, Q\,\vDash\,\Gamma \) implies \({\mathcal {M}}, Q\,\vDash\,\varphi \,\sigma \) for any assignment \(\sigma \) whose individual variables have values in A. This is shown by induction on the length of proofs in H. The two key steps are the soundness of universal instantiation, which, in H, is the axiom \(\forall x\varphi \rightarrow \varphi [t/x]\) with t free for x in \(\varphi \), and the soundness of universal generalization, which, in H, is inferring \(\forall x \varphi \) from \(\varphi \) (no restriction on x is needed since we consider only derivability from a set of sentences without free individual variables).

For universal instantiation, we need to show that \({\mathcal {M}}, Q\,\vDash\,\varphi [t/x]\) for any assignment \(\sigma \) with values in A, for any term t free for x in \(\varphi \). Assume \({\mathcal {M}},Q\,\vDash\,\forall x\varphi \,\sigma \). By the satisfaction clause for \(\forall \) and the fact that Q is a filter generated by A, the set of objects \(a \in M\) such that \({\mathcal {M}}, Q\,\vDash\,\varphi \,\sigma [x:=a]\) is a superset of A. Hence, for our term t with free variables \(x_1,\ldots ,x_n\), \({\mathcal {M}},Q\,\vDash\,\varphi [t/x]\,\sigma \), because \(\sigma (x_1),\ldots ,\sigma (x_n)\) are in A and A is closed under the interpretation of t.

For universal generalization, we assume that we have a proof of a formula \(\forall x \varphi \) from a set of sentences \(\Gamma \) in H ending with an application of universal generalization. By induction hypothesis, \({\mathcal {M}}, Q\,\vDash\,\Gamma \) implies \({\mathcal {M}}, Q\,\vDash\,\varphi \,\sigma \) for any assignment \(\sigma \) with values in A. Assume \({\mathcal {M}},Q\,\vDash\,\Gamma \). Let \(\sigma \) be an arbitrary assignment with values in A. By induction hypothesis, the set of objects \(a \in M\) such that \({\mathcal {M}},Q\,\vDash\,\varphi \,\sigma [x:=a]\) is a superset of A. Therefore, since Q is interpreted by a filter generated from A, we have \({\mathcal {M}}, Q\,\vDash\,\forall x\varphi \,\sigma \), as required. \(\square \)

Next, we show

  1. (4)

    A principal filter Q on M is invariant under permutation if and only if \(Q= \{ M \}\).

Proof

The direction from right to left is immediate. We prove the direction from left to right by contraposition. Assume that Q is a principal filter generated by a set A different from M. There are \(a,b \in M\) such that \(a \in A\) but \(b \not \in A\). Consider the permutation \(\pi \) which swaps a and b and is the identity everywhere else. a is not in \(\pi (A)\), so A is not included in \(\pi (A)\), so \(\pi (A)\not \in Q\), contradicting the invariance of Q. \(\square \)

Finally, we give a proof of

  1. (5)

    If I is an interpretation consistent with an intensional logic, then I is normal on the connectives \(\lnot ,\wedge ,\vee ,\rightarrow \).

Let W be a set of worlds, and I an interpretation (over W) consistent with an intensional logic \(\vdash \). We must show that \(I(\#)\) is normal for \(\# \in \{\lnot ,\wedge ,\vee ,\rightarrow \}\). Recall that for all formulas \(\varphi ,\psi ,\theta \),

  1. (6)

    \(\varphi ,\psi \vdash \theta \,\) implies that for each assignment f, \(\llbracket \varphi \rrbracket ^{I}_f \cap \llbracket \psi \rrbracket ^{I}_f \subseteq \llbracket \theta \rrbracket ^{I}_f\).

To begin, it is clear that \(\wedge \) is normal:

  1. (7)

    For \(X,Y \subseteq W\), we have \(I(\wedge )(X,Y) = X\cap Y\).

This follows from the introduction and elimination rules for \(\wedge \), in the form

  • \(p\wedge q\,\vDash _{PL}\, p\),   \(\;p\wedge q\,\vDash _{PL}\,q\),   \(p, q\,\vDash _{PL}\,p\wedge q\)

Since \(\vdash \) is an intensional logic, these rules hold for \(\vdash \) too. Let f be such that \(f(p)=X\) and \(f(q)=Y\). Using (6), (7) follows.

Next, we observe that it suffices to show that negation must be interpreted normally by I.

  1. (8)

    If \(I(\lnot )(X) = W-X\) for all \(X\subseteq W\), then also \(\vee \) and \(\rightarrow \) are interpreted normally by I.

Consider \(\vee \). From the fact that \(p \vdash p\vee q\) and \(q \vdash p\vee q\) we obtain from (6) (with a suitable f) that \(X \cup Y \subseteq I(\vee )(X,Y)\). Also, since \(p \vee q,\lnot p \vdash q\) we get, using our assumption,

$$ I(\vee )(X,Y) - X \,\subseteq \, Y $$

In other words, \(I(\vee )(X,Y) \subseteq X \cup Y\), so \(I(\vee )(X,Y) = X\cup Y\), as was to be proved. In a similar way, one shows that \(I(\rightarrow )(X,Y) = (W-X)\cup Y\). (The assumption about negation is used to show \(W-X \subseteq I(\rightarrow )(X,Y)\), which one gets from \(\,\lnot p \vdash p\rightarrow q\).) This proves (8).

Concerning negation, it is easy to see that the following holds:

  1. (9)

    For all \(X\subseteq W\), \(\,X \cap I(\lnot )(X) = \emptyset \).

This follows from the fact that \(p,\lnot p \vdash q\), letting \(f(p) = X\) and \(f(q)=\emptyset \). Thus, all that remains to prove is:

  1. (10)

    For all \(X\subseteq W\), \(\,X \cup I(\lnot )(X) = W\).

First, it follows from (9) that

  1. (11)

    \(I(\lnot )(W) = \emptyset \)

Also,

  1. (12)

    \(I(\lnot )(\emptyset ) = W\)

This is because \(\vdash \lnot (p\wedge \lnot p)\) (so for all f, \(\llbracket \lnot (p\wedge \lnot p) \rrbracket ^{I}_f = W\)), with \(f(p)=\emptyset \); recall that conjunction is interpreted as intersection.

Now suppose \(\emptyset \subsetneq X \subsetneq W\) and, for contradiction, that \(X \cup I(\lnot )(X) \ne W\). Take \(b \in W - (X \cup I(\lnot )(X))\), and let \(f(p)=X\) and \(f(q)=\{b\}\).Footnote 12 Then we have

  1. (13)

    \(\llbracket \lnot (p\wedge q) \rrbracket ^{I}_f = W\)

[by (12), since \(X \cap \{b\} = \emptyset \)], and

  1. (14)

    \(\llbracket \lnot p\wedge q \rrbracket ^{I}_f = \emptyset \)

(since \(I(\lnot )(X) \cap \{b\} = \emptyset \)). Since \(\,\lnot (p\wedge q) \vdash \lnot p\vee \lnot q\), it follows from (13) that

  1. (15)

    \(\llbracket \lnot p\vee \lnot q \rrbracket ^{I}_f = W\)

And since \(\,\lnot (p\vee \lnot q) \vdash \lnot p\wedge q\), it follows from (14) that \(\llbracket \lnot (p\vee \lnot q) \rrbracket ^{I}_f = \emptyset \), and hence by (12) that \(\llbracket \lnot \lnot (p\vee \lnot q) \rrbracket ^{I}_f = W\). Thus, since \(\,\lnot \lnot (p\vee \lnot q) \vdash p\vee \lnot q\), we have

  1. (16)

    \(\llbracket p\vee \lnot q \rrbracket ^{I}_f = W\)

But we also have

$$ p\vee \lnot q, \lnot p\vee \lnot q \vdash \lnot q $$

By (15) and (16), this entails that \(\llbracket \lnot q \rrbracket ^{I}_f = I(\lnot )(\{b\}) = W\), contradicting the fact [from (9)] that \(\{b\}\cap I(\lnot )(\{b\}) = \emptyset \). This proves (10), and thereby (5), as desired.\(\square \)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bonnay, D., Westerståhl, D. Compositionality Solves Carnap’s Problem. Erkenn 81, 721–739 (2016). https://doi.org/10.1007/s10670-015-9764-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10670-015-9764-8

Keywords

Navigation