Abstract
On the Lockean thesis one ought to believe a proposition if and only if one assigns it a credence at or above a threshold (Foley in Am Philos Q 29(2):111–124, 1992). The Lockean thesis, thus, provides a way of characterizing sets of all-or-nothing beliefs. Here we give two independent characterizations of the sets of beliefs satisfying the Lockean thesis. One is in terms of betting dispositions associated with full beliefs and one is in terms of an accuracy scoring system for full beliefs. These characterizations are parallel to, but not merely derivative from, the more familiar Dutch Book (de Finetti in Theory of probability, vol 1, Wiley, London, 1974) and accuracy (Joyce in Philos Sci 65(4):575–603, 1998) arguments for probabilism.
Similar content being viewed by others
1 Introduction
The Lockean thesis is a way of connecting all-or-nothing beliefs to graded beliefs (Foley 1992). On the Lockean thesis to have an all-or-nothing belief in a proposition is just to assign that proposition a credence at or above a certain threshold.Footnote 1 One of the interesting and controversial features of Lockeanism is that it does not require that one’s beliefs either be closed under logical implication or even be logically consistent. For a failure of closure note that at any threshold less than one it is possible, on the Lockean thesis, to believe two atomic propositions P and Q while not believing their conjunction \( P \& Q\).Footnote 2 For a failure of consistency note that at a threshold of .6 it is possible to believe two propositions P and Q, while also believing the negation of their conjunction, \( \lnot (P \& Q)\).Footnote 3
We might want to ask what constraints the Lockean thesis does put on one’s beliefs. Some constraints are well-known: for example, if the threshold for belief is greater than .5, then, on the Lockean thesis, one cannot hold pairwise inconsistent beliefs.Footnote 4 However, these constraints do not give necessary and sufficient conditions for satisfying the Lockean thesis. For, as we will see in Sect. 3, there are sets of beliefs that are pairwise consistent but that cannot be believed by a Lockean with a threshold greater than one-half. Pairwise consistency is thus a necessary but not sufficient condition for the Lockean thesis with a threshold greater than one-half.
In this paper, I present two types of characterizations of sets of beliefs for Lockean agents. I draw on a pair of related traditions for characterizing graded beliefs. The first tradition is that of the Dutch Book argument for probabilism. In this tradition, graded beliefs can be characterized by how they rationalize bets. Very roughly speaking, the Dutch Book argument establishes that a set of numerically graded beliefs is probabilistically coherent if and only if there is no collection of bets it rationalizes that leads to a sure loss.Footnote 5 The second tradition is that of the accuracy argument for probabilism. In this tradition, numerical credences are given scores corresponding to their accuracy.Footnote 6 The standard genre of result in this area is to establish that the numerical credences that are probabilistically coherent are equivalent to those that are not dominated in all worlds by another set of credences.
In the case of graded beliefs, both Dutch Book arguments and accuracy arguments are used to characterize probabilistically coherent beliefs. What I show here is that these arguments can also be used to characterize all-or-nothing beliefs satisfying the Lockean hypothesis.
It is worth noting that such a characterization is not directly extractable from the current literature. For accuracy arguments and Dutch Book arguments characterize probabilistically coherent numerical credences, and probabilistically coherent beliefs have a many-to-one relationship to Lockean belief sets.
What I give here is a direct characterization of Lockeanism via betting dispositions and accuracy scoring rules. To do this I propose mappings from sets of all-or-nothing beliefs to betting dispositions and scores and, I show that, relative to these mappings, Lockean belief sets can be characterized by properties of betting dispositions and scores.
In the accuracy tradition there is already a small literature that addresses how the Lockean thesis fares vis-a-vis accuracy arguments.Footnote 7 This paper advances that literature by providing more general results than previously available and by proving, as a corollary of these results, the main conjecture of Easwaran (2016). There is, by contrast, no literature that I am aware of relating Dutch Book arguments to the Lockean account of belief. This paper fills that lacuna by giving two Dutch Book results for the Lockean account of belief.
The significance of these results varies depending on your starting assumptions. For example, these results show that if a set of full beliefs satisfy certain decision-theoretic properties relative to a range of scoring systems then they are compatible with the Lockean thesis. If you think of full belief as basic and use Lockeanism as a way to derive ranges of graded beliefs from them (as in Easwaran 2016), then these results gives you the conditions on which this is possible. Likewise, if, as in classical works in decision theory, you take certain kinds of betting dispositions as basic, these results give necessary conditions for the dispositions to be associated with sets of full beliefs. More generally, these results make connections between three ways of thinking about epistemic commitments: graded beliefs, full beliefs, and betting dispositions.
Here is the plan: In the next Sect. 2, I outline the basic formal framework for modeling full beliefs. In Sect. 3, I define three different ways in which sets of full beliefs can be said to satisfy the Lockean thesis. In Sect. 4, I discuss ways of deriving betting dispositions from full beliefs and what it takes for those dispositions to be subject to a Dutch Book. In Sect. 5, I give an accuracy scoring system for full beliefs and state some standard decision-theoretic properties of full belief sets relative to the system. In the main section, Sect. 6, I give results linking these different ways of thinking about full beliefs. I discuss the significance of these results in Sect. 7 and outline a few directions for future work in Sect. 8.
2 Framework
Let W be a finite set of worlds, which we will enumerate \(w_1 \ldots w_m\), let P denote the set of proposition \(2^W\),Footnote 8 enumerated \(p_1 \ldots p_n\), with \(n = 2^m\). Let \(B \subseteq P\) be a subset of o beliefs, \(o \le n\), enumerated \(b_1 \ldots b_o\). Let t be a real number, \(0 < t \le 1\), which we’ll call the threshold. We think of B as representing the set of propositions an agent believes (and hence \(P \backslash B\) is the set of propositions the agent doesn’t believe). In the next three sections we define various properties that B can have, always in terms of the threshold t. These properties fall into three categories: Lockean properties, betting properties, and accuracy properties.
3 Lockean Belief Sets
On the Lockean thesis one should believe a proposition p if and only if you assign it a credence greater than or equal to some threshold t. In this section we will define three different versions of the Lockean thesis.
Let us begin by defining credences. A credence function is a function c from P to [0, 1] such that \(c(W) = 1\), \(c(\emptyset ) = 0\) and for any \(X \subseteq W\), \(c(X) = \sum \nolimits _{w \in W} c(\{w\})\).Footnote 9
Since we use linear algebra to prove most of our main results it will be useful to rephrase some of the notions in terms of vectors. We correspond to each proposition p an m-dimensional vector \(\mathbf {p} = \langle p_1, \ldots ,p_m \rangle \) such that
We correspond to the credence function c the vector \(\mathbf {c} = \langle c_1 \ldots c_m \rangle \) such that \(c_i = c(\{w_i\})\). In this case it is easy to see that \(c(p) = \mathbf {c} \cdot \mathbf {p}\).Footnote 10
We begin with the weakest sense in which B can satisfy the Lockean thesis relative to the threshold t. The set of beliefs B is Lockean Compatible relative to the threshold t iff there is a credence function c such that: \(c(b) \ge t\) (or \(\mathbf {c} \cdot \mathbf {b} \ge t\)) for all \(b \in B\).
To say that B is Lockean Compatible is not equivalent to saying that B can be a Lockean’s total set of beliefs. For if a Lockean’s belief set B contains two propositions x and y it also ought to contain \(x \cup y\), but Lockean Compatibility does not guarantee this.
For this reason, we will define two stronger notions as well. The set of beliefs B is Lockean Complete at threshold t iff there is a credence function c such that \(c(p) \ge t\) iff \(p \in B\) for all \(p \in P\). We will also use a weakening of Lockean Completeness to allow cases where belief is optional at the threshold t. We say B is Almost Lockean Complete relative to a threshold t iff there is a credence function c such that for all \(p \in P\) if \(c(p)>t\) then \(p \in B\) and if \(c(p)<t\) then \(p \not \in B\). As we shall see, this last notion connects more naturally with the decision-theoretic notions we define in relation to accuracy scoring.
We can now prove the claim in the introduction that there are sets of beliefs that are pairwise compatible but are not holdable by the Lockean with \(t>.5\). A belief set B is pairwise compatible iff for all \(b, b' \in B\), \(b \cap b' \ne \emptyset \). It is well known that this is a necessary condition for Lockean Compatibility, Completeness, or Almost Completeness for \(t>.5\).Footnote 11 However it is not a sufficient condition for any of these properties as the following result shows:
Observation 1
For any \(t>0\), there is a set of beliefs B that is pairwise compatible but not Lockean Compatible at t.
Proof
Let W be the set
Consider the set B enumerated \(b_1 \ldots b_n\) such that \(b_k = \{ w \in W: k \in w\}\). Note that for any \(i \ne j\), \(\{i,j\}\) is a member of both \(b_i\) and \(b_j\) and no other element of B. So, B is pairwise compatible and each element of W is a member of exactly two elements of B. It follows that for any credence function c:
It follows that for some j between 1 and n that \(c(b_j) \le \frac{2}{n}\). So if we choose n large enough we can find a member of B with an arbitrarily low credence.Footnote 12\(\square \)
Note that since Lockean Compatibility is the weakest of our three notions, we also cannot use pairwise compatibility to give sufficient conditions for being Lockean Complete or Almost Lockean Complete.Footnote 13
4 Betting on Beliefs
Dutch Book arguments traditionally assume that an agent’s rational betting behavior is determined by their credences. Dutch Book arguments are used to argue for the thesis that a rational agent’s credences are probabilistic (something we assumed above in the definition of credences). The leading assumption behind the Dutch Book argument is that credences determine a rational agent’s betting preferences. The arguments typically assume some divisible good in which an agent’s utility is linear (conventionally assumed to be dollars). Expected utility theory provides a standard way of linking rational betting dispositions to credences: a rational agent acts in order to maximize her utility relative to here credences. Thus, if offered a bet, an agent ought to be willing to take it only if she expects to gain by it or at least not lose by it. Indeed the very idea of credences or subjective probability is often considered to be determined by or to determine betting behavior.Footnote 14 On this framework it is typically assumed that if an agent has a credence y in a proposition p then she ought to be willing to buy or sell a bet that pays out \(\$1\) if p is true and \(\$0\) otherwise (we’ll call this bet a bet for p) for the price \(\$y\). This is because the agent’s expected return from the bet is \(\$y\). (In the decision theoretic framework, the agent is also rationally obliged to buy the bet for any lower value and sell it for any higher value.) A Dutch Book is a way of exploiting these dispositions, by presenting an agent with a collection of bets she is willing to take which will result in a sure loss for the agent, and, hence, a sure gain for the bookie (i.e. the person offering the bets). Traditional Dutch Book arguments show that, roughly, an agent will not be subject to a Dutch Book if and and only if her credences are probabilistically coherent.Footnote 15
It is less straightforward to link all-or-nothing beliefs to betting behavior. The Lockean thesis, however, by providing a link between belief and credence suggests some ways of linking beliefs to betting behavior. On the Lockean thesis an agent ought to believe p if and only if she assigns p a credence greater than or equal to t. The agent, then, for each belief b ought to be willing to buy a bet for b for \(\$t\) (or a lower price), since her expected return of the bet is greater than or equal to zero. So we start with the assumption that there are some bets that a Lockean agent ought to be willing to buy (in any quantities). If there is a collection of such bets for an agent that guarantee her a loss at each world then the agent’s beliefs B is subject to a one-way Dutch Book with respect to t.
To state the notion of a one-way Dutch Book formally we will again use vector notation. For an agent’s beliefs \(b_1, \ldots , b_o\) we can use a non-negative o-dimensional vector \(\mathbf {x}\), the stake vector, to represent the number of bets the agent buys for each proposition \(b_1, \ldots , b_o\) (i.e. the agent buys \(x_i\) bets for \(b_i\)).Footnote 16 We assume here that all bets are priced at \(\$t\) since that is the highest price that the Lockean view guarantees the agent will be willing to pay. At a world w these bets lead to the payoff,
By associating each world w with an o-dimensional vector \(\mathbf {w}\), such that
the payoff at w can be concisely stated as \(\mathbf {w} \cdot \mathbf {x} \). A collection of beliefs \( b_1, \ldots , b_o\) is subject to a one-way Dutch Book just in case there is some vector of non-negative stakes \(\mathbf {x}\) such that for all worlds w, \(\mathbf {x} \cdot \mathbf {w} <0\).
A one-way Dutch Book only takes advantage of the beliefs \(B = b_1, \ldots , b_o\) the agent has, it does not exploit those propositions she does not belief, i.e. \(P\backslash B\). On the Lockean view, the agent must have less than t credence in the propositions not in B. She, thus, ought to be willing to sell \(\$1\) bets on each proposition in \(P \backslash B\) for \(\$t\) each. A two-way Dutch Book is a collection of bets in B that the agent is willing to buy at \(\$t\) and a collection of bets in \(P \backslash B\) she is willing to sell at \(\$t\) that together guarantee her a loss.
We will formally state the notion of two-way Dutch Book in vector notation. Let a non-negative n-dimensional vector \(\mathbf {x}\) represent the stakes an agent takes in bets costing \(\$t\) on the propositions \(p_1, \ldots , p_n\) as follows: if \(p_i\) is one of the agent’s beliefs then she buys \(x_i\) bets, if \(p_i\) is not one of the agent’s beliefs then she sells \(x_i\) bets.Footnote 17 That \(\mathbf {x}\) is required to be nonnegative captures the fact that the choice of buying or selling a bet in a proposition is determined by whether or not the proposition is believed by the agent.
The payout an agent gets for stakes \(\mathbf {x}\) with the set of beliefs B at a world w is:
We can represent this as \(\mathbf {x} \cdot \mathbf {w}\), where the n-dimensional vector \(\mathbf {w}\) is defined as follows:
An agent is subject to a two-way Dutch Book if there is a non-negative n-dimensional stake vector \(\mathbf {x}\) such that for each world w, \(\mathbf {x} \cdot \mathbf {w} < 0\). In other words, there is a set of bets the agent’s beliefs rationalize that leads to a loss at each world.
5 Scoring Beliefs for Accuracy
Another perspective associates sets of beliefs with scores, depending on the accuracy of the beliefs in the set. The picture here is that an agents’ beliefs B are scored for their accuracy. Scoring all-or-nothing beliefs is simpler conceptually than scoring graded beliefs: while saying whether a real-valued confidence in a proposition is right or wrong at a world is a tricky business, saying whether a belief is right or wrong is straightforward.Footnote 18 For this reason, we’ll jump right into how to score sets of all-or-nothing beliefs without comparison to the scoring of graded beliefs.
We’ll consider a scoring system for beliefs following Easwaran and Fitelson (2015), Easwaran (2016) and Dorst (2019). The crucial assumption made in this literature is that we assign scores to an agent for their total beliefs at a world by assigning numerical scores to individual propositions believed and summing those scores.Footnote 19 We will further assume that at a world w if a belief in a proposition \(p_i\) turns out to be correct (i.e. \(w \in p_i\)) then the agent gets a certain non-negative score \(r_i \ge 0\), and if it is incorrect (i.e. \(w \notin p_i\)) the agent gets a non-positive score, \(s_i \le 0\). Note that if an agent fails to believe a proposition there is no score associated with that proposition. We will further assume that the ratio between the reward for true belief and penalty for false belief is constant across propositions so that for any two propositions \(p_i\) and \(p_j\), \(\frac{r_i}{s_i} = \frac{r_j}{s_j}\), if all of \(r_i>0\), \(s_i\) \(> 0\), \(r_j>0\) and \(s_j\) are non-zero.Footnote 20 We also assume that for any \(p_i\), \(r_i=0\) iff \(s_i = 0\). On this system of scoring beliefs, at a world w a set of beliefs B gets the score
We now rephrase this scoring system in a way that makes more transparent its connection to Lockeanism. Using the threshold t, for reasons that will become apparent, let \(t = \frac{-s_i}{r_i -s_i}\), for some i such that \(r_i>0\). Note that t does not depend on the choice of i and \(0 \le t \le 1\). Let \(x_i = r_i - s_i\). Note that \(-tx_i = s_i\) and \((1-t)x_i = r_i\). We will refer to \(x_i\) as the weight on the score for proposition \(p_i\), as the higher an \(x_i\) is the more extreme a score the proposition \(p_i\) has, relative to a fixed t. Note that \(x_i\) is always non-negative. Let \(\mathbf {x}\) be the m-dimensional vector whose ith entry is \(x_i\). Given this mapping, it is clear that any scoring system, as defined in the previous paragraph, can be fully specified by giving a \(t :0 < t \le 1\) and a non-negative n-dimensional vector \(\mathbf {x}\). In what follows, we will specify scoring systems in terms of t and \(\mathbf {x}\) without loss of generality.
Let \(\mathbf {w}\), again, be the o-dimensional vector defined in (1) on p. 8. Let \(\mathbf {x}\), the weight vector, be a non-negative o-dimensional vector giving the weights on a scoring system for the propositions \(b_1, \ldots , b_n\).Footnote 21 The agent’s total score at w, S(w, B) can be concisely stated in vector notation as \(\mathbf {w} \cdot \mathbf {x}\).
Since we have a scoring system for beliefs, we can apply some standard notions from decision theory to the choice of belief sets. A choice of a set of beliefs B is a sure loss if for every world w, \(S(w,B) <0\). A belief set B is rational for an agent given a credence function c if and only if there is no other set \(B'\) such the expected score of B on c, \(E(S(w,B'))\), is greater than the expected score of \(B'\) on c, \(E(S(w,B'))\).Footnote 22 The choice of one set of beliefs B strictly dominates another choice of beliefs, \(B'\) if for all worlds w, \(S(w,B) > S(w, B')\). A set of beliefs B weakly dominates another set of beliefs, \(B'\), if for all worlds w, \(S(w,B) \ge S(w, B')\) and there is some world \(w'\) such that \(S(w,B) > S(w, B')\). Note that all of these notions are only defined relative to the scoring system, and hence the weight vector \(\mathbf {x}\) (for all propositions) as well as the threshold t.
6 Connecting the Three Perspectives
First, we will draw some connections between properties of Dutch Books and properties of scoring systems on our framework.
Observation 2
For any set of worlds \(W= w_1, \ldots , w_m\), set of beliefs \(B =b_1, \ldots , b_o\), and positive real number \(t\le 1\), the following two statements are equivalent.
-
(a)
The agent holding B is subject to a one-way Dutch Book at threshold t.
-
(b)
At some weight vector \(\mathbf {x}\) the agent holding B will realize a sure loss (i.e. a loss at all worlds) at threshold t.
Proof
Corresponding to each world w there is an o-dimensional vector \(\mathbf {w}\) as defined in (1) on p. 8. Note that \(\mathbf {x} \cdot \mathbf {w}\) is both the payoff at w frzzaom holding bets on B with stake vector \(\mathbf {x}\) with threshold t and the score one gets at w for having beliefs B with weight vector \(\mathbf {x}\) and threshold t. So, both (a) and (b) are equivalent to the statement that there is some non-negative o-dimensional vector \(\mathbf {x}\) such that for every world w, \(\mathbf {x} \cdot \mathbf {w} < 0\). \(\square \)
Another, less obvious, connection between scoring and betting can be made as follows:
Observation 3
For any set of worlds \(W= w_1, \ldots , w_m\), set of beliefs \(B =b_1, \ldots , b_o\), and positive real number \(t \le 1\) the following two statements are equivalent.
-
(a)
The agent holding B is subject to a two-way Dutch Book at threshold t.
-
(b)
The set of beliefs B is strictly dominated by another set of beliefs \(B'\) on the scoring system with threshold t and some weight vector \(\mathbf {x}\).
Proof
We will start by showing (a) implies (b): Let \(\mathbf {w}\) be a non-negative n-dimensional vector as defined in (2) on p. 9. Since B is subject to a two-way Dutch Book we know that there is some weight vector \(\mathbf {x}\) such that for every world w, \(\mathbf {w} \cdot \mathbf {x} <0\). In scoring terms \(\mathbf {w} \cdot \mathbf {x}\) represents the score at world w for holding B minus the score for holding \(P \backslash B\). So \(P \backslash B\) strictly dominates B on weights \(\mathbf {x}\).
Now we will show that (b) implies (a). If B is strictly dominated we know that there is some other set of bets \(B'\) and some set of non-negative weights \(\mathbf {x}\) such that for every world w the score for holding B with t and \(\mathbf {x}\) is strictly less then the score for holding \(B'\). Now define \(\mathbf {x'}\) such that,
It is straightforward to see that on \(\mathbf {x'}\), \(B'\) strictly dominates B: since B strictly dominates \(B'\) on \(\mathbf {x}\), and \(\mathbf {x'}\) is like \(\mathbf {x}\) except for not weighting propositions held in common by \(\mathbf {B}\) and \(\mathbf {B}'\). We can also see that \(P\backslash B\) strictly dominates B on \(\mathbf {x}'\) since the scores for \(P\backslash B\) and \(B'\) on \(\mathbf {x}'\) are equal at every world. Since \(\mathbf {x'} \cdot \mathbf {w}\) is the score for holding B minus the score for holding \(P \backslash B\), it must be negative for every w. Therefore \(\mathbf {x'}\) are the stakes for a two-way Dutch Book. \(\square \)
We have just seen the close relationship between the betting dominance and accuracy dominance of a set of beliefs B. Now we can make connections between these and Lockeanism.
We begin by reviewing a connection between the scoring system and Lockeanism that has been established in the literature. The central result in the literature is the following.Footnote 23
Observation 4
Given a scoring system with threshold t, a set of beliefs is rational relative to some credences c if it is Almost Lockean Complete with respect to t.
Proof
We give a sketch, referring the reader to Dorst (2019) and Easwaran (2016) for more detail. Given the additivity of our scoring system and of expectations, an agent maximizes expected score for the total set of beliefs, by making the ‘best’ choice for each proposition p whether to belief it or not. If the scoring system assigns a positive weight to p this requires believing p only if \(c(p) \ge t\) and whenever \(c(p) > t\), hence being Almost Lockean Complete. Note that since weights are not required to be positive the converse fails. \(\square \)
Relating rationality, which is expressed in terms of credences, to Lockeanism is straightforward as the previous result showed. The notion of a sure loss and the dominance relations are not stated in terms of credences and so their relationship to Lockeanism less transparent. We begin with a negative observation, due to Easwaran (2016). Suppose a belief set B is not weakly dominated on the scoring system with weights \(\mathbf {x}\) and threshold t. This does not entail that B is (Almost) Lockean Complete. A simple example suffices, where there are two worlds \(w_1\), \(w_2\) and threshold .6 and all propositions have weight 1 except \(\{ w_1\}\) which has weight 100. In this case the belief set \(\{ \{w_1\}, \{w_2\}, \{w_1, w_2,\} \} \) is not weakly dominated. It yields score 40 at \(w_1\) and − 59.2 at \(w_2\). Dropping \(\{w_1\}\) would reduce the score at \(w_1\) while dropping \(\{w_2\}\) would reduce the score at \(w_2\), while dropping both would reduce the score at \(w_1\). However, given the threshold of .6, an Almost Lockean Complete set of beliefs cannot include two pairwise incompatible propositions such as a \(\{w_1\}\) and \(\{w_2\}\).
We now turn to our main results: characterizations of Lockean Compatibility and Almost Lockean Completeness in betting and scoring terms.
Here is our first result:
Theorem 1
The following three statements are equivalent.
-
(a)
The agent holding B is not subject to a one-way Dutch Book at threshold t.
-
(b)
There is no weight vector \(\mathbf {x}\) such that the agent holding B will realize a sure loss at threshold t.
-
(c)
B is Lockean Compatible with threshold t.
Proof
As we have already showed that (a) and (b) are equivalent in Observation 2, we will only need to show that (c) is equivalent to those two. To do so we will need to represent the entire situation in terms of matrix-vector multiplication.
Consider the \(m \times o\) matrix \(\mathbf {A}\), constructed so that each row i consists of the row vector \(\mathbf {w}_i\) corresponding to the world \(w_i\) as defined in (1) on p. 8.
Or equivalently we can directly define \(\mathbf {A}\) as follows:
where
Note that if we multiply \(\mathbf {A}\) by an o-dimensional column vector \(\mathbf {x}\) we get the following:
We can see, then, that iff there is no non-negative \(\mathbf {x}\) such that \(\mathbf {A} \mathbf {x} < 0\) then (a) and (b) hold.Footnote 24 So, to put it more compactly, (a) and (b) are equivalent to the following holding:
Consider an m-dimensional vector corresponding to some credence function \(\mathbf {c}\), as in Sect. 3. Now consider multiplying such a vector in row format by \(\mathbf {A}\) as follows:
We can see that the ith coordinate of \(\mathbf {c}^T \mathbf {A}\) is greater than or equal to 0 iff \(c(b_i)\ge t\). Another way of seeing this is to note that the ith column of \(\mathbf {A}\) is the vector \(\mathbf {b}_i\) minus the vector \(\mathbf {t}\) which has value t everywhere.Footnote 25 Thus, the ith entry of \(\mathbf {c}^T \mathbf {A}\) is \(\mathbf {c} \cdot (\mathbf {p} - \mathbf {t})\) = \(\mathbf {c} \cdot \mathbf {p} -t\). If this is greater than or equal to 0, then \(c(p) \ge t\).
We can now compactly state what it is for a set of beliefs to satisfy (c) as follows:
Note that we do not require \(\mathbf {y}\) to sum to 1 in (4). However, it is easy to see that (4) is equivalent to
since multiplying \(\mathbf {y}\) by a positive scalar does not affect any of the inequalities.
So all we need to show is that (3) and (4) are equivalent statements about the matrix \(\mathbf {A}\), which is the content of the following theorem from linear algebra, whose proof, using Farkas’s Lemma, is in the appendix. \(\square \)
Theorem 2
Let \(\mathbf {A}\) be any \(m\times n\) matrix, then:
The next theorem relates the notion of strict dominance to the scoring and betting frameworks.
Theorem 3
The following three statements are equivalent:
-
(a)
There is no two-way Dutch Book on B.
-
(b)
There is no weight vector \(\mathbf {x}\) on which B is strictly dominated.
-
(c)
B is Almost Lockean Complete.
Proof
We have already established that (a) and (b) are equivalent in Observation 3.
Consider the following \(m \times n\) matrix \(\mathbf {D}\).
where,
Note that the ith row of \(\mathbf {D}\) corresponds to the vector \(\mathbf {w}_i\) associated with the world \(w_i\) as defined in (2) on p. 9. Given an n-dimensional non-negative weighting vector \(\mathbf {x}\), \(\mathbf {x} \cdot \mathbf {w}_i < 0\) iff taking stakes \(\mathbf {x}\) on the bets associated with the belief set B leads to a loss at \(w_i\). Since
the statement,
is equivalent to saying that there is no two-way Dutch Book on B (at the threshold t), i.e. that (a) and (b) are true.
Considered column-wise, the jth column of \(\mathbf {D}\) is \(\mathbf {p_j} -\mathbf {t}\) if \(p_j \in B\) and \(-(\mathbf {p_j} -\mathbf {t})\) if \(p_j \notin B\), where \(\mathbf {p_j}\) is the m-dimensional proposition vector as defined in Sect. 3 and \(\mathbf {t}\) is the m-dimensional vector with value t at all coordinates. There being a credence function that is Almost Lockean Complete, (c) above, is thus equivalent to there being a credence vector \(\mathbf {c}\) such that \(\mathbf {c} ^T D \ge 0\). This is equivalent to this simplified condition:
To complete our proof we note that Theorem 2 establishes that (5) and (6) are equivalent statements about \(\mathbf {D}\). It follows that (a), (b), and (c) are equivalent.
\(\square \)
The following conjecture from Easwaran (2016) follows directly from Theorem 3.
Lemma 1
(Easwaran’s conjecture) If for all positive weight vectors \(\mathbf {x}\), B is not weakly dominated by some \(B'\), then B is Almost Lockean Complete.
Proof
Given Theorem 3 what we can show is that if B is not weakly dominated on any positive weighting function, then there is no non-negative weighting function on which B is strictly dominated. We will prove the contrapositive. Suppose B is strictly dominated by \(B'\) on some non-negative weight vector \(\mathbf {x}\). Let \(\mathbf {x'}\) be a strictly positive weight vector such that
Let \(B''\) be the following belief set:
We can see that since \(B'\) strictly dominates B on \(\mathbf {x}\), \(B''\) will strictly dominate B on \(\mathbf {x}'\). So B is weakly dominated on the positive weight vector \(\mathbf {x'}\). \(\square \)
Note that the converse of Lemma 1 fails. B can be Almost Lockean Complete but still weakly dominated on some strictly positive weighting function. This is because if the credence function c assigns 0 to the proposition \(\{ w_1\}\) and t to \(\{w_2\}\) then B might include \(\{w_2\}\) and not include \(\{w_1, w_2\}\) and still be Almost Lockean Complete. It will be weakly dominated (on a positive weighting vector) by a variation \(B'\) that is like B except it includes \(\{w_1,w_2\}\) but not \(\{w_2\}\).
7 Discussion
Theorems 1 and 3 provide characterizations of Lockean belief sets (both Lockean Compatible and Almost Lockean Complete) in terms of the betting and accuracy frameworks.
As motivations for the Lockean thesis the significance of the results depends (a) on how natural the scoring and betting frameworks used are and (b) how compelling the conditions put on collections of beliefs relative to these frameworks are. With respect to the betting framework: the natural question to ask is whether we are rationally obliged to bet on our beliefs, or against our non-beliefs, in the ways specified in Sect. 4. While it is natural to think that one is rationally obliged to bet on one’s full beliefs, it is perhaps less obvious that one has to bet against one’s non-beliefs. Certainly though, if such betting dispositions are normatively linked to beliefs, the requirement that we not be subject to Dutch Books is a plausible condition to put on rational beliefs.Footnote 26 With respect to the scoring framework: if the idea of scoring beliefs for accuracy is attractive in general, then the framework used is quite natural.Footnote 27 However, we might ask why rational agents ought to choose belief sets that are not either (a) subject to a sure loss on some weighting vector, or (b) strictly dominated on some weighting vector. It is clear that if we know what the actual weighting vector is we should avoid a sure loss/strict domination on that vector. But why should we avoid it on all weighting vectors? Two thoughts are possible here: one is that we have uncertainty over what the right weights are and we are treating that uncertainty conservatively by ensuring we are okay (i.e. our choice is not dominated) no matter what the weighting vector is, the other is that there is unquantifiable uncertainty about what the weights are (since perhaps we don’t have graded beliefs at all) and avoiding sure loss or weakly dominated beliefs on any weights is the best policy.Footnote 28
8 Further Directions
8.1 Connections to Wald’s Complete Class Theorem
I would conjecture that an alternative characterization of Almost Lockean Completeness can be found using Wald’s Complete Class Theorem.Footnote 29 Wald’s Complete Class Theorem establishes that in a certain class of decision problems any strategy that is not weakly dominated is a rational strategy (i.e. one maximizing expected reward subject to her credences). Wald’s Complete Class theorem does not directly apply to the choice problem here: for, as I noted in Sect. 6, there are some belief sets that are not weakly dominated at some weighting function but that are also not Almost Lockean Complete. The reason Wald’s Complete Class theorem does not apply is that it requires the class of strategies to be convex: i.e. if two strategies are available to an agent then any probabilistic mix of them also is. If we had allowed mixed strategies in our decision problem, then the Complete Class Theorem would apply and any strategy that was not weakly dominated by any mixed strategy would be Almost Lockean Complete. However, the main point of modeling outright belief is to have a notion of belief that contrasts with graded belief. If we allowed mixed strategies involving outright belief, it is not so clear that we have not introduced an analogue of graded beliefs. Nonetheless, I conjecture that by applying Wald’s Complete Class Theorem we can show that outright beliefs that are Almost Lockean Complete will be just those that are not weakly dominated by any mixture of other outright beliefs. Future work might also investigate how we ought to think about mixed strategies in a system of outright belief and how such mixed strategies relate to graded beliefs.
8.2 Extensions to Other Scoring Systems
Note that we assumed here that at the threshold t, the betting odds, and the scoring system ratios of cost to rewards, were the same for every proposition. A simple extension of these results could handle cases in which the odds are allowed to vary for each proposition.Footnote 30 In addition to this last extension, future work might also explore other scoring systems and see whether they also give rise to Lockean patterns of beliefs. Most interesting, I think, would be to see what can be found when the additivity assumption is weakened.
8.3 Connection to Another Characterization
In this paper I characterized the Lockean belief sets by way of their relationship to bets and accuracy scoring systems. Fernando (1998, Theorem 4, p. 230) gives a different characterization of Lockeanism that is based on qualitative properties of sets of beliefs.Footnote 31 It would be interesting to explore the connections between this result and those presented here.
Notes
Sometimes the Lockean thesis is framed as one about what one ought to believe, rather than about what one does believe. The difference is not essential here.
See Leitgeb (2014) for an exploration of constraints, other than having a threshold of 1, on credences and thresholds that ensure closure (and consistency).
Assume you assign a credence of .6 to P and .6 to Q, then you can assign at most .8 to \( \lnot (P \& Q)\).
See, e.g., Hawthorne and Bovens (1999) for this and related observations.
Notation: \(2^X\) is the powerset of X.
This is equivalent to saying \(\langle W, P, c \rangle \) is a probability space. Note that, unlike in standard Dutch Book arguments, we are here stipulating that credences satisfy the axioms of probability theory.
For two n-dimensional vectors, \(\mathbf {x} =\langle x_1 \ldots x_n\rangle \) and \(\mathbf {y} = \langle y_1 \ldots y_n\rangle \), their inner product, \(\mathbf {x}\cdot \mathbf {y}\), is \(x_1y_1+\cdots + x_ny_n\).
This is because if two propositions are disjoint then both their probabilities cannot be greater than .5.
Thanks to Kenny Easwaran for noting and fixing a mistake here.
Note that the combination of single premise closure (for every \(b \in B\) if \(p \supseteq b\) then \(p \in B\)) and pairwise compatibility also does not yield necessary and sufficient conditions for a Lockean complete sets of beliefs.
As in Ramsey (1926).
See Pettigrew (2019) for a recent review of these arguments.
We are using an o-dimensional vector to represent a function from B to non-negative stakes.
Note that we reuse \(\mathbf {x}\) and \(\mathbf {w}\) for both o-dimensional and n-dimensional vectors. This abuse of notation is justified because of the similar construction and role these vectors play across different arguments and definitions. It will always be clear, I hope, which one we are discussing.
See Gneiting and Raftery (2007) for a review of scoring rules for graded beliefs.
The additivity assumption, that the total score for one’s beliefs is just the sum of the individual scores, is defended by Dorst (2019).
These are the only weights necessary for determining the score for the set of beliefs B, which is why we do not use our original n-dimensional weight vector also called \(\mathbf {x}\). Again the reuse of the \(\mathbf {x}\) and \(\mathbf {w}\) for both o-dimensional and n-dimensional vectors is to emphasize their parallel roles.
Where \(E(S(w,B)) = \sum _{w \in W} c(\{w\})S(w,B)\).
Note that, as is standard, \(\mathbf {v}<0\) means each coordinate of \(\mathbf {v}\), \(v_1 \ldots v_k\), is less than zero.
Vectors for propositions were defined in Sect. 3 and simply are m-dimensional vectors that are 1 at index i when proposition is true at \(w_i\) and 0 otherwise.
See Pettigrew (2019) for a recent review and discussions of some of the philosophical questions about whether rational agents are obliged to make bets on their graded beliefs.
I am grateful to Kevin Dorst for this last point.
I am grateful to Gary Chamberlain here for pointing out the relevance of Wald’s Complete Class Theorem. See, e.g., Ferguson (1967) for details.
Wes Holliday pointed out to me that Fernando’s result can also be derived as a direct consequence of a result in Adams (1965, Theorem 1.4, p. 213).
References
Adams, E. W. (1965). Elements of a theory of inexact measurement. Philosophy of Science, 32(3/4), 205–228. https://doi.org/10.1086/288044.
Brier, G. W. (1950). Verification of forecasts expressed in terms of probability. Monthly Weather Review, 78(1), 1–3. https://doi.org/10.1175/1520-0493.
de Finetti, B. (1974). Theory of probability (Vol. 1). Wiley.
Dorst, K. (2019). Lockeans maximize expected accuracy. Mind, 128, 175–211.
Easwaran, K. (2016). Dr. Truthlove or: How I learned to stop worrying and love Bayesian probabilities. Noûs, 50(4), 816–853. https://doi.org/10.1111/nous.12099
Easwaran, K., & Fitelson, B. (2015). Accuracy, coherence, and evidence. Oxford Studies in Epistemology, 5, 61–96.
Ferguson, T. S. (1967). Mathematical Statistics. A decision theoretic approach. Academic Press.
Fernando, T. (1998). In conjunction with qualitative probability. Annals of Pure and Applied Logic, 92(3), 217–234.
Foley, R. (1992). The epistemology of belief and the epistemology of degrees of belief. American Philosophical Quarterly, 29(2), 111–124.
Gneiting, T., & Raftery, A. E. (2007). Strictly proper scoring rules, prediction, and estimation. Journal of the American Statistical Association, 102(477), 359–378. https://doi.org/10.1198/016214506000001437.
Hawthorne, J., & Bovens, L. (1999). The preface, the lottery, and the logic of belief. Mind, 108(430), 241–264.
Joyce, J. M. (1998). A nonpragmatic vindication of probabilism. Philosophy of Science, 65(4), 575–603.
Leitgeb, H. (2014). The stability theory of belief. Philosophical Review, 123(2), 131–171.
Matoušek, J. & Gärtner, B. (2007). Understanding and Using Linear Programming. Springer.
Pettigrew, R. (2016a). Accuracy and the laws of credence. Oxford University Press.
Pettigrew, R. (2016b). Jamesian epistemology formalised: An explication of ‘The Will to Believe’. Episteme, 13(3), 253–268.
Pettigrew, R. (2019). The Dutch Book Arguments. Cambridge University Press.
Ramsey, F. P. (1926). “Truth and Probability”, in Foundations of Mathematics and other Essays, R. B. Braithwaite (ed.), London: Kegan, Paul, Trench, Trubner, & Co., 1931, 156–198.
Strang, G. (2006). Linear algebra and its applications, 4th edn. Thomson Brooks/Cole.
Wikipedia contributors. (2019). ‘Farkas’ lemma—Wikipedia, the free encyclopedia. https://en.wikipedia.org/w/index.php?title=Farkas%27_lemma&oldid=879978648. [Online; accessed 6-February-2019].
Acknowledgements
I am grateful to audiences at London School of Economics and University of Amsterdam, Gary Chamberlain, Nilanjan Das, Kevin Dorst, Kenny Easwaran, Branden Fitelson, Wes Holliday, Matt Mandelkern, Richard Pettigrew and Linda Rothschild for comments and discussion. Special thanks to an anonymous referee at Review of Symbolic Logic who gave extremly helpful and extensive comments on three drafts of the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix: Proof of Theorem 2
Appendix: Proof of Theorem 2
Theorem 2
Let \(\mathbf {A}\) be any \(m\times n\) matrix, then:
Proof
We start with this widely used variant of Farkas’s Lemma:Footnote 32
We can get the following by taking a universal instantiation of the both sides of the biconditional. (i.e. going from ‘ p iff q’ \(\rightarrow \) ‘(for all x p) iff (for all x q)’).
This can be simplified to:
We can then switch signs on the right-hand side to get:
Theorem 2 follows immediately (by substitution of \(\mathbf {A}\) with \(-\mathbf {A}\)):
\(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Rothschild, D. Lockean Beliefs, Dutch Books, and Scoring Systems. Erkenn 88, 1979–1995 (2023). https://doi.org/10.1007/s10670-021-00438-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10670-021-00438-1