Skip to main content
Log in

A critique of benchmark theory

  • Published:
Synthese Aims and scope Submit manuscript

Abstract

Benchmark theory (BT), introduced by Ralph Wedgwood, departs from decision theories of pure expectation maximization like evidential decision theory (EDT) and causal decision theory (CDT) and instead ranks actions according to the desirability of an outcome they produce in some state of affairs compared to a standard—a benchmark—for that state of affairs. Wedgwood motivates BT through what he terms Gandalf’s principle, that the merits of an action in a given state should be evaluated relative only to the performances of other actions in that state, and not to their performances in other states. Although BT succeeds in selecting intuitively rational actions in a number of cases—including some in which EDT or CDT seem to go wrong—it places constraints on rational decision-making that either lack motivation or are untenable. Specifically, I argue that as it stands BT is committed both to endorsing and rejecting the independence of irrelevant alternatives. Furthermore its requirement that weakly dominated actions be excluded from consideration of rational action lacks motivation and threatens to collide with traditional game theory. In the final section of the paper, I construct a counterexample to BT.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

Notes

  1. Specifically, this is typically taken to be a von Neumann–Morgenstern utility function, i.e. it is unique up to positive affine transformations: neither utility 0 nor utility 1 has special significance, but same-sized differences in utility are significant. Nothing in what follows turns on the details of these functions. Indeed, strictly speaking, utility is a measure of subjective preference and Wedgwood is keen to emphasize that he does not assume in his theory that such preferences are measurable by utility functions. In this paper, I use utility in a looser sense to denote the payoffs an agent associates with outcomes; these need not be interpreted as measures of subjective preference, but whatever values Wedgwood needs to underwrite the details of his theory.

  2. As sketched in the last footnote, more structure is usually built into these utility functions than mere monotonic preservation of preference orderings, but the details are not important here since this is not relevant for Wedgwood’s theory.

  3. Some theorists advocate modifications to this basic model in order to account for e.g. the Egan cases discussed below. In this paper, I will focus only on Wedgwood’s response to these kinds of cases.

  4. See Jeffrey (1983) for a classic treatment.

  5. This term is taken from Gibbard and Harper (1978).

  6. (Rational) conditional credences conform to Bayes’ Rule, i.e. \(Cr(P|Q)=Cr(P\cap Q)/Cr(Q)\), for \(Cr(Q)\not = 0\).

  7. There is, however, more to be said in EDT’s defence against such cases than space permits here. See e.g. Jeffrey (1983) and Eells (1981) for responses to these cases.

  8. For an example of such research, see Marcus et al. (1997).

  9. Such philosophers include Stalnaker (1972), Gibbard and Harper (1978) and Lewis (1981).

  10. For an exposition of this constraint, see Lewis (1981).

  11. This does not generalize to cases with more options: BT, as I have presented it thus far, sometimes will choose a dominated action. See below for discussion.

  12. This is Wedgwood’s term.

  13. In a decision problem with infinitely many actions to choose from (and only such problems), every action can be strictly (or weakly) dominated. It does not seem in general therefore that it is unreasonable to rule that every action in a problem is irrational to choose. In an infinite problem, there might always be a better action available than whatever one chooses, and this is why one’s choice might always be irrational. This plainly cannot be the case in a finite problem. In particular in a two-action problem in which both actions are nearly dominated, they cannot both be better than each other. An anonymous reviewer comments that it is implausible that every action in a decision problem be irrational to choose. While I’m not sure that I agree with this point, if it is true then it serves to emphasize that it could be rational to choose a nearly dominated action.

  14. I have, for clarity of exposition, relabelled actions to accord with my description of IIA above. However, the structure of the problem and the values used for credences and utilities are true to Wedgwood’s example.

  15. Wedgwood claims that wherever the benchmark is set, BT requires choosing \(A\). This is false. There is a relatively small region of values (where the weighting given to the top-ranked action is small in comparison to the weighting assigned to the second-ranked action) where the ECV of \(B\) exceeds that of \(A\). In particular \(63 < 72w - 16v\), where \(v\) is the weighting given to the top-ranked action and \(w\) that given to the second, is a sufficient and necessary condition for \(B\) to receive a higher ECV than \(A\). As an aside, \(C\) is never preferred to \(A\) in this problem but there is a small region of values in which \(C\) receives a higher ECV than \(B\).

  16. He cannot use probabilistic choice dependence as any grounds for rejecting IIA because his only reason for thinking that IIA fails when there is choice dependence is that BT is inconsistent with IIA under these circumstances. It would be begging the question to use this as a defence of BT. As far as I am aware, he has no independent reason for thinking that IIA fails in these dependency situations.

  17. Nozick (1969) and Kahneman and Tversky (1986) are examples of the former; Weirich (2004) is an example of the latter.

  18. Note that Briggs’ proof that BT agrees with EDT and CDT in cases where the state of the world is probabilistically independent of one’s choice of action assumes that weakly dominated actions are not ruled out beforehand.

  19. Proof sketch: First note that since weak domination is irreflexive and transitive, every player with only finitely many strategies must have some undominated strategy. Nash showed that every finite game has a Nash equilibrium, and hence if we reduce a (finite) game \(\varGamma \) by eliminating a weakly dominated strategy, the resulting game will have one, \(\pi \). But every strategy did at least as well in the non-reduced game \(\varGamma \), since we only eliminated a weakly dominated strategy. Therefore \(\pi \) must contain only best responses in \(\varGamma \) and is therefore Nash in \(\varGamma \). It follows by induction that equilibria of \(\varGamma \) will remain regardless of how many weakly dominated strategies are eliminated (or indeed if they are eliminated iteratively).

  20. Or at least if there is, this seems to cast a long shadow over the prospects of unifying decision theory and game theory.

  21. Of course, in the benchmark theorist’s defence, independent motivation for the strict dominance principle will be more easily found than that for the weak dominance principle.

  22. This, of course, should not be given a causal or counterfactual reading. In the lottery ticket example, if it turns out that the ticket is a winner then I must’ve chosen it. In Newcomb’s problem, if it turns out that box \(B\) has the million then I must’ve chosen to one-box.

  23. An anonymous reviewer suggests that perhaps Wedgwood does not conceive of ratifiability as a necessary condition on rational decision-making, but rather a feature that counts in favour of rationally choosing an action. If this is so, then Wedgwood can maintain that the ratifiability of \(A\) may explain why it is rationally permissible to choose, while denying that \(B\) and \(C\) are rationally impermissible simply because they are not ratifiable choices. However, if Wedgwood does think this, it still seems that we are owed some explanation of why \(B\) and \(C\) are rationally permissible when they are to be thought of as using one’s powers badly. Wedgwood might say here that, while this fact should count against them as rational choices, there remain other considerations in their favour that it does not outweigh.

References

  • Briggs, R. (2010). Decision-theoretic paradoxes as voting paradoxes. Philosophical Review, 119(1), 1–30.

    Article  Google Scholar 

  • Eells, E. (1981). Causality, utility and decision. Synthese, 48(2), 295–329.

    Article  Google Scholar 

  • Egan, A. (2007). Some counterexamples to causal decision theory. Philosophical Review, 116(1), 93–114.

    Article  Google Scholar 

  • Gibbard, A., & Harper, W. (1978). Counterfactuals and two kinds of expected utility. In C. A. Hooker, J. J. Leach, & E. F. McClennen (Eds.), Foundations and applications of decision theory (pp. 125–162). Dordrecht: Reidel.

    Google Scholar 

  • Jeffrey, R. (1983). The logic of decision (2nd ed.). Chicago: University of Chicago Press.

    Google Scholar 

  • Kahneman, D., & Tversky, A. (1986). The behavioural foundations of economic theory. The Journal of Business, 59(4), S251–S278.

    Google Scholar 

  • Lewis, D. (1981). Causal decision theory. Australasian Journal of Philosophy, 59(1), 5–30.

    Article  Google Scholar 

  • Marcus, D. A., Schraff, L., & Turk, D. C. (1997). A double-blind provocative study of chocolate as a trigger of headache. Cephalagia, 17, 855–862.

    Article  Google Scholar 

  • Nozick, R. (1969). Newcomb’s problem and two principles of choice. In R. Nicholas (Ed.), Essays in honor of Carl G. Hempel (pp. 114–146). Dordrecht: Reidel.

    Chapter  Google Scholar 

  • Sen, A. (1993). Internal consistency of choice. Econometrica, 61(3), 495–521.

    Article  Google Scholar 

  • Stalnaker, R. (1972). Letter to David Lewis. In W. Harper, R. Stalnaker, & G. Pearce (Eds.), Ifs: Conditionals, belief, decision, chance, and time (pp. 151–152). Dordrecht: Reidel.

    Google Scholar 

  • Wedgwood, R. (2011). Gandalf’s solution to the newcomb problem. Synthese. doi:10.1007/s11229-011-9900-1.

  • Weirich, P. (2004). Realistic decision theory: Rules for nonideal agents in nonideal circumstances. New York: Oxford University Press.

    Book  Google Scholar 

Download references

Acknowledgments

I am very grateful to Dorothy Edgington, Corine Besson, Dan Adams, Jonathan Nassim, Chris Sykes and to two anonymous referees for this journal for helpful suggestions and comments on this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Robert Bassett.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bassett, R. A critique of benchmark theory. Synthese 192, 241–267 (2015). https://doi.org/10.1007/s11229-014-0566-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11229-014-0566-3

Keywords

Navigation