Skip to main content

Advertisement

Log in

Conservativeness in jury decision-making

  • Published:
Theory and Decision Aims and scope Submit manuscript

Abstract

This paper studies the three kinds of conservativeness in a jury decision-making structure: the voting rule, the threshold of reasonable doubt, and the legal information system. In a model of simultaneous voting, Feddersen and Pesendorfer (The American Political Science Review, 92(1), 23-35, 1998) argue that the unanimity rule is the worst-performing voting rule because voters with strategic behaviour mitigate the bias brought about by the voting rule. If this bias can be offset by an opposing (less conservative) bias in the legal information system, we would be able to restore the rationality of informative voting. The new informative equilibrium is strictly better than the original strategic equilibrium in terms of ex post error probability. When a strategic equilibrium survives, we can lower the probability of convicted defendant being innocent by making the information system more informative or by forming a jury with a higher (more conservative) threshold of reasonable doubt.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1

Similar content being viewed by others

Notes

  1. Dietrich and Spiekermann (2021) point out that conjecturing pivotality and subsequent epistemic-strategic voting is a game-theoretic artefact. Although this point is recognized, making the model more realistic in this direction is beyond the scope of this paper; thus, the task remains open for future research.

  2. There is an ongoing critique about the plausibility of conditional independence on the state. we opt to lose some rigor here and recommend that future research consult Dietrich and Spiekermann (2013b).

  3. As we will see later, a voter with an innocent signal and conjecturing pivotality uses a mixed strategy between voting for acquittal and conviction. Duggan and Martinelli (2001) point out that this randomisation is an undesirable artefact from binary signal modelling as seen in Feddersen and Pesendorfer (1998).

  4. Many authors, including us, assume equiprobable states. Although we assume this to ease exposition, all the propositions and corollaries in this paper hold without the equiprobability assumption. Attempts to explicitly relax this condition can be found in Wit (1998), Coughlan (2000), Duggan and Martinelli (2001), and Mengel and Rivas (2017)), among others.

  5. This example follows Gerardi and Yariv (2007), although the exposition is fairly standard.

  6. One may differentiate voting behaviour into strategic and nonstrategic behaviour. Nonstrategic voting can be classified into informative and sincere voting. Sincere voting means voter votes for an alternative that is deemed to maximize his or her expected utility. A similar classification can be found in Coughlan (2000).

  7. Although the space of equilibria is immense, following the literature by confining the analysis to symmetric equilibria, we focus on the model of Feddersen and Pesendorfer (1998) and study its possible extension along the line of information structure.

  8. Our construction of legal information systems and definition of the conservativeness of legal information systems largely follow the work of Kwon (2005) on constructing and defining accounting information systems.

  9. If the signal is commonly shared by all the jurors and all the jurors vote as the signal prescribes, one could interpret this signal accuracy as juror competence. A critique of the classic competence assumption and a way of avoiding it can be found in Dietrich and Spiekermann (2013a).

  10. Since we presuppose that \(f(\cdot |G)\) and \(f(\cdot |I)\) only differ in location parameters, the definition implies that in the neutral case \(\gamma =\delta\), in the conservative case \(\gamma >\delta\) and in the liberal case \(\gamma <\delta\). However, the implication is one way. As will be clarified in Lemma 1 and Fig. 1, we need to discern between region III (more informative than A) and region V (more conservative than A).

  11. This figure appears in the working paper of Kwon (2002) but is omitted in the later published version of Kwon (2005).

  12. We can ignore the other triangle since the signal generated by the information system in this triangle can be interpreted exactly the opposite way.

  13. See p. 24 of Feddersen and Pesendorfer (1998). In our study, as well as in Feddersen and Pesendorfer (1998), no voter observes other voters’ signals since there is no straw poll. The posterior probability is deducted by a voter when he or she is put in a position of pivotality through his or her thought experiment.

  14. One might wonder why one should bother to randomise if any deterministic behaviour may bring about at least as a good payoff as that brought about by a mixed strategy. A mixed strategy has little room for realistic interpretation. Since we are facing a one-shot, simultaneous-move game, a mixed strategy cannot be interpreted as a probability distribution of the voter’s action over acquit or convict when he or she plays the same game repeatedly. A realistic interpretation \(\grave{a}\) la Aumann (1987) is that a mixed strategy for a voter ‘i’ is the other voters’ beliefs of what voter ‘i’ will vote for.

  15. Proposition 1 by Feddersen and Pesendorfer (1998)

  16. Determining whether this improved system is uniformly better-or at least there is \(n^{*}\) such that n larger than \(n^{*}\) brings about a better outcome for the improved system-is beyond the scope of this study.

References

  • Aumann, R. J. (1987). Correlated equilibrium as an expression of Bayesian rationality. Econometrica, 55(1), 1–18.

    Article  Google Scholar 

  • Austen-Smith, D., & Banks, J. S. (1996). Information aggregation, rationality, and the Condorcet jury theorem. American Political Science Review, 90, 34–45.

    Article  Google Scholar 

  • Bhattacharya, S., Duffy, J., & Kim, S.-T. (2014). Compulsory versus voluntary voting: An experimental study. Games and Economic Behavior, 84, 111–131.

    Article  Google Scholar 

  • Blackwell, D. (1953). Equivalent comparisons of experiments. The Annals of Mathematical Statistics, 24, 265–272.

    Article  Google Scholar 

  • Blinder, A. (2004). The quiet revolution: central banking goes modern. Yale University Press.

    Book  Google Scholar 

  • Coughlan, P. J. (2000). In defense of unanimous jury verdicts: Mistrials, communication, and strategic voting. American Political Science Review, 94(2), 375–393.

    Article  Google Scholar 

  • Dietrich, F., & Spiekermann, K. (2013). Epistemic democracy with defensible premises. Economics and Philosophy, 29(1), 87–120.

    Article  Google Scholar 

  • Dietrich, F., & Spiekermann, K. (2013). Independent opinions? On the causal foundations of belief formation and jury theorems. Mind, 122(487), 655–685.

    Article  Google Scholar 

  • Dietrich, F., & Spiekermann, K. (2021). Jury Theorems. In: Zalta E. N. (ed) The Stanford Encyclopedia of Philosophy. Metaphysics Research Lab, Stanford University, Winter 2021 edn.

  • Duggan, J., & Martinelli, C. (2001). A Bayesian model of voting in juries. Games and Economic Behavior, 37(2), 259–294.

    Article  Google Scholar 

  • Feddersen, T., & Pesendorfer, W. (1998). Convicting the innocent: The inferiority of unanimous jury verdicts under strategic voting. The American Political Science Review, 92(1), 23–35.

    Article  Google Scholar 

  • Feddersen, T., & Pesendorfer, W. (1999). Elections, information aggregation, and strategic voting. PNAS, 96(19), 10572–10574.

    Article  Google Scholar 

  • Gerardi, D., & Yariv, L. (2007). Deliberative voting. Journal of Economic Theory, 134(1), 317–338.

    Article  Google Scholar 

  • Kwon, Y. K. (2002). Accounting conservatism and the value of accounting reports for incentive contracting in moral hazard settings with limited liability and risk-neutrality,” manuscript.

  • Kwon, Y. K. (2005). Accounting conservatism and managerial incentives. Management Science, 51(11), 1626–1632.

    Article  Google Scholar 

  • Levy, G. (2007). Decision making in committees: Transparency, reputation and voting rules. American Economic Review, 97(1), 150–168.

    Article  Google Scholar 

  • Li, H. (2001). A theory of conservatism. Journal of Political Economy, 109(3), 617–636.

    Article  Google Scholar 

  • Li, H., Rosen, S., & Suen, W. (2001). Conflicts and common interests in committees. American Economic Review, 91(5), 1478–1497.

    Article  Google Scholar 

  • Li, H., & Suen, W. (2004). Delegating decisions to experts. Journal of Political Economy, 112(1), 311–335.

    Article  Google Scholar 

  • Mengel, F., & Rivas, J. (2017). Common value elections with private information and informative priors: Theory and experiments. Games and Economic Behavior, 104, 190–221.

    Article  Google Scholar 

  • Noh, H. (2007). Essays on committee decision making. Ph.D. dissertation, University of Illinois.

  • Persico, N. (2004). Committee design with endogenous information. The Review of Economic Studies, 71(1), 165–191.

    Article  Google Scholar 

  • Pollard, P. (2004). Monetary policy-making around the world, Powerpoint presentation. Federal Reserve Bank of St. Louis.

  • Wit, J. (1998). Rational choice and the Condorcet jury theorem. Games and Economic Behavior, 22(2), 364–376.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hyoungsik Noh.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This paper is based on Essay 1 of my Ph.D. dissertation from the University of Illinois, Noh (2007). I thank the editor and the referee for their helpful comments. The remaining errors are my own.

Appendix: Proofs

Appendix: Proofs

Proof of Lemma 1

The ‘more informative’ part

It is well known that Blackwell’s notion of being more informative is equivalent to the existence of a stochastic matrix that transforms one matrix to another. An information structure expressed by a matrix P is more informative than Q if and only if there exists a Markov matrix M such that

$$\begin{aligned} P\times M=Q \end{aligned}$$

Let

$$\begin{aligned}P=\left( \begin{array}{cc} \gamma _2 &{} 1-\delta _2 \\ 1-\gamma _2 &{} \delta _2 \\ \end{array} \right) \end{aligned}$$

and

$$\begin{aligned}Q=\left( \begin{array}{cc} \gamma _1 &{} 1-\delta _1 \\ 1-\gamma _1 &{} \delta _1 \\ \end{array} \right) \end{aligned}$$

Since \(\gamma _2+\delta _2>1\), P is invertible. Then,

$$\begin{aligned}M=\frac{1}{\gamma _2+\delta _2-1}\left( \begin{array}{cc} \gamma _1\delta _2-(1-\gamma _2)(1-\delta _1) &{} \gamma _2(1-\delta _1)-\gamma _1(1-\delta _2) \\ (1-\gamma _1)\delta _2-(1-\gamma _2)\delta _1 &{} \gamma _2\delta _1-(1-\gamma _1)(1-\delta _2) \\ \end{array} \right) \end{aligned}$$

We need to evaluate the nonnegativity of each element in the matrix.

\(M_{11}\ge 0\) iff \(\gamma _1\delta _2-(1-\gamma _2)(1-\delta _1)\ge 0\), which is equivalent to \(\frac{\delta _2}{1-\gamma _2}\ge \frac{1-\delta _1}{\gamma _1}\). From \(\gamma _2+\delta _2>1\) we derive \(\delta _2>1-\gamma _2\) and hence,

$$\begin{aligned} \frac{\delta _2}{1-\gamma _2}>1 \end{aligned}$$
(25)

From \(\gamma _1+\delta _1>1\), it is derived that \(\gamma _1>1-\delta _1\) and hence,

$$\begin{aligned} \frac{1-\delta _1}{\gamma _1}<1 \end{aligned}$$
(26)

Based on (25) and (26), the condition \(M_{11}\ge 0\) always holds.

\(M_{22}\ge 0\) iff \(\gamma _2\delta _1-(1-\gamma _1)(1-\delta _2)\ge 0\), which is equivalent to \(\frac{\gamma _2}{1-\delta _2}\ge \frac{1-\gamma _1}{\delta _1}\). From \(\gamma _2+\delta _2>1\), it is derived that \(\gamma _2>1-\delta _2\); hence,

$$\begin{aligned} \frac{\gamma _2}{1-\delta _2}>1 \end{aligned}$$
(27)

From \(\gamma _1+\delta _1>1\), \(\delta _1>1-\gamma _1\) is derived; hence,

$$\begin{aligned} \frac{1-\gamma _1}{\delta _1}<1 \end{aligned}$$
(28)

Based on (27) and (28), the condition \(M_{22}\ge 0\) always holds.

\(M_{21}\ge 0\) iff \((1-\gamma _1)\delta _2-(1-\gamma _2)\delta _1\ge 0\), which is equivalent to the following:

$$\begin{aligned} \frac{\delta _2}{1-\gamma _2}\ge \frac{\delta _1}{1-\gamma _1} \end{aligned}$$
(29)

\(M_{12}\ge 0\) iff \(\gamma _2(1-\delta _1)-\gamma _1(1-\delta _2)\ge 0\), which is equivalent to the following:

$$\begin{aligned} \frac{\gamma _2}{1-\delta _2}\ge \frac{\gamma _1}{1-\delta _1} \end{aligned}$$
(30)

Therefore, information system \((\gamma _2,\delta _2)\) is more informative than information system \((\gamma _1,\delta _1)\) if and only if (29) and (30) hold.

The ‘more conservative’ part

By construction,

$$\begin{aligned} \frac{\gamma (\kappa )}{1-\delta (\kappa )}=\frac{F(\kappa |I)}{F(\kappa |G)} \end{aligned}$$

The derivative of this with respect to \(\kappa\) yields the following:

$$\begin{aligned} \frac{d}{d\kappa }\left( \frac{\gamma (\kappa )}{1-\delta (\kappa )}\right) =\frac{f(\kappa |I)F(\kappa |G)-F(\kappa |I)f(\kappa |G)}{F(\kappa |G)^2} \end{aligned}$$

Therefore, \(\frac{\gamma (\kappa )}{1-\delta (\kappa )}\) is strictly decreasing iff

$$\begin{aligned} \frac{f(\kappa |I)}{f(\kappa |G)}<\frac{F(\kappa |I)}{F(\kappa |G)} \end{aligned}$$
(31)

The model assumes that \(f(\eta )\) is a normal density, \(G=1\) and \(I=0\). By the property of normal distribution, the MLRP holds and for \(\theta <\kappa\) and \(I=0<G=1\) we have the following:

$$\begin{aligned} \frac{f(\kappa |G)}{f(\kappa |I)}>\frac{f(\theta |G)}{f(\theta |I)} \end{aligned}$$

Put differently,

$$\begin{aligned} \frac{f(\theta |I)}{f(\kappa |I)}>\frac{f(\theta |G)}{f(\kappa |G)} \end{aligned}$$

Integrating both sides of this with respect to \(\theta\) from \(-\infty\) to \(\kappa\) yields the following:

$$\begin{aligned} \frac{F(\kappa |I)}{f(\kappa |I)}>\frac{F(\kappa |G)}{f(\kappa |G)}, \end{aligned}$$

which tells us that the condition (31) holds.

On the other hand,

$$\begin{aligned} \frac{\delta (\kappa )}{1-\gamma (\kappa )}=\frac{1-F(\kappa |G)}{1-F(\kappa |I)} \end{aligned}$$

The derivative of this with respect to \(\kappa\) yields the following:

$$\begin{aligned} \frac{d}{d\kappa }\left( \frac{\delta (\kappa )}{1-\gamma (\kappa )}\right) =\frac{-f(\kappa |G)(1-F(\kappa |I)+f(\kappa |I)(1-F(\kappa |G)}{(1-F(\kappa |I))^2} \end{aligned}$$

Therefore,\(\frac{\delta (\kappa )}{1-\gamma (\kappa )}\) is strictly increasing iff

$$\begin{aligned} \frac{f(\kappa |I)}{f(\kappa |G)}<\frac{1-F(\kappa |I)}{1-F(\kappa |G)} \end{aligned}$$
(32)

Provided by the property of MLRP, for \(\theta '>\kappa\) and \(G=1>I=0\),

$$\begin{aligned} \frac{f(\theta '|G)}{f(\theta '|I)}>\frac{f(\kappa |G)}{f(\kappa |I)} \end{aligned}$$

Put differently,

$$\begin{aligned} \frac{f(\theta '|G)}{f(\kappa |G)}>\frac{f(\theta '|I)}{f(\kappa |I)} \end{aligned}$$

and integrating both sides of this inequality with respect to \(\theta '\) from \(\kappa\) to \(\infty\) yields the following:

$$\begin{aligned} \frac{1-F(\kappa |G)}{f(\kappa |G)}>\frac{1-F(\kappa |I)}{f(\kappa |I)} \end{aligned}$$

which is the same condition as (32). \(\square\)

Proof of Proposition 3

Note that \(\left\{ piv,i\right\}\) is a joint event in the following inequality. Signals are conditionally independent, and pivotality is generated by the responsive strategies (a strategy that is a function of a signal). Hence, \(\left\{ piv\right\}\) and \(\left\{ i\right\}\) are independent events conditional on a true state.

Then, using Bayes’ rule, \(\Pr (G|piv,i)\) is given by the following:

$$\begin{aligned} \Pr (G|piv,i)&=\frac{\Pr (piv,i|G)\Pr (G)}{\Pr (piv,i|G)\Pr (G)+\Pr (piv,i|I)\Pr (I)} \end{aligned}$$
(33)
$$\begin{aligned}&=\frac{\Pr (piv|G)\Pr (i|G)\Pr (G)}{\Pr (piv|G)\Pr (i|G)\Pr (G)+\Pr (piv|I)\Pr (i|I)\Pr (I)}\end{aligned}$$
(34)
$$\begin{aligned}&=\frac{\Pr (G|piv)\Pr (piv)(1-\delta )}{\Pr (G|piv)\Pr (piv)(1-\delta )+\Pr (I|piv)\Pr (piv)\gamma }\end{aligned}$$
(35)
$$\begin{aligned}&=\frac{\Pr (G|piv)(1-\delta )}{\Pr (G|piv)(1-\delta )+(1-\Pr (G|piv))\gamma } \end{aligned}$$
(36)

The first equality is a direct application of Bayes’ rule. The second equality comes from conditional independence. The third one uses the equalities of Bayes’ rule, \(\Pr (piv|G)\Pr (G)=\Pr (G|piv)\Pr (piv)\) and \(\Pr (piv|I)\Pr (I)=\Pr (I|piv)\Pr (piv)\).

Hence,

$$\begin{aligned} q\ge \frac{\Pr (G|piv)(1-\delta )}{\Pr (G|piv)(1-\delta )+(1-\Pr (G|piv))\gamma }, \end{aligned}$$

which gives the following:

$$\begin{aligned} \Pr (G|piv)(1-\delta )-q\Pr (G|piv)(1-\delta )\le q(1-\Pr (G|piv))\gamma \end{aligned}$$

Hence

$$\begin{aligned} \Pr (G|piv)\le \frac{q\gamma }{(1-\delta )(1-q)+q\gamma } \end{aligned}$$
(37)

Define \(W_G\) as the probability that a voter votes for conviction, that is, uses strategy c when the defendant is guilty and \(W_I\) as the probability that a voter votes for conviction when the defendant is innocent. Let \(\nu (s)\) be the probability of voting for conviction in the presence of signal s. Then,

$$\begin{aligned} \Pr (c|G)=W_G=&\Pr (g|G)\nu (g)+\Pr (i|G)\nu (i)=\delta \nu (g)+(1-\delta )\nu (i) \end{aligned}$$
(38)
$$\begin{aligned} \Pr (c|I)=W_I=&\Pr (i|I)\nu (i)+\Pr (g|I)\nu (g)=\gamma \nu (i)+(1-\gamma )\nu (g) \end{aligned}$$
(39)

Based on this definition and the fact that conviction, C, is the event in which the pivotal voter votes for conviction, C is seen as the joint event of pivotality and the use of strategy; c. piv and c are conditionally independent for the same reason that piv and i are conditionally independent.

$$\begin{aligned} \Pr (G|piv,c)&=\frac{\Pr (piv,c|G)\Pr (G)}{\Pr (piv,c|G)\Pr (G)+\Pr (piv,c|I)\Pr (I)} \end{aligned}$$
(40)
$$\begin{aligned}&=\frac{\Pr (piv|G)\Pr (c|G)\Pr (G)}{\Pr (piv|G)\Pr (c|G)\Pr (G)+\Pr (piv|I)\Pr (c|I)\Pr (I)}\end{aligned}$$
(41)
$$\begin{aligned}&=\frac{\Pr (G|piv)\Pr (piv)W_G}{\Pr (G|piv)\Pr (piv)W_G+\Pr (I|piv)\Pr (piv)W_I}\end{aligned}$$
(42)
$$\begin{aligned}&=\frac{\Pr (G|piv)W_G}{\Pr (G|piv)W_G+(1-\Pr (G|piv))W_I} \end{aligned}$$
(43)

That is,

$$\begin{aligned} \Pr (G|C)=\frac{1}{1+\left( \frac{1}{\Pr (G|piv)}-1\right) \frac{W_I}{W_G}} \end{aligned}$$
(44)

Again, the first equality is a direct application of Bayes’ rule. The second equality comes from conditional independence. The third one uses the equalities of Bayes’ rule, \(\Pr (piv|G)\Pr (G)=\Pr (G|piv)\Pr (piv)\) and \(\Pr (piv|I)\Pr (I)=\Pr (I|piv)\Pr (piv)\).

On the other hand,

$$\begin{aligned}&\delta W_I-(1-\gamma )W_G \end{aligned}$$
(45)
$$\begin{aligned}&=\delta \gamma \nu (i)+\delta (1-\gamma )\nu (g)-(1-\gamma )\delta \nu (g)-(1-\gamma )(1-\delta )\nu (i)\end{aligned}$$
(46)
$$\begin{aligned}&=\left[ \delta \gamma -(1-\gamma )(1-\delta )\right] \nu (i) \end{aligned}$$
(47)
$$\begin{aligned}&=\left[ \gamma +\delta -1\right] \nu (i)\end{aligned}$$
(48)
$$\begin{aligned}&\ge 0 \end{aligned}$$
(49)

The last inequality comes from \(2>\gamma +\delta >1\), which indicates that the information structure is informative but not perfectly informative and that \(\nu (i)\ge 0\). From this, we have the following:

$$\begin{aligned} \frac{W_I}{W_G}\ge \frac{1-\gamma }{\delta } \end{aligned}$$
(50)

From (37)

$$\begin{aligned} \left( \frac{1}{\Pr (G|piv)}-1\right) \ge \frac{(1-\delta )(1-q)}{q\gamma } \end{aligned}$$

Combining this with (50) yields the following:

$$\begin{aligned} \left( \frac{1}{\Pr (G|piv)}-1\right) \frac{W_I}{W_G}\ge \frac{(1-\delta )(1-q)(1-\gamma )}{q\gamma \delta } \end{aligned}$$
(51)

Combining (51) with (44) yields the following:

$$\begin{aligned} \Pr (G|C)&\le \frac{1}{1+\frac{(1-\delta )(1-q)(1-\gamma )}{q\gamma \delta }} \end{aligned}$$
(52)
$$\begin{aligned}&=\frac{q\delta \gamma }{q\delta \gamma +(1-q)(1-\delta )(1-\gamma )} \end{aligned}$$
(53)

Therefore,

$$\begin{aligned} \Pr (I|C)&\ge \frac{(1-q)(1-\delta )(1-\gamma )}{q\delta \gamma +(1-q)(1-\delta )(1-\gamma )}\end{aligned}$$
(54)
$$\begin{aligned}&=\frac{1}{\left( \frac{q}{1-q}\right) \left( \frac{\delta }{1-\gamma }\right) \left( \frac{\gamma }{1-\delta }\right) +1} \end{aligned}$$
(55)

which completes the proof. \(\square\)

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Noh, H. Conservativeness in jury decision-making. Theory Decis 95, 151–172 (2023). https://doi.org/10.1007/s11238-022-09915-7

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11238-022-09915-7

Keywords

Navigation