Skip to main content
Log in

Hyperbolic discount curves: a reply to Ainslie

  • Published:
Theory and Decision Aims and scope Submit manuscript

Abstract

Ainslie (Theory and Decision, 73, 3–34, 2012) challenges our interpretation of the properties of hyperbolic discount curves in an iterated prisoners’ dilemma (IPD) model. In this reply, we discuss the emergence of hyperbolic discount functions in the behavioral economics literature and evaluate their properties. Furthermore, we present a summarized version of our IPD model and evaluate Ainslie’s points of contention.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. In Herrnstein’s experiment, pigeons in an operant chamber could peck at one of two response-keys, each of which was on a variable interval (VI) reinforcement schedule. The experiment used concurrent schedules of intermittent reward (VI-VI).

  2. Many writers in behavioral economics get this point wrong. In fact, no paper until Ainslie (1975) pointed out that the “matching relationship” would be a hyperbola if applied to individual, discrete choices, and thus cause preference reversals.

  3. Ainslie’s hyperbolic function is such that events \(\tau \) periods away are discounted with factor \(\frac{1}{\tau }\).

  4. The function was later used by Laibson (1997) to model intrapersonal dynamic conflict.

  5. Myopia in common usage is near-sightedness. O’Donoghue and Rabin (1999) define the term as “present-biased” time preference in the context of intertemporal choice.

  6. The elasticity of \(D(t)\) with respect to \(t\) represents the ratio of the incremental change of the logarithm of \(D(t)\) with respect to the incremental change of the logarithm of \(t\).

  7. The constant in the denominator of Eq. 1 ensures that the value of the function is equal to 1 if either \(k=0\) or \(t=0\). Otherwise, the value of the function is not defined at this point.

  8. The derivative of \(D(t)\) with respect to \(t\) measures the change in the function as the time delay changes marginally holding the discount rate \(k\) constant.

  9. The existing models primarily utilize \(\beta -\delta \) discounting.

  10. It is usual practice in economics to model a time-inconsistent agent as a sequence of sub-agents, in effect splitting her up on diachronic dimensions (see Ross 2005).

  11. Refer to Ainslie (2012) for a review of the models.

  12. “selves” here representing “oneself in different motivational states”.

  13. To draw an analogy with our earlier discussion, the limited conflict described here is also a feature of interpersonal bargaining where it gives rise to self-enforcing agreements.

  14. Notably, the effect in not observed among the group of non smokers.

  15. In the normal form specification, strategies are equivalent to actions.

  16. The Prisoners’ Dilemma game was originally framed by Merrill Flood and Melvin Dresher working at RAND Corporation in 1950 and later formalized by Albert W. Tucker.

  17. This is an implication of the folk theorem which states that in repeated games, conditional on players’ minimax conditions being satisfied, any outcome is a feasible solution concept.

  18. This result is standard since most folk-theorem analysis employ an exponential discount function.

  19. refer to Sect. 2.3 for a summary of the function.

  20. Streich and Levy (2007) obtain the same conditions for the discount factor when comparing a tit-for-tat strategy versus an always-defect strategy in the same game.

  21. \(\beta \) here reflects the degree of “present-biased” time preferences (refer to Sect. 2.3).

  22. In our case, therefore, one may specify within limits any set of values for \(A, \; C, \; D, \;Z,\) and \(\beta \) and obtain a value for \(\delta \) for which (Cooperate, Cooperate) constitutes an SPE.

  23. In particular, \(\sum \nolimits _{t=0}^\infty \delta ^{t}= \frac{1}{1-\delta }\) and \(\sum \nolimits _{t=1}^\infty \delta ^{t}= \frac{\delta }{1-\delta }\).

References

  • Ainslie, G. (1975). Specious reward: A behavioral theory of impulsiveness and impulse control. Psychological Bulletin, 82(4), 463–496.

    Article  Google Scholar 

  • Ainslie, G. (1992). Picoeconomics: The strategic interaction of successive motivational states within the person. New York: Cambridge University Press.

    Google Scholar 

  • Ainslie, G. (2001). Breakdown of will. New York: Cambridge University Press.

    Book  Google Scholar 

  • Ainslie, G. (2012). Pure hyperbolic discount curves predict “eyes open” self-control. Theory and Decision, 73, 3–34.

    Article  Google Scholar 

  • Albuquerque, R., & Hopenhayn, H. A. (2004). Optimal lending contracts and firm dynamics. Review of Economic Studies, 71(2), 285–315.

    Article  Google Scholar 

  • Bulow, J., & Rogoff, K. (1989). Sovereign debt: Is to forgive to forget? The American Economic Review, 1, 43–50.

    Google Scholar 

  • Deutsch, M. (1960). Trust, trustworthiness, and the F-scale. Journal of Abnormal and Social Psychology, 61, 138–140.

    Article  Google Scholar 

  • Fredrick, S., Loewenstein, G., & O’Donoghue, T. (2002). Time discounting and time preference: A critical review. Journal of Economic Literature, 40(2), 351–401.

    Article  Google Scholar 

  • Frydman, R., & Goldberg, M. D. (2007). Imperfect knowledge economics: Exchange rates and risk. Princeton: Princeton University Press.

    Google Scholar 

  • Harris, M., & Holmstrom, B. (1982). A theory of wage dynamics. Review of Economic Studies, 44(3), 315–333.

    Article  Google Scholar 

  • Herrnstein, R. (1961). Relative and absolute strengths of response as a function of frequency of reinforcement. Journal of the Experimental Analysis of Animal Behavior, 4, 267–272.

    Article  Google Scholar 

  • Hofmeyr, A., Ainslie, G., Charlton, R., & Ross, D. (2010). The relationship between addiction and reward bundling: An experiment comparing smokers and non-smokers. Addiction, 106, 402–409.

    Article  Google Scholar 

  • Kirby, K. N., & Guastello, B. (2001). Making choices in anticipation of similar future choices can increase self-control. Journal of Experimental Psychology: Applied, 7, 154–164.

    Google Scholar 

  • La Porta, R., Lopez-de-Silanes, F., Shleifer, A., & Vishny, R. W. (1997). Trust in large organizations. American Economic Review Papers and Proceedings, 87(2), 333–338.

    Google Scholar 

  • Laibson, D. (1997). Golden eggs and hyperbolic discounting. The Quarterly Journal of Economics, 112(2), 443–477.

    Article  Google Scholar 

  • Loewenstein, G., & Prelec, D. (1992). Anomalies in intertemporal choice: Evidence and an interpretation. The Quarterly Journal of Economics, 107(2), 573–597.

    Article  Google Scholar 

  • Mazur, J. E. (1987). An adjustment procedure for studying delayed reinforcement. In M. L. Commons, J. E. Mazur, J. A. Nevin, & H. Rachlin (Eds.), Quantitative analyses of behavior V: The effect of delay and of intervening events on reinforcement value. Hillsdale: Erlbaum.

    Google Scholar 

  • Musau, A. (2009). Modeling alternatives to exponential discounting. Master thesis, University of Agder. http://brage.bibsys.no/hia/bitstream/URN:NBN:no-bibsys_brage_10519/4/Musau_thesis09.pdf. Accessed 21 March 2012.

  • O’Donoghue, T., & Rabin, M. (1999). Doing it now or later. American Economic Review, 89(1), 103–124.

    Article  Google Scholar 

  • Phelps, E. S., & Pollak, R. (1968). On second-best national saving and game-equilibrium growth. The Review of Economic Studies, 35(2), 185–199.

    Article  Google Scholar 

  • Ross, D. (2005). Economic theory and cognitive science: Microexplanation. Cambridge: MIT Press.

    Google Scholar 

  • Ross, D. (2010). Economic models of procrastination. In C. Andreou & M. White (Eds.), The thief of time. New York: Oxford University Press.

    Google Scholar 

  • Rubinstein, A. (1979). Equilibrium in supergames with overtaking criterion. Journal of Economic Theory, 21, 1–9.

    Article  Google Scholar 

  • Rubinstein, A. (1998). Modeling bounded rationality. Cambridge: MIT Press.

    Google Scholar 

  • Samuelson, P. A. (1937). A note on measurement of utility. The Review of Economic Studies, 4(2), 155–161.

    Article  Google Scholar 

  • Shapiro, C., & Stiglitz, J. (1984). Equilibrium unemployment as a worker discipline device. American Economic Review, 74(3), 433–444.

    Google Scholar 

  • Shy, O. (1995). Industrial organization: Theory and applications. Cambridge: MIT Press.

    Google Scholar 

  • Streich, P., & Levy, J. S. (2007). Time horizons, discounting, and intertemporal choice. Journal of Conflict Resolution, 51(2), 199–226.

    Article  Google Scholar 

  • Strotz, R. H. (1956). Myopia and inconsistency in dynamic utility maximization. The Review of Economic Studies, 23(3), 165–180.

    Article  Google Scholar 

  • Telser, L. G. (1980). A theory of self-enforcing agreements. Journal of Business, 53(1), 27–44.

    Article  Google Scholar 

  • Thomas, J., & Worrall, T. (1994). Foreign direct investment and the risk of expropriation. Review of Economic Studies, 61, 81–108.

    Article  Google Scholar 

Download references

Acknowledgments

I am especially grateful to Jochen Jungeilges for his comments, suggestions, and guidance in relation to the article that this paper references. I also thank Ellen K. Nyhus for numerous discussions on the subject, and George Ainslie for helpful comments. Financial support from the Competence Development Fund of Southern Norway is gratefully acknowledged.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andrew Musau.

Appendices

Appendix 1: finitely iterated prisoners’ dilemma

Consider the IPD model described in Sect. 4 and suppose that the game is repeated \(T\) times in periods \(1, 2,\ldots , T\) where \(T\in \mathbb Z \) s.t. \(1 \le T \ge \infty \). If this is common knowledge, we prove that the game has a unique SPE in which each firm plays Defect at all periods. We split the proof into 2 parts.

  •   [Part 1: First, we determine the Nash equilibrium of the stage game using the best response function of firm \(i\in N\):

  • Definition \(D-2\): In a 2 player game, the best response function of player \(i\) is the function \(R^{i}(s_{j})\) that for every given strategy \(s_{j}\) of player \(j\) assigns a strategy \(s_{i}= R^{i}(s_{j})\) that maximizes player \(i\)’s payoff \(\pi ^{i}(s_{i},s_{j})\) (Shy 1995, p. 21).

  • From the model description in Sect. 4, the best response function of firm \(i\in N\) is:

    $$\begin{aligned} R^{i}(s_{j}) ={\left\{ \begin{array}{ll} Defect \;\;if\;s_{j} \;=\;Cooperate &{}\\ Defect \;\; if\;s_{j} \;=\; Defect &{} \end{array}\right. } \end{aligned}$$
    (34)
  • Definition \(D-3\): An outcome \(\hat{s} = (\hat{s}^{1}, \hat{s}^{2},\ldots , \hat{s}^{N})\) (where \(\hat{s}^{i}\in S_{i}\) for every \(i= 1,2,\ldots ,N\)) is said to be a Nash equilibrium (NE) if for every player \(i\), \(\pi ^{i}(\hat{s}^{i}, \hat{s}^{\lnot i})\ge \pi ^{i}(s^{i}, \hat{s}^{\lnot i})\) for every \(s^{i}\in S_{i}\) (Shy 1995, p.18).

    • For the outcome \(s^{1}= (s_{i}^{1}, s_{i}^{1})\); \(\pi ^{1}(s_{i}^{1}, s_{i}^{1})= C < \pi ^{1}(s_{i}^{2}, s_{i}^{1})= A\), contradicting \(D-3\; \Rightarrow \; s^{1}\) is not NE.

    • For the outcome \(s^{2}= (s_{i}^{1}, s_{i}^{2})\); \(\pi ^{1}(s_{i}^{1}, s_{i}^{2})= Z < \pi ^{1}(s_{i}^{2}, s_{i}^{2})= D\), contradicting \(D-3\; \Rightarrow \; s^{2}\) is not NE.

    • For the outcome \(s^{3}= (s_{i}^{2}, s_{i}^{1})\); \(\pi ^{2}(s_{i}^{2}, s_{i}^{1})= Z < \pi ^{2}(s_{i}^{2}, s_{i}^{2})= D\), contradicting \(D-3\; \Rightarrow \; s^{3}\) is not NE.

    • For the outcome \(s^{4}= (s_{i}^{2}, s_{i}^{2})\); \(\pi ^{1}(s_{i}^{2}, s_{i}^{2})= D > \pi ^{1}(s_{i}^{1}, s_{i}^{2})= Z\) & \(\pi ^{2}(s_{i}^{2}, s_{i}^{2})= D > \pi ^{2}(s_{i}^{2}, s_{i}^{1})= Z\) \(\Rightarrow \; s^{4}\) is NE.

  • The outcome \(s^{4}= (Defect, Defect )\) thus constitutes an equilibrium in dominant strategies (EDS) and a unique NE for the stage IPD game.

  •   [Part 2: Having established that (Defect, Defect) is an NE of the stage game, we suppose that both firms have played the IPD game in \(T-1\) periods and they are ready to play for one last time in period \(T\). At this point, the game is identical to the stage game and firm \(i\in N\) plays its dominant strategy Defect (refer to the best response function of firm \(i\) in Eq. 34). Therefore, the outcome of the game is the NE of the stage game (Defect, Defect). Now consider the game in period \(T-1\). Both firms know that following this period, they will have one game to play and the outcome of the game involves both playing Defect. Again, at this period, both firms will play their dominant strategy resulting in the outcome (Defect, Defect). Using backward induction, we note that at each period \(T-2, T-3, \ldots ,1\), the outcome where both firms play Defect will result hence SPE. \(\square \)

Appendix 2: cooperation under exponential discounting

Consider Case 1 and Case 2 defined in Sect. 4. We prove that the outcome (Cooperate, Cooperate) is SPE under exponential discounting if \(\delta \ge \frac{A-C}{A-D}\).

  • The sum of discounted payoffs under Case 1 is given by:

    $$\begin{aligned} C + \delta C + \delta ^{2} C + \ldots \end{aligned}$$
    (35)

    To find the sum of the series in Eq. 35, we exploit a property of the exponential discount function. Claim: The following sum, \(1 + \delta + \delta ^{2} + \ldots \), converges to \(\frac{1}{1-\delta }\) if \(\delta < 1\).

Proof

Define the partial sums of the series as follows: \(s_{1} = 1\), \(s_{2} = 1+ \delta \), \(s_{3} =1 + \delta + \delta ^{2}\),..., \(s_{n} =1 + \delta + \cdots + \delta ^{n-1}\) where \(s_{i}\) represents the \(i\)th partial sum \((i=1,2,\ldots n)\). Multiply \(s_{n}\) by \(\delta \) and obtain \(\delta s_{n} = \delta + \delta ^{2}+ \cdots + \delta ^{n}\). Subtract \(\delta s_{n}\) from \(s_{n}\) and obtain: \(s_{n} - \delta s_{n} = 1 - \delta ^{n}\). Solve for \(s_{n}\): \(s_{n} = \frac{1 - \delta ^{n}}{1 - \delta },\;\;(\delta \ne 1)\). Finally taking the value for \(s_{n}\), note that if \(\mid \delta \mid \!<\! 1\) then \(\delta ^{n}\!\rightarrow \! 0\) as \(n \!\rightarrow \! \infty \) and \(s_{n}\!\rightarrow \! \frac{1}{1 -\delta }\). \(\square \)

  • From this property, we establish that the sum in Eq. 35 is \(C\left( \frac{1}{1-\delta }\right) \)

  • The sum of discounted payoffs under Case 2 is given by:

    $$\begin{aligned} A + \delta D + \delta ^{2} D + \cdots = A + D\left( \frac{\delta }{1-\delta }\right) \end{aligned}$$
    (36)
  • For the outcome (Cooperate, Cooperate) to be an SPE, we require that:

    $$\begin{aligned} C\left( \frac{1}{1-\delta }\right) \ge A + D\left( \frac{\delta }{1-\delta }\right) \end{aligned}$$
    (37)
    $$\begin{aligned} \Leftrightarrow \frac{C}{1-\delta }\ge A+ \frac{\delta D}{1-\delta }\Leftrightarrow \frac{C-\delta D}{1-\delta }\ge A \Leftrightarrow C- \delta D \ge A - \delta A \end{aligned}$$
    $$\begin{aligned} \Leftrightarrow \delta (A-D)\ge A-C \Leftrightarrow \delta \ge \frac{A-C}{A-D}\square \end{aligned}$$

Appendix 3: cooperation under quasi-hyperbolic discounting

Consider Case 1 and Case 2 defined in Sect. 4. We prove that the outcome (Cooperate, Cooperate) is SPE under quasi-hyperbolic discounting if \(\delta \ge \frac{A-C}{\beta (C-D)+A-C}\).

  • The sum of discounted payoffs under Case 1 is given by:

    $$\begin{aligned} C +\beta \delta C + \beta \delta ^{2} C + \cdots \end{aligned}$$
    (38)

    Similarly, we exploit the convergence property of exponential discounting to determine this sum. The sum in Eq. 38 is thus:

    $$\begin{aligned} \beta C\left( \frac{\delta }{1-\delta }\right) + C \end{aligned}$$
  • The sum of discounted payoffs under Case 2 is given by:

    $$\begin{aligned} A +\beta \delta D + \beta \delta ^{2} D + \cdots \end{aligned}$$
    (39)

    The sum in Eq. 39 is:

    $$\begin{aligned} \beta D\left( \frac{\delta }{1-\delta }\right) + A \end{aligned}$$
  • For the outcome (Cooperate, Cooperate) to be an SPE, we require that:

    $$\begin{aligned} \beta C\left( \frac{\delta }{1-\delta }\right) + C \ge \beta D\left( \frac{\delta }{1-\delta }\right) + A \end{aligned}$$
    (40)
    $$\begin{aligned} \Leftrightarrow \frac{\delta (\beta C - \beta D)}{1-\delta }\ge A-C \Leftrightarrow \delta (\beta C - \beta D)\ge (A-C) (1-\delta ) \end{aligned}$$
    $$\begin{aligned} \Leftrightarrow \delta (\beta C- \beta D + A - C) \ge A- C \Leftrightarrow \delta \ge \frac{A-C}{\beta (C-D)+ A-C}\square \end{aligned}$$

Appendix 4: analysis of the break-even quasi-hyperbolic discount factor

We show that the following relation in Eq. 20 holds:

$$\begin{aligned} \frac{d}{d\beta }\; \delta ^{*}(\beta )<0 \end{aligned}$$

Define \((A-C)\) as \(\alpha \) and \((C-D)\) as \(\gamma \) in Eq. 19 and re-write \(\delta ^{*}(\beta )\) as follows:

$$\begin{aligned} \delta ^{*}(\beta ) = \frac{A-C}{\beta (C-D) +A-C}= \frac{\alpha }{\beta \gamma + \alpha } \end{aligned}$$

Differentiating \(\delta ^{*}(\beta )\) with respect to \(\beta \), we obtain:

$$\begin{aligned} \frac{d}{d\beta }\left( \frac{\alpha }{\beta \gamma + \alpha }\right) = - \frac{\alpha \gamma }{(\beta \gamma + \alpha )^{2}} \end{aligned}$$

From the model description in Sect. 4, we have that \(A>\;C>\;D\) implying \(\alpha >0\) and \(\gamma >0\):

$$\begin{aligned} \alpha = \underbrace{(A - C)}_{ +}\;\;\;\gamma = \underbrace{(C - D)}_{ +} \;\;\;\Rightarrow (A-C)(C-D)\;>\;0 \end{aligned}$$

Therefore, we establish the result in Eq. 20:

$$\begin{aligned} - \frac{\alpha \gamma }{(\beta \gamma + \alpha )^{2}}\;<\;0\;\Leftrightarrow - \frac{(A-C)(C-D)}{(\beta (C-D) + (A-C))^{2}}\;<\;0 \end{aligned}$$

Similarly, we show that the following relation holds for the second order derivative:

$$\begin{aligned} \frac{d^{2}}{d\beta ^{2}}\;\delta ^{*}(\beta )> 0 \end{aligned}$$
$$\begin{aligned} \frac{d^{2}}{d\beta ^{2}}\left( \frac{A - C}{\beta (C - D) + A - C}\right) = \frac{d}{d\beta }\;\left( \frac{-\alpha \gamma }{(\beta \gamma + \alpha )^{2}}\right) =-\alpha \gamma \;\frac{d}{d\beta }\left( \frac{1}{(\beta \gamma + \alpha )^{2}}\right) \end{aligned}$$
$$\begin{aligned} = -\alpha \gamma \;\cdot \;\frac{-\;2\gamma (\beta \gamma + \alpha )}{(\beta \gamma + \alpha )^{4}}= 2\alpha \gamma ^{2}\;\frac{1}{(\beta \gamma + \alpha )^{3}}\;>\;0 \end{aligned}$$
$$\begin{aligned} \Leftrightarrow \;\frac{2(A-C)(C-D)^{2}}{(\beta (C-D) + (A-C))^{3}}\;>\;0 \end{aligned}$$

\(\square \)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Musau, A. Hyperbolic discount curves: a reply to Ainslie. Theory Decis 76, 9–30 (2014). https://doi.org/10.1007/s11238-013-9361-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11238-013-9361-8

Keywords

JEL Classification

Navigation