Hyperbolic discount curves: a reply to Ainslie

Musau, Andrew

doi:10.1007/s11238-013-9361-8

Hyperbolic discount curves: a reply to Ainslie

Published: 24 March 2013

Volume 76, pages 9–30, (2014)
Cite this article

Theory and Decision Aims and scope Submit manuscript

Andrew Musau^1,2

964 Accesses
3 Citations
Explore all metrics

Abstract

Ainslie (Theory and Decision, 73, 3–34, 2012) challenges our interpretation of the properties of hyperbolic discount curves in an iterated prisoners’ dilemma (IPD) model. In this reply, we discuss the emergence of hyperbolic discount functions in the behavioral economics literature and evaluate their properties. Furthermore, we present a summarized version of our IPD model and evaluate Ainslie’s points of contention.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Term Structure of Psychological Discount Rate: Characteristics and Functional Forms

Remembering Kenneth Arrow: discount rates

Article 21 February 2018

Intertemporal Choice

Notes

In Herrnstein’s experiment, pigeons in an operant chamber could peck at one of two response-keys, each of which was on a variable interval (VI) reinforcement schedule. The experiment used concurrent schedules of intermittent reward (VI-VI).
Many writers in behavioral economics get this point wrong. In fact, no paper until Ainslie (1975) pointed out that the “matching relationship” would be a hyperbola if applied to individual, discrete choices, and thus cause preference reversals.
Ainslie’s hyperbolic function is such that events $\tau $ periods away are discounted with factor $\frac{1}{\tau }$.
The function was later used by Laibson (1997) to model intrapersonal dynamic conflict.
Myopia in common usage is near-sightedness. O’Donoghue and Rabin (1999) define the term as “present-biased” time preference in the context of intertemporal choice.
The elasticity of $D(t)$ with respect to $t$ represents the ratio of the incremental change of the logarithm of $D(t)$ with respect to the incremental change of the logarithm of $t$.
The constant in the denominator of Eq. 1 ensures that the value of the function is equal to 1 if either $k=0$ or $t=0$. Otherwise, the value of the function is not defined at this point.
The derivative of $D(t)$ with respect to $t$ measures the change in the function as the time delay changes marginally holding the discount rate $k$ constant.
The existing models primarily utilize $\beta -\delta $ discounting.
It is usual practice in economics to model a time-inconsistent agent as a sequence of sub-agents, in effect splitting her up on diachronic dimensions (see Ross 2005).
Refer to Ainslie (2012) for a review of the models.
“selves” here representing “oneself in different motivational states”.
To draw an analogy with our earlier discussion, the limited conflict described here is also a feature of interpersonal bargaining where it gives rise to self-enforcing agreements.
Notably, the effect in not observed among the group of non smokers.
In the normal form specification, strategies are equivalent to actions.
The Prisoners’ Dilemma game was originally framed by Merrill Flood and Melvin Dresher working at RAND Corporation in 1950 and later formalized by Albert W. Tucker.
This is an implication of the folk theorem which states that in repeated games, conditional on players’ minimax conditions being satisfied, any outcome is a feasible solution concept.
This result is standard since most folk-theorem analysis employ an exponential discount function.
refer to Sect. 2.3 for a summary of the function.
Streich and Levy (2007) obtain the same conditions for the discount factor when comparing a tit-for-tat strategy versus an always-defect strategy in the same game.
$\beta $ here reflects the degree of “present-biased” time preferences (refer to Sect. 2.3).
In our case, therefore, one may specify within limits any set of values for $A, \; C, \; D, \;Z,$ and $\beta $ and obtain a value for $\delta $ for which (Cooperate, Cooperate) constitutes an SPE.
In particular, $\sum \nolimits _{t=0}^\infty \delta ^{t}= \frac{1}{1-\delta }$ and $\sum \nolimits _{t=1}^\infty \delta ^{t}= \frac{\delta }{1-\delta }$.

References

Ainslie, G. (1975). Specious reward: A behavioral theory of impulsiveness and impulse control. Psychological Bulletin, 82(4), 463–496.
Article Google Scholar
Ainslie, G. (1992). Picoeconomics: The strategic interaction of successive motivational states within the person. New York: Cambridge University Press.
Google Scholar
Ainslie, G. (2001). Breakdown of will. New York: Cambridge University Press.
Book Google Scholar
Ainslie, G. (2012). Pure hyperbolic discount curves predict “eyes open” self-control. Theory and Decision, 73, 3–34.
Article Google Scholar
Albuquerque, R., & Hopenhayn, H. A. (2004). Optimal lending contracts and firm dynamics. Review of Economic Studies, 71(2), 285–315.
Article Google Scholar
Bulow, J., & Rogoff, K. (1989). Sovereign debt: Is to forgive to forget? The American Economic Review, 1, 43–50.
Google Scholar
Deutsch, M. (1960). Trust, trustworthiness, and the F-scale. Journal of Abnormal and Social Psychology, 61, 138–140.
Article Google Scholar
Fredrick, S., Loewenstein, G., & O’Donoghue, T. (2002). Time discounting and time preference: A critical review. Journal of Economic Literature, 40(2), 351–401.
Article Google Scholar
Frydman, R., & Goldberg, M. D. (2007). Imperfect knowledge economics: Exchange rates and risk. Princeton: Princeton University Press.
Google Scholar
Harris, M., & Holmstrom, B. (1982). A theory of wage dynamics. Review of Economic Studies, 44(3), 315–333.
Article Google Scholar
Herrnstein, R. (1961). Relative and absolute strengths of response as a function of frequency of reinforcement. Journal of the Experimental Analysis of Animal Behavior, 4, 267–272.
Article Google Scholar
Hofmeyr, A., Ainslie, G., Charlton, R., & Ross, D. (2010). The relationship between addiction and reward bundling: An experiment comparing smokers and non-smokers. Addiction, 106, 402–409.
Article Google Scholar
Kirby, K. N., & Guastello, B. (2001). Making choices in anticipation of similar future choices can increase self-control. Journal of Experimental Psychology: Applied, 7, 154–164.
Google Scholar
La Porta, R., Lopez-de-Silanes, F., Shleifer, A., & Vishny, R. W. (1997). Trust in large organizations. American Economic Review Papers and Proceedings, 87(2), 333–338.
Google Scholar
Laibson, D. (1997). Golden eggs and hyperbolic discounting. The Quarterly Journal of Economics, 112(2), 443–477.
Article Google Scholar
Loewenstein, G., & Prelec, D. (1992). Anomalies in intertemporal choice: Evidence and an interpretation. The Quarterly Journal of Economics, 107(2), 573–597.
Article Google Scholar
Mazur, J. E. (1987). An adjustment procedure for studying delayed reinforcement. In M. L. Commons, J. E. Mazur, J. A. Nevin, & H. Rachlin (Eds.), Quantitative analyses of behavior V: The effect of delay and of intervening events on reinforcement value. Hillsdale: Erlbaum.
Google Scholar
Musau, A. (2009). Modeling alternatives to exponential discounting. Master thesis, University of Agder. http://brage.bibsys.no/hia/bitstream/URN:NBN:no-bibsys_brage_10519/4/Musau_thesis09.pdf. Accessed 21 March 2012.
O’Donoghue, T., & Rabin, M. (1999). Doing it now or later. American Economic Review, 89(1), 103–124.
Article Google Scholar
Phelps, E. S., & Pollak, R. (1968). On second-best national saving and game-equilibrium growth. The Review of Economic Studies, 35(2), 185–199.
Article Google Scholar
Ross, D. (2005). Economic theory and cognitive science: Microexplanation. Cambridge: MIT Press.
Google Scholar
Ross, D. (2010). Economic models of procrastination. In C. Andreou & M. White (Eds.), The thief of time. New York: Oxford University Press.
Google Scholar
Rubinstein, A. (1979). Equilibrium in supergames with overtaking criterion. Journal of Economic Theory, 21, 1–9.
Article Google Scholar
Rubinstein, A. (1998). Modeling bounded rationality. Cambridge: MIT Press.
Google Scholar
Samuelson, P. A. (1937). A note on measurement of utility. The Review of Economic Studies, 4(2), 155–161.
Article Google Scholar
Shapiro, C., & Stiglitz, J. (1984). Equilibrium unemployment as a worker discipline device. American Economic Review, 74(3), 433–444.
Google Scholar
Shy, O. (1995). Industrial organization: Theory and applications. Cambridge: MIT Press.
Google Scholar
Streich, P., & Levy, J. S. (2007). Time horizons, discounting, and intertemporal choice. Journal of Conflict Resolution, 51(2), 199–226.
Article Google Scholar
Strotz, R. H. (1956). Myopia and inconsistency in dynamic utility maximization. The Review of Economic Studies, 23(3), 165–180.
Article Google Scholar
Telser, L. G. (1980). A theory of self-enforcing agreements. Journal of Business, 53(1), 27–44.
Article Google Scholar
Thomas, J., & Worrall, T. (1994). Foreign direct investment and the risk of expropriation. Review of Economic Studies, 61, 81–108.
Article Google Scholar

Download references

Acknowledgments

I am especially grateful to Jochen Jungeilges for his comments, suggestions, and guidance in relation to the article that this paper references. I also thank Ellen K. Nyhus for numerous discussions on the subject, and George Ainslie for helpful comments. Financial support from the Competence Development Fund of Southern Norway is gratefully acknowledged.

Author information

Authors and Affiliations

Faculty of Economics and Social Sciences, University of Agder, Gimlemoen 51, 4604 , Kristiansand, Norway
Andrew Musau
School of Social Sciences, University of Trento, Trento, Italy
Andrew Musau

Authors

Andrew Musau
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrew Musau.

Appendices

Appendix 1: finitely iterated prisoners’ dilemma

Consider the IPD model described in Sect. 4 and suppose that the game is repeated $T$ times in periods $1, 2,\ldots , T$ where $T\in \mathbb Z $ s.t. $1 \le T \ge \infty $. If this is common knowledge, we prove that the game has a unique SPE in which each firm plays Defect at all periods. We split the proof into 2 parts.

[Part 1: First, we determine the Nash equilibrium of the stage game using the best response function of firm $i\in N$:
Definition $D-2$: In a 2 player game, the best response function of player $i$ is the function $R^{i}(s_{j})$ that for every given strategy $s_{j}$ of player $j$ assigns a strategy $s_{i}= R^{i}(s_{j})$ that maximizes player $i$’s payoff $\pi ^{i}(s_{i},s_{j})$ (Shy 1995, p. 21).
From the model description in Sect. 4, the best response function of firm $i\in N$ is:
$$\begin{aligned} R^{i}(s_{j}) ={\left\{ \begin{array}{ll} Defect \;\;if\;s_{j} \;=\;Cooperate &{}\\ Defect \;\; if\;s_{j} \;=\; Defect &{} \end{array}\right. } \end{aligned}$$
(34)
Definition $D-3$: An outcome $\hat{s} = (\hat{s}^{1}, \hat{s}^{2},\ldots , \hat{s}^{N})$ (where $\hat{s}^{i}\in S_{i}$ for every $i= 1,2,\ldots ,N$) is said to be a Nash equilibrium (NE) if for every player $i$, $\pi ^{i}(\hat{s}^{i}, \hat{s}^{\lnot i})\ge \pi ^{i}(s^{i}, \hat{s}^{\lnot i})$ for every $s^{i}\in S_{i}$ (Shy 1995, p.18).
- For the outcome $s^{1}= (s_{i}^{1}, s_{i}^{1})$; $\pi ^{1}(s_{i}^{1}, s_{i}^{1})= C < \pi ^{1}(s_{i}^{2}, s_{i}^{1})= A$, contradicting $D-3\; \Rightarrow \; s^{1}$ is not NE.
- For the outcome $s^{2}= (s_{i}^{1}, s_{i}^{2})$; $\pi ^{1}(s_{i}^{1}, s_{i}^{2})= Z < \pi ^{1}(s_{i}^{2}, s_{i}^{2})= D$, contradicting $D-3\; \Rightarrow \; s^{2}$ is not NE.
- For the outcome $s^{3}= (s_{i}^{2}, s_{i}^{1})$; $\pi ^{2}(s_{i}^{2}, s_{i}^{1})= Z < \pi ^{2}(s_{i}^{2}, s_{i}^{2})= D$, contradicting $D-3\; \Rightarrow \; s^{3}$ is not NE.
- For the outcome $s^{4}= (s_{i}^{2}, s_{i}^{2})$; $\pi ^{1}(s_{i}^{2}, s_{i}^{2})= D > \pi ^{1}(s_{i}^{1}, s_{i}^{2})= Z$ & $\pi ^{2}(s_{i}^{2}, s_{i}^{2})= D > \pi ^{2}(s_{i}^{2}, s_{i}^{1})= Z$ $\Rightarrow \; s^{4}$ is NE.
The outcome $s^{4}= (Defect, Defect )$ thus constitutes an equilibrium in dominant strategies (EDS) and a unique NE for the stage IPD game.
[Part 2: Having established that (Defect, Defect) is an NE of the stage game, we suppose that both firms have played the IPD game in $T-1$ periods and they are ready to play for one last time in period $T$. At this point, the game is identical to the stage game and firm $i\in N$ plays its dominant strategy Defect (refer to the best response function of firm $i$ in Eq. 34). Therefore, the outcome of the game is the NE of the stage game (Defect, Defect). Now consider the game in period $T-1$. Both firms know that following this period, they will have one game to play and the outcome of the game involves both playing Defect. Again, at this period, both firms will play their dominant strategy resulting in the outcome (Defect, Defect). Using backward induction, we note that at each period $T-2, T-3, \ldots ,1$, the outcome where both firms play Defect will result hence SPE. $\square $

Appendix 2: cooperation under exponential discounting

Consider Case 1 and Case 2 defined in Sect. 4. We prove that the outcome (Cooperate, Cooperate) is SPE under exponential discounting if $\delta \ge \frac{A-C}{A-D}$.

The sum of discounted payoffs under Case 1 is given by:
$$\begin{aligned} C + \delta C + \delta ^{2} C + \ldots \end{aligned}$$
(35)
To find the sum of the series in Eq. 35, we exploit a property of the exponential discount function. Claim: The following sum, $1 + \delta + \delta ^{2} + \ldots $, converges to $\frac{1}{1-\delta }$ if $\delta < 1$.

Proof

Define the partial sums of the series as follows: $s_{1} = 1$, $s_{2} = 1+ \delta $, $s_{3} =1 + \delta + \delta ^{2}$,..., $s_{n} =1 + \delta + \cdots + \delta ^{n-1}$ where $s_{i}$ represents the $i$th partial sum $(i=1,2,\ldots n)$. Multiply $s_{n}$ by $\delta $ and obtain $\delta s_{n} = \delta + \delta ^{2}+ \cdots + \delta ^{n}$. Subtract $\delta s_{n}$ from $s_{n}$ and obtain: $s_{n} - \delta s_{n} = 1 - \delta ^{n}$. Solve for $s_{n}$: $s_{n} = \frac{1 - \delta ^{n}}{1 - \delta },\;\;(\delta \ne 1)$. Finally taking the value for $s_{n}$, note that if $\mid \delta \mid \!<\! 1$ then $\delta ^{n}\!\rightarrow \! 0$ as $n \!\rightarrow \! \infty $ and $s_{n}\!\rightarrow \! \frac{1}{1 -\delta }$. $\square $

From this property, we establish that the sum in Eq. 35 is $C\left( \frac{1}{1-\delta }\right) $
The sum of discounted payoffs under Case 2 is given by:
$$\begin{aligned} A + \delta D + \delta ^{2} D + \cdots = A + D\left( \frac{\delta }{1-\delta }\right) \end{aligned}$$
(36)
For the outcome (Cooperate, Cooperate) to be an SPE, we require that:
$$\begin{aligned} C\left( \frac{1}{1-\delta }\right) \ge A + D\left( \frac{\delta }{1-\delta }\right) \end{aligned}$$
(37)

$$\begin{aligned} \Leftrightarrow \frac{C}{1-\delta }\ge A+ \frac{\delta D}{1-\delta }\Leftrightarrow \frac{C-\delta D}{1-\delta }\ge A \Leftrightarrow C- \delta D \ge A - \delta A \end{aligned}$$

$$\begin{aligned} \Leftrightarrow \delta (A-D)\ge A-C \Leftrightarrow \delta \ge \frac{A-C}{A-D}\square \end{aligned}$$

Appendix 3: cooperation under quasi-hyperbolic discounting

Consider Case 1 and Case 2 defined in Sect. 4. We prove that the outcome (Cooperate, Cooperate) is SPE under quasi-hyperbolic discounting if $\delta \ge \frac{A-C}{\beta (C-D)+A-C}$.

The sum of discounted payoffs under Case 1 is given by:
$$\begin{aligned} C +\beta \delta C + \beta \delta ^{2} C + \cdots \end{aligned}$$
(38)
Similarly, we exploit the convergence property of exponential discounting to determine this sum. The sum in Eq. 38 is thus:
$$\begin{aligned} \beta C\left( \frac{\delta }{1-\delta }\right) + C \end{aligned}$$
The sum of discounted payoffs under Case 2 is given by:
$$\begin{aligned} A +\beta \delta D + \beta \delta ^{2} D + \cdots \end{aligned}$$
(39)
The sum in Eq. 39 is:
$$\begin{aligned} \beta D\left( \frac{\delta }{1-\delta }\right) + A \end{aligned}$$
For the outcome (Cooperate, Cooperate) to be an SPE, we require that:
$$\begin{aligned} \beta C\left( \frac{\delta }{1-\delta }\right) + C \ge \beta D\left( \frac{\delta }{1-\delta }\right) + A \end{aligned}$$
(40)

$$\begin{aligned} \Leftrightarrow \frac{\delta (\beta C - \beta D)}{1-\delta }\ge A-C \Leftrightarrow \delta (\beta C - \beta D)\ge (A-C) (1-\delta ) \end{aligned}$$

$$\begin{aligned} \Leftrightarrow \delta (\beta C- \beta D + A - C) \ge A- C \Leftrightarrow \delta \ge \frac{A-C}{\beta (C-D)+ A-C}\square \end{aligned}$$

Appendix 4: analysis of the break-even quasi-hyperbolic discount factor

We show that the following relation in Eq. 20 holds:

$$\begin{aligned} \frac{d}{d\beta }\; \delta ^{*}(\beta )<0 \end{aligned}$$

Define $(A-C)$ as $\alpha $ and $(C-D)$ as $\gamma $ in Eq. 19 and re-write $\delta ^{*}(\beta )$ as follows:

$$\begin{aligned} \delta ^{*}(\beta ) = \frac{A-C}{\beta (C-D) +A-C}= \frac{\alpha }{\beta \gamma + \alpha } \end{aligned}$$

Differentiating $\delta ^{*}(\beta )$ with respect to $\beta $, we obtain:

$$\begin{aligned} \frac{d}{d\beta }\left( \frac{\alpha }{\beta \gamma + \alpha }\right) = - \frac{\alpha \gamma }{(\beta \gamma + \alpha )^{2}} \end{aligned}$$

From the model description in Sect. 4, we have that $A>\;C>\;D$ implying $\alpha >0$ and $\gamma >0$:

$$\begin{aligned} \alpha = \underbrace{(A - C)}_{ +}\;\;\;\gamma = \underbrace{(C - D)}_{ +} \;\;\;\Rightarrow (A-C)(C-D)\;>\;0 \end{aligned}$$

Therefore, we establish the result in Eq. 20:

$$\begin{aligned} - \frac{\alpha \gamma }{(\beta \gamma + \alpha )^{2}}\;<\;0\;\Leftrightarrow - \frac{(A-C)(C-D)}{(\beta (C-D) + (A-C))^{2}}\;<\;0 \end{aligned}$$

Similarly, we show that the following relation holds for the second order derivative:

$$\begin{aligned} \frac{d^{2}}{d\beta ^{2}}\;\delta ^{*}(\beta )> 0 \end{aligned}$$

$$\begin{aligned} \frac{d^{2}}{d\beta ^{2}}\left( \frac{A - C}{\beta (C - D) + A - C}\right) = \frac{d}{d\beta }\;\left( \frac{-\alpha \gamma }{(\beta \gamma + \alpha )^{2}}\right) =-\alpha \gamma \;\frac{d}{d\beta }\left( \frac{1}{(\beta \gamma + \alpha )^{2}}\right) \end{aligned}$$

$$\begin{aligned} = -\alpha \gamma \;\cdot \;\frac{-\;2\gamma (\beta \gamma + \alpha )}{(\beta \gamma + \alpha )^{4}}= 2\alpha \gamma ^{2}\;\frac{1}{(\beta \gamma + \alpha )^{3}}\;>\;0 \end{aligned}$$

$$\begin{aligned} \Leftrightarrow \;\frac{2(A-C)(C-D)^{2}}{(\beta (C-D) + (A-C))^{3}}\;>\;0 \end{aligned}$$

$\square $

Rights and permissions

Reprints and permissions

About this article

Cite this article

Musau, A. Hyperbolic discount curves: a reply to Ainslie. Theory Decis 76, 9–30 (2014). https://doi.org/10.1007/s11238-013-9361-8

Download citation

Published: 24 March 2013
Issue Date: January 2014
DOI: https://doi.org/10.1007/s11238-013-9361-8

Keywords

JEL Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Hyperbolic discount curves: a reply to Ainslie

Abstract

Access this article

Similar content being viewed by others

The Term Structure of Psychological Discount Rate: Characteristics and Functional Forms

Remembering Kenneth Arrow: discount rates

Intertemporal Choice

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix 1: finitely iterated prisoners’ dilemma

Appendix 2: cooperation under exponential discounting

Proof

Appendix 3: cooperation under quasi-hyperbolic discounting

Appendix 4: analysis of the break-even quasi-hyperbolic discount factor

Rights and permissions

About this article

Cite this article

Keywords

JEL Classification

Navigation

Hyperbolic discount curves: a reply to Ainslie

Abstract

Access this article

Similar content being viewed by others

The Term Structure of Psychological Discount Rate: Characteristics and Functional Forms

Remembering Kenneth Arrow: discount rates

Intertemporal Choice

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix 1: finitely iterated prisoners’ dilemma

Appendix 2: cooperation under exponential discounting

Proof

Appendix 3: cooperation under quasi-hyperbolic discounting

Appendix 4: analysis of the break-even quasi-hyperbolic discount factor

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation