Abstract
Ainslie (Theory and Decision, 73, 3–34, 2012) challenges our interpretation of the properties of hyperbolic discount curves in an iterated prisoners’ dilemma (IPD) model. In this reply, we discuss the emergence of hyperbolic discount functions in the behavioral economics literature and evaluate their properties. Furthermore, we present a summarized version of our IPD model and evaluate Ainslie’s points of contention.
Similar content being viewed by others
Notes
In Herrnstein’s experiment, pigeons in an operant chamber could peck at one of two response-keys, each of which was on a variable interval (VI) reinforcement schedule. The experiment used concurrent schedules of intermittent reward (VI-VI).
Many writers in behavioral economics get this point wrong. In fact, no paper until Ainslie (1975) pointed out that the “matching relationship” would be a hyperbola if applied to individual, discrete choices, and thus cause preference reversals.
Ainslie’s hyperbolic function is such that events \(\tau \) periods away are discounted with factor \(\frac{1}{\tau }\).
The function was later used by Laibson (1997) to model intrapersonal dynamic conflict.
Myopia in common usage is near-sightedness. O’Donoghue and Rabin (1999) define the term as “present-biased” time preference in the context of intertemporal choice.
The elasticity of \(D(t)\) with respect to \(t\) represents the ratio of the incremental change of the logarithm of \(D(t)\) with respect to the incremental change of the logarithm of \(t\).
The constant in the denominator of Eq. 1 ensures that the value of the function is equal to 1 if either \(k=0\) or \(t=0\). Otherwise, the value of the function is not defined at this point.
The derivative of \(D(t)\) with respect to \(t\) measures the change in the function as the time delay changes marginally holding the discount rate \(k\) constant.
The existing models primarily utilize \(\beta -\delta \) discounting.
It is usual practice in economics to model a time-inconsistent agent as a sequence of sub-agents, in effect splitting her up on diachronic dimensions (see Ross 2005).
Refer to Ainslie (2012) for a review of the models.
“selves” here representing “oneself in different motivational states”.
To draw an analogy with our earlier discussion, the limited conflict described here is also a feature of interpersonal bargaining where it gives rise to self-enforcing agreements.
Notably, the effect in not observed among the group of non smokers.
In the normal form specification, strategies are equivalent to actions.
The Prisoners’ Dilemma game was originally framed by Merrill Flood and Melvin Dresher working at RAND Corporation in 1950 and later formalized by Albert W. Tucker.
This is an implication of the folk theorem which states that in repeated games, conditional on players’ minimax conditions being satisfied, any outcome is a feasible solution concept.
This result is standard since most folk-theorem analysis employ an exponential discount function.
refer to Sect. 2.3 for a summary of the function.
Streich and Levy (2007) obtain the same conditions for the discount factor when comparing a tit-for-tat strategy versus an always-defect strategy in the same game.
\(\beta \) here reflects the degree of “present-biased” time preferences (refer to Sect. 2.3).
In our case, therefore, one may specify within limits any set of values for \(A, \; C, \; D, \;Z,\) and \(\beta \) and obtain a value for \(\delta \) for which (Cooperate, Cooperate) constitutes an SPE.
In particular, \(\sum \nolimits _{t=0}^\infty \delta ^{t}= \frac{1}{1-\delta }\) and \(\sum \nolimits _{t=1}^\infty \delta ^{t}= \frac{\delta }{1-\delta }\).
References
Ainslie, G. (1975). Specious reward: A behavioral theory of impulsiveness and impulse control. Psychological Bulletin, 82(4), 463–496.
Ainslie, G. (1992). Picoeconomics: The strategic interaction of successive motivational states within the person. New York: Cambridge University Press.
Ainslie, G. (2001). Breakdown of will. New York: Cambridge University Press.
Ainslie, G. (2012). Pure hyperbolic discount curves predict “eyes open” self-control. Theory and Decision, 73, 3–34.
Albuquerque, R., & Hopenhayn, H. A. (2004). Optimal lending contracts and firm dynamics. Review of Economic Studies, 71(2), 285–315.
Bulow, J., & Rogoff, K. (1989). Sovereign debt: Is to forgive to forget? The American Economic Review, 1, 43–50.
Deutsch, M. (1960). Trust, trustworthiness, and the F-scale. Journal of Abnormal and Social Psychology, 61, 138–140.
Fredrick, S., Loewenstein, G., & O’Donoghue, T. (2002). Time discounting and time preference: A critical review. Journal of Economic Literature, 40(2), 351–401.
Frydman, R., & Goldberg, M. D. (2007). Imperfect knowledge economics: Exchange rates and risk. Princeton: Princeton University Press.
Harris, M., & Holmstrom, B. (1982). A theory of wage dynamics. Review of Economic Studies, 44(3), 315–333.
Herrnstein, R. (1961). Relative and absolute strengths of response as a function of frequency of reinforcement. Journal of the Experimental Analysis of Animal Behavior, 4, 267–272.
Hofmeyr, A., Ainslie, G., Charlton, R., & Ross, D. (2010). The relationship between addiction and reward bundling: An experiment comparing smokers and non-smokers. Addiction, 106, 402–409.
Kirby, K. N., & Guastello, B. (2001). Making choices in anticipation of similar future choices can increase self-control. Journal of Experimental Psychology: Applied, 7, 154–164.
La Porta, R., Lopez-de-Silanes, F., Shleifer, A., & Vishny, R. W. (1997). Trust in large organizations. American Economic Review Papers and Proceedings, 87(2), 333–338.
Laibson, D. (1997). Golden eggs and hyperbolic discounting. The Quarterly Journal of Economics, 112(2), 443–477.
Loewenstein, G., & Prelec, D. (1992). Anomalies in intertemporal choice: Evidence and an interpretation. The Quarterly Journal of Economics, 107(2), 573–597.
Mazur, J. E. (1987). An adjustment procedure for studying delayed reinforcement. In M. L. Commons, J. E. Mazur, J. A. Nevin, & H. Rachlin (Eds.), Quantitative analyses of behavior V: The effect of delay and of intervening events on reinforcement value. Hillsdale: Erlbaum.
Musau, A. (2009). Modeling alternatives to exponential discounting. Master thesis, University of Agder. http://brage.bibsys.no/hia/bitstream/URN:NBN:no-bibsys_brage_10519/4/Musau_thesis09.pdf. Accessed 21 March 2012.
O’Donoghue, T., & Rabin, M. (1999). Doing it now or later. American Economic Review, 89(1), 103–124.
Phelps, E. S., & Pollak, R. (1968). On second-best national saving and game-equilibrium growth. The Review of Economic Studies, 35(2), 185–199.
Ross, D. (2005). Economic theory and cognitive science: Microexplanation. Cambridge: MIT Press.
Ross, D. (2010). Economic models of procrastination. In C. Andreou & M. White (Eds.), The thief of time. New York: Oxford University Press.
Rubinstein, A. (1979). Equilibrium in supergames with overtaking criterion. Journal of Economic Theory, 21, 1–9.
Rubinstein, A. (1998). Modeling bounded rationality. Cambridge: MIT Press.
Samuelson, P. A. (1937). A note on measurement of utility. The Review of Economic Studies, 4(2), 155–161.
Shapiro, C., & Stiglitz, J. (1984). Equilibrium unemployment as a worker discipline device. American Economic Review, 74(3), 433–444.
Shy, O. (1995). Industrial organization: Theory and applications. Cambridge: MIT Press.
Streich, P., & Levy, J. S. (2007). Time horizons, discounting, and intertemporal choice. Journal of Conflict Resolution, 51(2), 199–226.
Strotz, R. H. (1956). Myopia and inconsistency in dynamic utility maximization. The Review of Economic Studies, 23(3), 165–180.
Telser, L. G. (1980). A theory of self-enforcing agreements. Journal of Business, 53(1), 27–44.
Thomas, J., & Worrall, T. (1994). Foreign direct investment and the risk of expropriation. Review of Economic Studies, 61, 81–108.
Acknowledgments
I am especially grateful to Jochen Jungeilges for his comments, suggestions, and guidance in relation to the article that this paper references. I also thank Ellen K. Nyhus for numerous discussions on the subject, and George Ainslie for helpful comments. Financial support from the Competence Development Fund of Southern Norway is gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1: finitely iterated prisoners’ dilemma
Consider the IPD model described in Sect. 4 and suppose that the game is repeated \(T\) times in periods \(1, 2,\ldots , T\) where \(T\in \mathbb Z \) s.t. \(1 \le T \ge \infty \). If this is common knowledge, we prove that the game has a unique SPE in which each firm plays Defect at all periods. We split the proof into 2 parts.
-
[Part 1: First, we determine the Nash equilibrium of the stage game using the best response function of firm \(i\in N\):
-
Definition \(D-2\): In a 2 player game, the best response function of player \(i\) is the function \(R^{i}(s_{j})\) that for every given strategy \(s_{j}\) of player \(j\) assigns a strategy \(s_{i}= R^{i}(s_{j})\) that maximizes player \(i\)’s payoff \(\pi ^{i}(s_{i},s_{j})\) (Shy 1995, p. 21).
-
From the model description in Sect. 4, the best response function of firm \(i\in N\) is:
$$\begin{aligned} R^{i}(s_{j}) ={\left\{ \begin{array}{ll} Defect \;\;if\;s_{j} \;=\;Cooperate &{}\\ Defect \;\; if\;s_{j} \;=\; Defect &{} \end{array}\right. } \end{aligned}$$(34) -
Definition \(D-3\): An outcome \(\hat{s} = (\hat{s}^{1}, \hat{s}^{2},\ldots , \hat{s}^{N})\) (where \(\hat{s}^{i}\in S_{i}\) for every \(i= 1,2,\ldots ,N\)) is said to be a Nash equilibrium (NE) if for every player \(i\), \(\pi ^{i}(\hat{s}^{i}, \hat{s}^{\lnot i})\ge \pi ^{i}(s^{i}, \hat{s}^{\lnot i})\) for every \(s^{i}\in S_{i}\) (Shy 1995, p.18).
-
For the outcome \(s^{1}= (s_{i}^{1}, s_{i}^{1})\); \(\pi ^{1}(s_{i}^{1}, s_{i}^{1})= C < \pi ^{1}(s_{i}^{2}, s_{i}^{1})= A\), contradicting \(D-3\; \Rightarrow \; s^{1}\) is not NE.
-
For the outcome \(s^{2}= (s_{i}^{1}, s_{i}^{2})\); \(\pi ^{1}(s_{i}^{1}, s_{i}^{2})= Z < \pi ^{1}(s_{i}^{2}, s_{i}^{2})= D\), contradicting \(D-3\; \Rightarrow \; s^{2}\) is not NE.
-
For the outcome \(s^{3}= (s_{i}^{2}, s_{i}^{1})\); \(\pi ^{2}(s_{i}^{2}, s_{i}^{1})= Z < \pi ^{2}(s_{i}^{2}, s_{i}^{2})= D\), contradicting \(D-3\; \Rightarrow \; s^{3}\) is not NE.
-
For the outcome \(s^{4}= (s_{i}^{2}, s_{i}^{2})\); \(\pi ^{1}(s_{i}^{2}, s_{i}^{2})= D > \pi ^{1}(s_{i}^{1}, s_{i}^{2})= Z\) & \(\pi ^{2}(s_{i}^{2}, s_{i}^{2})= D > \pi ^{2}(s_{i}^{2}, s_{i}^{1})= Z\) \(\Rightarrow \; s^{4}\) is NE.
-
-
The outcome \(s^{4}= (Defect, Defect )\) thus constitutes an equilibrium in dominant strategies (EDS) and a unique NE for the stage IPD game.
-
[Part 2: Having established that (Defect, Defect) is an NE of the stage game, we suppose that both firms have played the IPD game in \(T-1\) periods and they are ready to play for one last time in period \(T\). At this point, the game is identical to the stage game and firm \(i\in N\) plays its dominant strategy Defect (refer to the best response function of firm \(i\) in Eq. 34). Therefore, the outcome of the game is the NE of the stage game (Defect, Defect). Now consider the game in period \(T-1\). Both firms know that following this period, they will have one game to play and the outcome of the game involves both playing Defect. Again, at this period, both firms will play their dominant strategy resulting in the outcome (Defect, Defect). Using backward induction, we note that at each period \(T-2, T-3, \ldots ,1\), the outcome where both firms play Defect will result hence SPE. \(\square \)
Appendix 2: cooperation under exponential discounting
Consider Case 1 and Case 2 defined in Sect. 4. We prove that the outcome (Cooperate, Cooperate) is SPE under exponential discounting if \(\delta \ge \frac{A-C}{A-D}\).
-
The sum of discounted payoffs under Case 1 is given by:
$$\begin{aligned} C + \delta C + \delta ^{2} C + \ldots \end{aligned}$$(35)To find the sum of the series in Eq. 35, we exploit a property of the exponential discount function. Claim: The following sum, \(1 + \delta + \delta ^{2} + \ldots \), converges to \(\frac{1}{1-\delta }\) if \(\delta < 1\).
Proof
Define the partial sums of the series as follows: \(s_{1} = 1\), \(s_{2} = 1+ \delta \), \(s_{3} =1 + \delta + \delta ^{2}\),..., \(s_{n} =1 + \delta + \cdots + \delta ^{n-1}\) where \(s_{i}\) represents the \(i\)th partial sum \((i=1,2,\ldots n)\). Multiply \(s_{n}\) by \(\delta \) and obtain \(\delta s_{n} = \delta + \delta ^{2}+ \cdots + \delta ^{n}\). Subtract \(\delta s_{n}\) from \(s_{n}\) and obtain: \(s_{n} - \delta s_{n} = 1 - \delta ^{n}\). Solve for \(s_{n}\): \(s_{n} = \frac{1 - \delta ^{n}}{1 - \delta },\;\;(\delta \ne 1)\). Finally taking the value for \(s_{n}\), note that if \(\mid \delta \mid \!<\! 1\) then \(\delta ^{n}\!\rightarrow \! 0\) as \(n \!\rightarrow \! \infty \) and \(s_{n}\!\rightarrow \! \frac{1}{1 -\delta }\). \(\square \)
-
From this property, we establish that the sum in Eq. 35 is \(C\left( \frac{1}{1-\delta }\right) \)
-
The sum of discounted payoffs under Case 2 is given by:
$$\begin{aligned} A + \delta D + \delta ^{2} D + \cdots = A + D\left( \frac{\delta }{1-\delta }\right) \end{aligned}$$(36) -
For the outcome (Cooperate, Cooperate) to be an SPE, we require that:
$$\begin{aligned} C\left( \frac{1}{1-\delta }\right) \ge A + D\left( \frac{\delta }{1-\delta }\right) \end{aligned}$$(37)$$\begin{aligned} \Leftrightarrow \frac{C}{1-\delta }\ge A+ \frac{\delta D}{1-\delta }\Leftrightarrow \frac{C-\delta D}{1-\delta }\ge A \Leftrightarrow C- \delta D \ge A - \delta A \end{aligned}$$$$\begin{aligned} \Leftrightarrow \delta (A-D)\ge A-C \Leftrightarrow \delta \ge \frac{A-C}{A-D}\square \end{aligned}$$
Appendix 3: cooperation under quasi-hyperbolic discounting
Consider Case 1 and Case 2 defined in Sect. 4. We prove that the outcome (Cooperate, Cooperate) is SPE under quasi-hyperbolic discounting if \(\delta \ge \frac{A-C}{\beta (C-D)+A-C}\).
-
The sum of discounted payoffs under Case 1 is given by:
$$\begin{aligned} C +\beta \delta C + \beta \delta ^{2} C + \cdots \end{aligned}$$(38)Similarly, we exploit the convergence property of exponential discounting to determine this sum. The sum in Eq. 38 is thus:
$$\begin{aligned} \beta C\left( \frac{\delta }{1-\delta }\right) + C \end{aligned}$$ -
The sum of discounted payoffs under Case 2 is given by:
$$\begin{aligned} A +\beta \delta D + \beta \delta ^{2} D + \cdots \end{aligned}$$(39)The sum in Eq. 39 is:
$$\begin{aligned} \beta D\left( \frac{\delta }{1-\delta }\right) + A \end{aligned}$$ -
For the outcome (Cooperate, Cooperate) to be an SPE, we require that:
$$\begin{aligned} \beta C\left( \frac{\delta }{1-\delta }\right) + C \ge \beta D\left( \frac{\delta }{1-\delta }\right) + A \end{aligned}$$(40)$$\begin{aligned} \Leftrightarrow \frac{\delta (\beta C - \beta D)}{1-\delta }\ge A-C \Leftrightarrow \delta (\beta C - \beta D)\ge (A-C) (1-\delta ) \end{aligned}$$$$\begin{aligned} \Leftrightarrow \delta (\beta C- \beta D + A - C) \ge A- C \Leftrightarrow \delta \ge \frac{A-C}{\beta (C-D)+ A-C}\square \end{aligned}$$
Appendix 4: analysis of the break-even quasi-hyperbolic discount factor
We show that the following relation in Eq. 20 holds:
Define \((A-C)\) as \(\alpha \) and \((C-D)\) as \(\gamma \) in Eq. 19 and re-write \(\delta ^{*}(\beta )\) as follows:
Differentiating \(\delta ^{*}(\beta )\) with respect to \(\beta \), we obtain:
From the model description in Sect. 4, we have that \(A>\;C>\;D\) implying \(\alpha >0\) and \(\gamma >0\):
Therefore, we establish the result in Eq. 20:
Similarly, we show that the following relation holds for the second order derivative:
\(\square \)
Rights and permissions
About this article
Cite this article
Musau, A. Hyperbolic discount curves: a reply to Ainslie. Theory Decis 76, 9–30 (2014). https://doi.org/10.1007/s11238-013-9361-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11238-013-9361-8