Strategic Experimentation with Exponential Bandits

Oxford, Economics Discussion Paper No. 143

36 Pages Posted: 7 Apr 2003

See all articles by Martin Cripps

Martin Cripps

Washington University in St. Louis - John M. Olin Business School; University College London - Department of Economics

Godfrey Keller

University of Oxford - Department of Economics

Sven Rady

University of Bonn

Multiple version iconThere are 2 versions of this paper

Date Written: January 2003

Abstract

This paper studies a game of strategic experimentation with two-armed bandits whose risky arm might yield a payoff only after some exponentially distributed random time. Because of free-riding, there is an inefficiently low level of experimentation in any equilibrium where the players use stationary Markovian strategies with posterior beliefs as the state variable. After characterizing the unique symmetric Markovian equilibrium of the game, which is in mixed strategies, we construct a variety of pure-strategy equilibria. There is no equilibrium where all players use simple cut-off strategies. Equilibria where players switch finitely often between the roles of experimenter and free-rider all lead to the same pattern of information acquisition; the efficiency of these equilibria depends on the way players share the burden of experimentation among them. In equilibria where players switch roles infinitely often, they can acquire an approximately efficient amount of information, but the rate at which it is acquired still remains inefficient; moreover, the expected payoff of an experimenter exhibits the novel feature that it rises as players become more pessimistic. Finally, over the range of beliefs where players use both arms a positive fraction of the time, the symmetric equilibrium is dominated by any asymmetric one in terms of aggregate payoffs.

Keywords: Strategic Experimentation, Two-Armed Bandit, Exponential Distribution, Bayesian Learning, Markov Perfect Equilibrium, Public Goods

JEL Classification: C73, D83, H41, O32

Suggested Citation

Cripps, Martin and Cripps, Martin and Keller, Godfrey and Rady, Sven, Strategic Experimentation with Exponential Bandits (January 2003). Oxford, Economics Discussion Paper No. 143, Available at SSRN: https://ssrn.com/abstract=372200 or http://dx.doi.org/10.2139/ssrn.372200

Martin Cripps

Washington University in St. Louis - John M. Olin Business School ( email )

One Brookings Drive
Campus Box 1133
St. Louis, MO 63130-4899
United States
314-935-4580 (Phone)
314-935-6359 (Fax)

University College London - Department of Economics ( email )

Gower Street
London
London, WC1E 6BT
United Kingdom

Godfrey Keller

University of Oxford - Department of Economics ( email )

Manor Road Building
Manor Road
Oxford, OX1 3BJ
United Kingdom
+44 20 1865 281173 (Phone)

Sven Rady (Contact Author)

University of Bonn ( email )

Regina-Pacis-Weg 3
Postfach 2220
Bonn, D-53012
Germany