Strategic Experimentation with Exponential Bandits
40 Pages Posted: 2 May 2003
There are 2 versions of this paper
Strategic Experimentation with Exponential Bandits
Date Written: March 2003
Abstract
This Paper studies a game of strategic experimentation with two-armed bandits whose risky arm might yield a pay-off only after some exponentially distributed random time. Because of free-riding, there is an inefficiently low level of experimentation in any equilibrium where the players use stationary Markovian strategies with posterior beliefs as the state variable. After characterizing the unique symmetric Markovian equilibrium of the game, which is in mixed strategies, we construct a variety of pure-strategy equilibria. There is no equilibrium where all players use simple cut-off strategies. Equilibria where players switch finitely often between the roles of experimenter and free-rider all lead to the same pattern of information acquisition; the efficiency of these equilibria depends on the way players share the burden of experimentation among them. In equilibria where players switch roles infinitely often, they can acquire an approximately efficient amount of information, but the rate at which it is acquired still remains inefficient; moreover, the expected pay-off of an experimenter exhibits the novel feature that it rises as players become more pessimistic. Finally, over the range of beliefs where players use both arms a positive fraction of the time, the symmetric equilibrium is dominated by any asymmetric one in terms of aggregate pay-offs.
Keywords: Strategic experimentation, two-armed bandits, exponential distribution, Bayesian learning, Markov perfect equilibrium, public goods
JEL Classification: C73, D83, H41, O32
Suggested Citation: Suggested Citation
Do you have a job opening that you would like to promote on SSRN?
Recommended Papers
-
The Financing of Innovation: Learning and Stopping
By Dirk Bergemann and Ulrich Hege
-
Gradualism and Irreversibility
By Ben Lockwood and Jonathan Thomas
-
Strategic Experimentation with Exponential Bandits
By Martin Cripps, Godfrey Keller, ...
-
Strategic Experimentation with Poisson Bandits
By Godfrey Keller and Sven Rady
-
Strategic Experimentation with Poisson Bandits
By Godfrey Keller and Sven Rady
-
By Dirk Bergemann and Juuso Valimaki
-
Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly
By Godfrey Keller and Sven Rady
-
On the Smoothness of Value Functions and the Existence of Optimal Strategies
-
By Nicolas A. Klein and Sven Rady