Negatively Correlated Bandits
31 Pages Posted: 2 Dec 2008
There are 2 versions of this paper
Negatively Correlated Bandits
Date Written: October 2008
Abstract
We analyze a two-player game of strategic experimentation with two-armed bandits. Each player has to decide in continuous time whether to use a safe arm with a known payoff or a risky arm whose likelihood of delivering payoffs is initially unknown. The quality of the risky arms is perfectly negatively correlated between players. In marked contrast to the case where both risky arms are of the same type, we find that learning will be complete in any Markov perfect equilibrium if the stakes exceed a certain threshold, and that all equilibria are in cutoff strategies. For low stakes, the equilibrium is unique, symmetric, and coincides with the planner's solution. For high stakes, the equilibrium is unique, symmetric, and tantamount to myopic behavior. For intermediate stakes, there is a continuum of equilibria.
Keywords: Bayesian Learning, Exponential Distribution, Markov Perfect Equilibrium, Poisson Process, Strategic Experimentation, Two-Armed Bandit
JEL Classification: C73, D83, O32
Suggested Citation: Suggested Citation
Do you have a job opening that you would like to promote on SSRN?
Recommended Papers
-
The Financing of Innovation: Learning and Stopping
By Dirk Bergemann and Ulrich Hege
-
Gradualism and Irreversibility
By Ben Lockwood and Jonathan Thomas
-
Strategic Experimentation with Exponential Bandits
By Martin Cripps, Godfrey Keller, ...
-
Strategic Experimentation with Exponential Bandits
By Martin Cripps, Godfrey Keller, ...
-
Strategic Experimentation with Poisson Bandits
By Godfrey Keller and Sven Rady
-
Strategic Experimentation with Poisson Bandits
By Godfrey Keller and Sven Rady
-
By Dirk Bergemann and Juuso Valimaki
-
Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly
By Godfrey Keller and Sven Rady
-
On the Smoothness of Value Functions and the Existence of Optimal Strategies