Bandit Problems
16 Pages Posted: 19 Jan 2006
Date Written: January 2006
Abstract
We survey the literature on multi-armed bandit models and their applications in economics. The multi-armed bandit problem is a statistical decision model of an agent trying to optimize his decisions while improving his information at the same time. This classic problem has received much attention in economics as it concisely models the trade-off between exploration (trying out each arm to find the best one) and exploitation (playing the arm believed to give the best payoff).
Keywords: One-Armed Bandit, Multi-Armed Bandit, Bayesian Learning, Experimentation, Index Policy, Matching, Experience Goods
JEL Classification: C72, C73, D43, D83
Recommended Papers
-
The Financing of Innovation: Learning and Stopping
By Dirk Bergemann and Ulrich Hege
-
Gradualism and Irreversibility
By Ben Lockwood and Jonathan Thomas
-
Strategic Experimentation with Exponential Bandits
By Martin Cripps, Godfrey Keller, ...
-
Strategic Experimentation with Poisson Bandits
By Godfrey Keller and Sven Rady
-
Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly
By Godfrey Keller and Sven Rady
-
On the Smoothness of Value Functions and the Existence of Optimal Strategies
-
By Nicolas A. Klein and Sven Rady