Bandit Problems

16 Pages Posted: 19 Jan 2006

See all articles by Dirk Bergemann

Dirk Bergemann

Yale University - Cowles Foundation - Department of Economics; Yale University - Cowles Foundation

Juuso Valimaki

Helsinki School of Economics; University of Southampton - Division of Economics

Date Written: January 2006

Abstract

We survey the literature on multi-armed bandit models and their applications in economics. The multi-armed bandit problem is a statistical decision model of an agent trying to optimize his decisions while improving his information at the same time. This classic problem has received much attention in economics as it concisely models the trade-off between exploration (trying out each arm to find the best one) and exploitation (playing the arm believed to give the best payoff).

Keywords: One-Armed Bandit, Multi-Armed Bandit, Bayesian Learning, Experimentation, Index Policy, Matching, Experience Goods

JEL Classification: C72, C73, D43, D83

Suggested Citation

Bergemann, Dirk and Valimaki, Juuso, Bandit Problems (January 2006). Cowles Foundation Discussion Paper No. 1551. Available at SSRN: https://ssrn.com/abstract=877173

Dirk Bergemann (Contact Author)

Yale University - Cowles Foundation - Department of Economics ( email )

28 Hillhouse Ave
New Haven, CT 06520-8268
United States
203-432-3592 (Phone)
203-432-2128 (Fax)

HOME PAGE: http://www.econ.yale.edu/~dirk/

Yale University - Cowles Foundation

Box 208281
New Haven, CT 06520-8281
United States

Juuso Valimaki

Helsinki School of Economics ( email )

P.O. Box 21210
Helsinki 00100, 00101
Finland

University of Southampton - Division of Economics ( email )

Southampton, SO17 1BJ
United Kingdom
+44 23 8059 3263 (Phone)

Register to save articles to
your library

Register

Paper statistics

Downloads
1,163
Abstract Views
3,387
rank
17,013
PlumX Metrics