The Near-Term Liability of Exploitation: Exploration and Exploitation in Multi-Stage Problems

Fang, Christina; Levinthal, Daniel

Download This Paper

Open PDF in Browser

Add Paper to My Library

The Near-Term Liability of Exploitation: Exploration and Exploitation in Multi-Stage Problems

Organization Science, Vol. 20, No. 3, pp. 538-551, 2009

36 Pages Posted: 7 Apr 2008 Last revised: 27 Dec 2011

See all articles by Christina Fang

Christina Fang

New York University (NYU) - Department of Management and Organizational Behavior

Daniel Levinthal

University of Pennsylvania - Management Department

Abstract

The classic tradeoff between exploration and exploitation reflects the tension between gaining new information about alternatives to improve future returns and using the information currently available to improve present returns (March, 1991). By considering these issues in the context of a multi-stage, as opposed to a repeated, problem environment, we show that exploratory behavior has value quite apart from its role in revising beliefs. We show that even if current beliefs provide an unbiased characterization of the problem environment, maximizing with respect to these beliefs may lead to an inferior expected payoff relative to other mechanisms that make less aggressive use of the organization's beliefs. Search can lead to more robust actions in multi-stage decision problems than maximization, a benefit quite apart from its role in the updating of beliefs.

Keywords: Exploration and exploitation, maximization, multi-stage problems, reinforcement learning, softmax choice rule

Suggested Citation: Suggested Citation

Fang, Christina and Levinthal, Daniel A., The Near-Term Liability of Exploitation: Exploration and Exploitation in Multi-Stage Problems. Organization Science, Vol. 20, No. 3, pp. 538-551, 2009, Available at SSRN: https://ssrn.com/abstract=1117082