Relative Entropy in Sequential Decision Problems

Posted: 7 Mar 2012

Ehud Lehrer

Tel Aviv University - School of Mathematical Sciences

Rann Smorodinsky

Technion-Israel Institute of Technology - The William Davidson Faculty of Industrial Engineering & Management

Date Written: June 17, 1999

Abstract

Consider an agent who faces a sequential decision problem. At each stage the agent takes an action and observes a stochastic outcome (e.g., daily prices, weather conditions, or opponents’ actions in a repeated game). The agent’s stage-utility depends on his action, on the observed outcome, and on previous outcomes. We assume the agent is Bayesian and is endowed with a subjective belief over the distribution of outcomes. The agent’s initial belief is typically inaccurate, so his subjectively optimal strategy is initially suboptimal. As time passes, information about the true dynamics accumulates and, depending on how compatible the belief is with the truth, the agent may eventually learn to optimize. We introduce the notion of relative entropy, which is a natural adaptation of the entropy of a stochastic process to the subjective set-up. We present conditions, expressed in terms of relative entropy, that determine whether the agent will eventually learn to optimize. It is shown that low entropy yields asymptotically optimal behavior. In addition, we present a notion of pointwise merging and link it with relative entropy.

Keywords: relative entropy, sequential decision problems, optimization

JEL Classification: D83

Suggested Citation

Lehrer, Ehud and Smorodinsky, Rann, Relative Entropy in Sequential Decision Problems (June 17, 1999). Journal of Mathematical Economics, Vol. 33, 2000, Available at SSRN: https://ssrn.com/abstract=2017428

Ehud Lehrer

Tel Aviv University - School of Mathematical Sciences

Tel Aviv 69978
Israel

Rann Smorodinsky (Contact Author)

Technion-Israel Institute of Technology - The William Davidson Faculty of Industrial Engineering & Management

Haifa 32000
Israel