Shape-Constrained Estimation of Value Functions
41 Pages Posted: 1 Jan 2014
Date Written: December 25, 2013
Abstract
We present a fully nonparametric method to estimate the value function, via simulation, in the context of expected infinite-horizon discounted rewards for Markov chains. Estimating such value functions plays an important role in approximate dynamic programming. We incorporate “soft information” into the estimation algorithm, such as knowledge of convexity, monotonicity, or Lipchitz constants. In the presence of such information, a nonparametric estimator for the value function can be computed that is provably consistent as the simulated time horizon tends to infinity. As an application, we implement our method on price tolling agreement contracts in energy markets.
Keywords: value function, dynamic programing, convexity, convex regression, Monte Carlo Methods, Harris Markov chains
JEL Classification: C44, C15 ,C14, C63
Suggested Citation: Suggested Citation