Shape-Constrained Estimation of Value Functions

41 Pages Posted: 1 Jan 2014

See all articles by Mohammad Mousavi

Mohammad Mousavi

Stanford University - Department of Management Science & Engineering

Peter Glynn

Stanford University

Date Written: December 25, 2013

Abstract

We present a fully nonparametric method to estimate the value function, via simulation, in the context of expected infinite-horizon discounted rewards for Markov chains. Estimating such value functions plays an important role in approximate dynamic programming. We incorporate “soft information” into the estimation algorithm, such as knowledge of convexity, monotonicity, or Lipchitz constants. In the presence of such information, a nonparametric estimator for the value function can be computed that is provably consistent as the simulated time horizon tends to infinity. As an application, we implement our method on price tolling agreement contracts in energy markets.

Keywords: value function, dynamic programing, convexity, convex regression, Monte Carlo Methods, Harris Markov chains

JEL Classification: C44, C15 ,C14, C63

Suggested Citation

Mousavi, Mohammad and Glynn, Peter, Shape-Constrained Estimation of Value Functions (December 25, 2013). Available at SSRN: https://ssrn.com/abstract=2373294 or http://dx.doi.org/10.2139/ssrn.2373294

Mohammad Mousavi (Contact Author)

Stanford University - Department of Management Science & Engineering ( email )

473 Via Ortega
Stanford, CA 94305-9025
United States

Peter Glynn

Stanford University ( email )

Stanford, CA 94305
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
106
Abstract Views
1,273
Rank
486,943
PlumX Metrics