Closing the Gap: A Learning Algorithm for the Lost-Sales Inventory System with Lead Times

45 Pages Posted: 26 Feb 2017 Last revised: 6 Oct 2018

See all articles by Huanan Zhang

Huanan Zhang

The Pennsylvania State University - The Harold and Inge Marcus Department of Industrial and Manufacturing Engineering

Xiuli Chao

University of Michigan at Ann Arbor - Department of Industrial and Operations Engineering

Cong Shi

University of Michigan at Ann Arbor - Department of Industrial and Operations Engineering

Date Written: March 1, 2018

Abstract

We consider a periodic-review single-product inventory system with lost-sales and positive lead times under censored demand. In contrast to the classical inventory literature, we assume the firm does not know the demand distribution a priori, and makes adaptive inventory ordering decision in each period based only on the past sales (censored demand) data. The standard performance measure is regret, which is the cost difference between a feasible learning algorithm and the clairvoyant (full-information) benchmark. When the benchmark is chosen to be the (full-information) optimal base-stock policy, Huh et al. [Mathematics of Operations Research 34(2): 397-416 (2009)] developed a nonparametric learning algorithm with a cubic-root convergence rate on regret. An important open question is whether there exists a nonparametric learning algorithm whose regret rate matches the theoretical lower bound of any learning algorithms. In this work, we provide an affirmative answer to the above question. More precisely, we propose a new nonparametric algorithm termed the simulated cycle-update policy, and establish a square-root convergence rate on regret, which is proven to be the lower bound of any learning algorithms. Our algorithm uses a random cycle-updating rule based on an auxiliary simulated system running in parallel, and also involves two new concepts, namely, the withheld on-hand inventory and the double-phase cycle gradient estimation. The techniques developed are effective for learning a stochastic system with complex systems dynamics and lasting impact of decisions.

Keywords: inventory, lost-sales, lead time, base-stock policy, censored demand, nonparametric, learning algorithms, regret analysis

JEL Classification: C44

Suggested Citation

Zhang, Huanan and Chao, Xiuli and Shi, Cong, Closing the Gap: A Learning Algorithm for the Lost-Sales Inventory System with Lead Times (March 1, 2018). Available at SSRN: https://ssrn.com/abstract=2922820 or http://dx.doi.org/10.2139/ssrn.2922820

Huanan Zhang

The Pennsylvania State University - The Harold and Inge Marcus Department of Industrial and Manufacturing Engineering ( email )

University Park
State College, PA 16802
United States

Xiuli Chao

University of Michigan at Ann Arbor - Department of Industrial and Operations Engineering ( email )

1205 Beal Avenue
Ann Arbor, MI 48109
United States

Cong Shi (Contact Author)

University of Michigan at Ann Arbor - Department of Industrial and Operations Engineering ( email )

1205 Beal Avenue
Ann Arbor, MI 48109
United States

Here is the Coronavirus
related research on SSRN

Paper statistics

Downloads
373
Abstract Views
1,484
rank
89,733
PlumX Metrics