Multiproduct Inventory Systems with Upgrading: Replenishment, Allocation, and Online Learning

48 Pages Posted: 15 Apr 2024

See all articles by Jingwen Tang

Jingwen Tang

University of Miami Herbert Business School

Izak Duenyas

University of Michigan, Stephen M. Ross School of Business

Cong Shi

University of Miami - Department of Management

Nan Yang

University of Miami - Department of Management

Date Written: April 2, 2024

Abstract

We consider the joint optimization of ordering and upgrading decisions in a dynamic multiproduct system over a finite time horizon of T periods. Multiple types of demand arrive in each period stochastically and can be satisfied with the supply of the same type or some higher-quality product (upgrading). The overall goal is to find an optimal joint replenishment and allocation policy that maximizes the total expected profit in both the setting in which the firm knows the demand distributions a priori and the setting in which the firm needs to learn the demand distributions during the process. We first characterize the structure of the clairvoyant optimal joint ordering and allocation policy. Based on the structure of the optimal policies, we propose a new online learning algorithm termed stochastic gradient descent with perturbed gradient (SGD-PG for short), and prove that the algorithm admits a cumulative regret upper bound of $O(\sqrt{T})$, which matches the lower bound for any learning algorithms. The novelties lie in two aspects: (a) We propose a perturbation-based subroutine to compute a valid sample-path gradient of the profit function with respect to the replenishment decisions. (b) We keep track of the real-time imbalance between supply and demand to carry out the allocation decisions. We also show that SGD-PG can be extended to a nested censored demand scenario. We demonstrate the efficacy of the proposed algorithms in numerical experiments. This work provides practitioners with the optimal policy of inventory replenishment and allocation in a multiproduct system with upgrading. When the demand distribution is unknown, we propose an easy-to-implement and provably-good algorithm for demand learning. In addition, the paper numerically quantifies the value of optimal upgrading and identifies conditions under which upgrading can be the most helpful.

Keywords: multiproduct, ordering, allocation, general upgrading, online learning, censored demand

Suggested Citation

Tang, Jingwen and Duenyas, Izak and Shi, Cong and Yang, Nan, Multiproduct Inventory Systems with Upgrading: Replenishment, Allocation, and Online Learning (April 2, 2024). Available at SSRN: https://ssrn.com/abstract=4781604 or http://dx.doi.org/10.2139/ssrn.4781604

Jingwen Tang (Contact Author)

University of Miami Herbert Business School ( email )

P.O. Box 248126
Florida
Coral Gables, FL 33124
United States

Izak Duenyas

University of Michigan, Stephen M. Ross School of Business ( email )

701 Tappan Street
Ann Arbor, MI MI 48109
United States

Cong Shi

University of Miami - Department of Management ( email )

United States

HOME PAGE: http://https://congshi-research.github.io/

Nan Yang

University of Miami - Department of Management ( email )

United States
305-284-4574 (Phone)

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
152
Abstract Views
700
Rank
412,883
PlumX Metrics