Dynamic Pricing in a Non-stationary Growing Environment
34 Pages Posted: 22 Jul 2020 Last revised: 24 Aug 2021
Date Written: August 23, 2021
Abstract
Many emerging businesses and services encounter growing environments where the stochastic demand gradually increases. The increases are often not only in the expectation of the demand but also in the variance. Because of the growing environments, sequential decision making problems raise additional challenges when model parameters are unknown and need to be dynamically learned from sequentially observed data. In this work, we use a single-product dynamic pricing problem to illustrate how the non-stationary growing environment influences policy design and policy performance such as regret. We prove matching upper and lower bounds on regret and design near-optimal pricing policies. We then demonstrate how the growth rate of demand variance affects the best achievable policy performance as well as the near-optimal policy design. In the analysis, we also prove that that whether the seller knows the length of time horizon in advance or not render different optimal regret orders.
Keywords: online learning, dynamic pricing, non-stationary, growing, optimal regret
Suggested Citation: Suggested Citation