Doubly High-Dimensional Contextual Bandits: An Interpretable Model for Joint Assortment-Pricing

61 Pages Posted: 14 Sep 2023

See all articles by Junhui Cai

Junhui Cai

University of Notre Dame - Mendoza College of Business

Ran Chen

Massachusetts Institute of Technology (MIT)

Martin J. Wainwright

University of California, Berkeley - Department of Statistics

Linda Zhao

University of Pennsylvania - Statistics Department

Date Written: September 11, 2023

Abstract

Key challenges in running a retail business include how to select products to present to consumers (the assortment problem), and how to price products (the pricing problem) to maximize revenue or profit. Instead of considering these problems in isolation, we propose a joint approach to assortment-pricing based on contextual bandits. Our model is doubly high-dimensional, in that both context vectors and actions are allowed to take values in high-dimensional spaces. In order to circumvent the curse of dimensionality, we propose a simple yet flexible model that captures the interactions between covariates and actions via a (near) low-rank representation matrix. The resulting class of models is reasonably expressive while remaining interpretable through latent factors, and includes various structured linear bandit and pricing models as particular cases. We propose a computationally tractable procedure that combines an exploration/exploitation protocol with an efficient low-rank matrix estimator, and we prove bounds on its regret. Simulation results show that this method has lower regret than state-of-the-art methods applied to various standard bandit and pricing models. Real-world case studies on the assortment-pricing problem, from an industry-leading instant noodles company to an emerging beauty start-up, underscore the gains achievable using our method. In each case, we show at least three-fold gains in revenue or profit by our bandit method, as well as the interpretability of the latent factor models that are learned.

Keywords: contextual bandits, on-line decision-making, high-dimensional statistics, low-rank matrices, factor models.

Suggested Citation

Cai, Junhui and Chen, Ran and Wainwright, Martin J. and Zhao, Linda, Doubly High-Dimensional Contextual Bandits: An Interpretable Model for Joint Assortment-Pricing (September 11, 2023). The Wharton School Research Paper, Available at SSRN: https://ssrn.com/abstract=4568525 or http://dx.doi.org/10.2139/ssrn.4568525

Junhui Cai (Contact Author)

University of Notre Dame - Mendoza College of Business

Mendoza college of business
University of Notre Dame
Notre Dame, IN 46556-5646
United States

Ran Chen

Massachusetts Institute of Technology (MIT) ( email )

77 Massachusetts Avenue
50 Memorial Drive
Cambridge, MA 02139-4307
United States

Martin J. Wainwright

University of California, Berkeley - Department of Statistics ( email )

United States

Linda Zhao

University of Pennsylvania - Statistics Department ( email )

Wharton School
Philadelphia, PA 19104
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
111
Abstract Views
522
Rank
521,680
PlumX Metrics