Download this Paper Open PDF in Browser

The Big Data Newsvendor: Practical Insights from Machine Learning

55 Pages Posted: 3 Feb 2015 Last revised: 4 Nov 2017

Gah‐Yi Ban

London Business School

Cynthia Rudin

Duke University - Pratt School of Engineering; Duke University

Date Written: November 2, 2017

Abstract

We investigate the data-driven newsvendor problem when one has $n$ observations of $p$ features related to the demand as well as historical demand data. Rather than a two-step process of first estimating a demand distribution then optimizing for the optimal order quantity, we propose solving the ``Big Data'' newsvendor problem via single step machine learning algorithms. Specifically, we propose algorithms based on the Empirical Risk Minimization (ERM) principle, with and without regularization, and an algorithm based on Kernel-weights Optimization (KO). The ERM approaches, equivalent to high-dimensional quantile regression, can be solved by convex optimization problems and the KO approach by a sorting algorithm. We analytically justify the use of features by showing that their omission yields inconsistent decisions. We then derive finite-sample performance bounds on the out-of-sample costs of the feature-based algorithms, which quantify the effects of dimensionality and cost parameters. Our bounds, based on algorithmic stability theory, generalize known analyses for the newsvendor problem without feature information. Finally, we apply the feature-based algorithms for nurse staffing in a hospital emergency room using a data set from a large UK teaching hospital and find that (i) the best KO and ERM algorithms beat the best practice benchmark by 23\% and 24\% respectively in the out-of-sample cost, and (ii) the best KO algorithm is faster than the best ERM algorithm by three orders of magnitude and the best practice benchmark by two orders of magnitude.

Keywords: big data, newsvendor, machine learning, Sample Average Approximation, statistical learning theory, quantile regression

JEL Classification: C44, C61,C80

Suggested Citation

Ban , Gah‐Yi and Rudin, Cynthia, The Big Data Newsvendor: Practical Insights from Machine Learning (November 2, 2017). Available at SSRN: https://ssrn.com/abstract=2559116 or http://dx.doi.org/10.2139/ssrn.2559116

Gah‐Yi Ban (Contact Author)

London Business School ( email )

Sussex Place
Regent's Park
London, London NW1 4SA
United Kingdom

Cynthia Rudin

Duke University - Pratt School of Engineering ( email )

Durham, NC 27708
United States

Duke University ( email )

Department of Computer Science
LSRC Building
Durham, NC 27708-0204
United States

Paper statistics

Downloads
1,097
rank
15,804
Abstract Views
3,636