Beyond IID: Data-Driven Decision-Making in Heterogeneous Environments

64 Pages Posted: 27 Jun 2022 Last revised: 15 Sep 2023

See all articles by Omar Besbes

Omar Besbes

Columbia University - Columbia Business School, Decision Risk and Operations

Will Ma

Columbia University - Columbia Business School, Decision Risk and Operations

Omar Mouchtaki

Columbia University - Columbia Business School, Decision Risk and Operations

Date Written: May 26, 2022

Abstract

How should one leverage historical data when past observations are not perfectly indicative of the future, e.g., due to the presence of unobserved confounders which one cannot "correct" for?
Motivated by this question, we study a data-driven decision-making framework in which historical samples are generated from unknown and different distributions assumed to lie in a heterogeneity ball with known radius and centered around the (also) unknown future (out-of-sample) distribution on which the performance of a decision will be evaluated. This work aims at analyzing the performance of central data-driven policies but also near-optimal ones in these heterogeneous environments.
We first establish, for a general class of policies, a new connection between data-driven decision-making and distributionally robust optimization with a regret objective. We then leverage this connection to quantify the performance that is achievable by Sample Average Approximation (SAA) as a function of the radius of the heterogeneity ball: for any integral probability metric, we derive bounds depending on the approximation parameter, a notion which quantifies how the interplay between the heterogeneity and the problem structure impacts the performance of SAA. When SAA is not rate-optimal, we design and analyze problem-dependent policies achieving rate-optimality. We compare achievable guarantees for three widely-studied problems --newsvendor, pricing, and ski rental-- under heterogeneous environments. Our work shows that the type of achievable performance varies considerably across different combinations of problem classes and notions of heterogeneity.

Keywords: data-driven algorithms; robust optimization; distributionally robust optimization; minimax regret; sample average approximation; pricing; newsvendor; ski-rental

JEL Classification: C02, C44, C61

Suggested Citation

Besbes, Omar and Ma, Will and Mouchtaki, Omar, Beyond IID: Data-Driven Decision-Making in Heterogeneous Environments (May 26, 2022). Columbia Business School Research Paper No. 4140928, Available at SSRN: https://ssrn.com/abstract=4140928 or http://dx.doi.org/10.2139/ssrn.4140928

Omar Besbes

Columbia University - Columbia Business School, Decision Risk and Operations ( email )

New York, NY
United States

Will Ma

Columbia University - Columbia Business School, Decision Risk and Operations ( email )

New York, NY
United States

Omar Mouchtaki (Contact Author)

Columbia University - Columbia Business School, Decision Risk and Operations ( email )

New York, NY
United States

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
411
Abstract Views
1,051
Rank
123,934
PlumX Metrics