Comparing Multivariate Distributions: A Novel Approach Using Optimal Transport-based Plots

48 Pages Posted: 25 May 2024

See all articles by Sibsankar Singha

Sibsankar Singha

Tata Institute of Fundamental Research (TIFR)

Marie Kratz

ESSEC Business School, CREAR risk research center

Sreekar Vadlamani

Tata Institute of Fundamental Research (TIFR)

Date Written: April 30, 2024

Abstract

Quantile-Quantile (Q-Q) plots are widely used for assessing the distributional similarity between two datasets. Traditionally, Q-Q plots are constructed for univariate distributions, making them less effective in capturing complex dependencies present in multivariate data. In this paper, we propose a novel approach for constructing multivariate Q-Q plots, which extend the traditional Q-Q plot methodology-to handle high-dimensional data. Our approach utilizes optimal transport (OT) and entropy-regularized optimal transport (EOT) to align the empirical quantiles of the two datasets. Additionally, we introduce another technique based on OT and EOT potentials which can effectively compare two multivariate datasets. Through extensive simulations and real data examples, we demonstrate the effectiveness of our proposed approach in capturing multivariate dependencies and identifying distributional differences such as tail behaviour. We also propose two test statistics based on the Q-Q and potential plots to compare two distributions rigorously.

Keywords: Q-Q plots, multivariate quantile, optimal transport, entropy regularisation, hypothesis testing, geometric quantile, tail behavior

Suggested Citation

Singha, Sibsankar and Kratz, Marie and Vadlamani, Sreekar, Comparing Multivariate Distributions: A Novel Approach Using Optimal Transport-based Plots (April 30, 2024). ESSEC Business School Research Paper, Available at SSRN: https://ssrn.com/abstract=4840410 or http://dx.doi.org/10.2139/ssrn.4840410

Sibsankar Singha (Contact Author)

Tata Institute of Fundamental Research (TIFR) ( email )

STCS, TIFR, 1, Homi Bhabha Road
Colaba
Mumbai, 400005
India

Marie Kratz

ESSEC Business School, CREAR risk research center ( email )

Avenue Bernard Hirsch
BP 50105
CERGY PONTOISE CEDEX 95021
France

Sreekar Vadlamani

Tata Institute of Fundamental Research (TIFR) ( email )

STCS, TIFR, 1, Homi Bhabha Road
Colaba
Mumbai, 400005
India

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
38
Abstract Views
189
PlumX Metrics