A Regularized High-Dimensional Positive Definite Covariance Estimator with High-Frequency Data
55 Pages Posted: 8 Jul 2019 Last revised: 9 Mar 2023
Date Written: March 4, 2019
Abstract
This paper proposes a novel large-dimensional positive definite covariance estimator for high-frequency data under a general factor model framework. We demonstrate an appealing connection between the proposed estimator and a weighted group-LASSO penalized least-squares estimator. The proposed estimator improves the traditional principal component analysis by allowing for weak factors, whose signal strengths are weak relative to idiosyncratic components. Despite microstructure noises and asynchronous trading, the proposed estimator achieves a guarded positive definiteness without sacrificing the convergence rate. To make our method fully operational, we provide an extended simultaneous alternating direction method of multipliers algorithm to solve the resultant constrained convex minimization problem efficiently. We offer a data-driven algorithm to select involved tuning parameters in practice. Empirically, we study the monthly high-frequency covariance structure of the stock constituents of the S&P 500 index from 2008 to 2016. We use all traded stocks from NYSE, AMEX, and NASDAQ stock markets to construct the high-frequency Fama-French four and extended eleven economic factors. We further examine the out-of-sample performance of the proposed method through vast portfolio allocations, which deliver significantly reduced out-of-sample portfolio risk and enhanced Sharpe ratios. The success of our method justifies the usefulness of machine learning in finance.
Keywords: Covariance estimation, High frequency, Large dimension, Weak factors, Nuclear norm, weighted group LASSO, Vast portfolio evaluation.
JEL Classification: C13, C14, C55, C58, C61, G01
Suggested Citation: Suggested Citation