Thousands of Alpha Tests
91 Pages Posted: 17 Oct 2018 Last revised: 26 Mar 2020
Date Written: March 18, 2020
Abstract
Data snooping is a major concern in empirical asset pricing. We develop a new framework to rigorously perform multiple hypothesis testing in linear asset pricing models, while limiting the occurrence of false positive results typically associated with data snooping. By exploiting a variety of machine learning techniques, our multiple-testing procedure is robust to omitted factors and missing data. We also prove its asymptotic validity when the number of tests is large relative to the sample size, as in many finance applications. To improve the finite sample performance, we also provide a wild-bootstrap procedure for inference and prove its validity in this setting. Finally, we illustrate the empirical relevance in the context of hedge fund performance evaluation.
Keywords: Data Snooping, Multiple Testing, Alpha Testing, Factor Models, Hedge Fund Performance, False Discovery Rate, Machine Learning, Missing Data, Wild-Bootstrap, Matrix Completion
Suggested Citation: Suggested Citation