A Backtesting Protocol in the Era of Machine Learning

18 Pages Posted: 13 Nov 2018 Last revised: 24 Nov 2018

See all articles by Robert D. Arnott

Robert D. Arnott

Research Affiliates, LLC

Campbell R. Harvey

Duke University - Fuqua School of Business; National Bureau of Economic Research (NBER); Duke Innovation & Entrepreneurship Initiative

Harry Markowitz

University of California at San Diego

Date Written: November 21, 2018

Abstract

Machine learning offers a set of powerful tools that holds considerable promise for investment management. As with most quantitative applications in finance, the danger of misapplying these techniques can lead to disappointment. One crucial limitation involves data availability. Many of machine learning’s early successes originated in the physical and biological sciences, in which truly vast amounts of data are available. Machine learning applications often require far more data than are available in finance, which is of particular concern in longer-horizon investing. Hence, choosing the right applications before applying the tools is important. In addition, capital markets reflect the actions of people, which may be influenced by others’ actions and by the findings of past research. In many ways, the challenges that affect machine learning are merely a continuation of the long-standing issues researchers have always faced in quantitative finance. While investors need to be cautious—indeed, more cautious than in past applications of quantitative methods—these new tools offer many potential applications in finance. In this article, the authors develop a research protocol that pertains both to the application of machine learning techniques and to quantitative finance in general.

Keywords: Machine Learning, Data Science, Data Mining, Backtesting, Overfitting, Interpretable Classification, Interpretable Policy Design, Trading, Strategies, Anomalies, Selection Bias, Research Protocol

JEL Classification: G11, G14, G17, C11, C58

Suggested Citation

Arnott, Robert D. and Harvey, Campbell R. and Markowitz, Harry, A Backtesting Protocol in the Era of Machine Learning (November 21, 2018). Available at SSRN: https://ssrn.com/abstract=3275654 or http://dx.doi.org/10.2139/ssrn.3275654

Robert D. Arnott

Research Affiliates, LLC ( email )

620 Newport Center Dr
Ste 900
Newport Beach, CA 92660
United States
949-325-8700 (Phone)
949-325-8901 (Fax)

HOME PAGE: http://www.researchaffiliates.com/Our%20Firm/Our%20Team/Pages/Rob-Arnott.aspx

Campbell R. Harvey (Contact Author)

Duke University - Fuqua School of Business ( email )

Box 90120
Durham, NC 27708-0120
United States
919-660-7768 (Phone)
919-660-8030 (Fax)

National Bureau of Economic Research (NBER)

1050 Massachusetts Avenue
Cambridge, MA 02138
United States

Duke Innovation & Entrepreneurship Initiative ( email )

215 Morris St., Suite 300
Durham, NC 27701
United States

Harry Markowitz

University of California at San Diego ( email )

9500 Gilman Drive
La Jolla, CA 92093-0508
United States
(858) 534-3383 (Phone)

Register to save articles to
your library

Register

Paper statistics

Downloads
4,614
rank
1,669
Abstract Views
13,607
PlumX Metrics
!

Under construction: SSRN citations will be offline until July when we will launch a brand new and improved citations service, check here for more details.

For more information