Synthetic Financial Data: An Application to Regulatory Compliance for Broker-Dealers
Journal of Financial Transformation (Forthcoming)
12 Pages. Posted: 23 Sep 2019. Last revised: 23 Apr 2020.
Date Written: September 27, 2019
Big Data hype has not missed investment management, but the reality is that price data from U.S. financial markets are not really Big Data. Price data are Small Data. The fact that sellers and advisors in financial markets use Small Data to generate and test investment strategies creates two major problems. First, the economic mechanisms that generate prices (and therefore returns) may change through time, so that historical data from an earlier period may tell us little or nothing about future prices and returns. Second, even if data-generating mechanisms are somewhat stable through time, inferences about the profitability of investment strategies may be sensitive to a handful of outliers that get picked up again and again in different strategies mined from the same Small Data set. In this article, we present an answer to the financial Small Data problem: using machine-learning methods to generate “synthetic” financial data. The essential part of our approach is to generate data that might have been produced by financial markets but were not. Synthetic price and return data have numerous uses, including testing new investment strategies and helping investors plan for retirement and other personal investment goals with more realistic future return scenarios. In this article, we focus on a particularly important use of synthetic data: meeting legal and regulatory requirements such as best interest and fiduciary requirements.
Keywords: Synthetic Data, Machine Learning, Best Interest, Compliance