Machine Learning for Realised Volatility Forecasting
54 Pages Posted: 12 Oct 2020 Last revised: 10 Nov 2021
Date Written: October 8, 2020
This paper compares machine learning and HAR models for forecasting realised volatility of 23 NASDAQ stocks using 146 variables extracted from limit order book (LOBSTER) and stock-specific news (Dow Jones Newswires) from 27 July 2007 to 18 November 2016. We find simpler ML to outperform HARs on normal volatility days. With SHAP, an Explainable AI technique, we find simple mid prices at all limit order book levels and mean bid/ask prices drive RV forecasts for many stocks. An ML model with a larger number of units and complex idiosyncratic LOB variables are needed for forecasting volatility jumps.
Keywords: Realised Volatility Forecasting, Machine Learning, Long Short-Term Memory, Heterogeneous AutoRegressive models, Explainable AI, Limit Order Book Data, Dow Jones Newswires, Big Data
JEL Classification: C22, C45, C51, C53, C55, C58
Suggested Citation: Suggested Citation