Abstract
We assess the predictive power of machine learning (ML) models for forecasting realised volatility (RV) using information from HAR model variables, limit order book (LOB) data, and news sentiment. Training and robustness checks on nearly seven million ML models show that high-dimensional ML models outperform HAR models in 90% of the out-of-sample period, except during extreme volatility. Explainable AI identifies mid prices, mean bids, and mean asks as key predictors. Notably, incorporating ML into ensemble frameworks enhances HAR model performance, though caution is needed when using ML models as direct substitutes, since they may yield unreliable forecasts under certain market conditions.
Original language | English |
---|---|
Publisher | Social Science Research Network |
Number of pages | 83 |
DOIs | |
Publication status | Published - 12 Oct 2020 |
Keywords
- Realised Volatility Forecasting
- Machine Learning
- Long Short-Term Memory
- Heterogeneous AutoRegressive (HAR) Models
- Limit Order Book (LOB) Data
- Dow Jones Corporate News
- Big Data
Research Beacons, Institutes and Platforms
- Institute for Data Science and AI