Machine Learning for Realised Volatility Forecasting

Eghbal Rahimikia; Ser-Huang Poon

doi:10.2139/ssrn.3707796

Machine Learning for Realised Volatility Forecasting

Research output: Preprint/Working paper › Working paper

Abstract

We assess the predictive power of machine learning (ML) models for forecasting realised volatility (RV) using information from HAR model variables, limit order book (LOB) data, and news sentiment. Training and robustness checks on nearly seven million ML models show that high-dimensional ML models outperform HAR models in 90% of the out-of-sample period, except during extreme volatility. Explainable AI identifies mid prices, mean bids, and mean asks as key predictors. Notably, incorporating ML into ensemble frameworks enhances HAR model performance, though caution is needed when using ML models as direct substitutes, since they may yield unreliable forecasts under certain market conditions.

Original language	English
Publisher	Social Science Research Network
Number of pages	83
DOIs	https://doi.org/10.2139/ssrn.3707796
Publication status	Published - 12 Oct 2020

Keywords

Realised Volatility Forecasting
Machine Learning
Long Short-Term Memory
Heterogeneous AutoRegressive (HAR) Models
Limit Order Book (LOB) Data
Dow Jones Corporate News
Big Data

Research Beacons, Institutes and Platforms

Institute for Data Science and AI

Access to Document

10.2139/ssrn.3707796

Cite this

@techreport{b5bfaa97d7b6466ead4246df4ca0afdc,

title = "Machine Learning for Realised Volatility Forecasting",

abstract = "We assess the predictive power of machine learning (ML) models for forecasting realised volatility (RV) using information from HAR model variables, limit order book (LOB) data, and news sentiment. Training and robustness checks on nearly seven million ML models show that high-dimensional ML models outperform HAR models in 90\% of the out-of-sample period, except during extreme volatility. Explainable AI identifies mid prices, mean bids, and mean asks as key predictors. Notably, incorporating ML into ensemble frameworks enhances HAR model performance, though caution is needed when using ML models as direct substitutes, since they may yield unreliable forecasts under certain market conditions. ",

keywords = "Realised Volatility Forecasting, Machine Learning, Long Short-Term Memory, Heterogeneous AutoRegressive (HAR) Models, Limit Order Book (LOB) Data, Dow Jones Corporate News, Big Data",

author = "Eghbal Rahimikia and Ser-Huang Poon",

year = "2020",

month = oct,

day = "12",

doi = "10.2139/ssrn.3707796",

language = "English",

publisher = "Social Science Research Network",

address = "United Kingdom",

type = "WorkingPaper",

institution = "Social Science Research Network",

}

TY - UNPB

T1 - Machine Learning for Realised Volatility Forecasting

AU - Rahimikia, Eghbal

AU - Poon, Ser-Huang

PY - 2020/10/12

Y1 - 2020/10/12

N2 - We assess the predictive power of machine learning (ML) models for forecasting realised volatility (RV) using information from HAR model variables, limit order book (LOB) data, and news sentiment. Training and robustness checks on nearly seven million ML models show that high-dimensional ML models outperform HAR models in 90% of the out-of-sample period, except during extreme volatility. Explainable AI identifies mid prices, mean bids, and mean asks as key predictors. Notably, incorporating ML into ensemble frameworks enhances HAR model performance, though caution is needed when using ML models as direct substitutes, since they may yield unreliable forecasts under certain market conditions.

AB - We assess the predictive power of machine learning (ML) models for forecasting realised volatility (RV) using information from HAR model variables, limit order book (LOB) data, and news sentiment. Training and robustness checks on nearly seven million ML models show that high-dimensional ML models outperform HAR models in 90% of the out-of-sample period, except during extreme volatility. Explainable AI identifies mid prices, mean bids, and mean asks as key predictors. Notably, incorporating ML into ensemble frameworks enhances HAR model performance, though caution is needed when using ML models as direct substitutes, since they may yield unreliable forecasts under certain market conditions.

KW - Realised Volatility Forecasting

KW - Machine Learning

KW - Long Short-Term Memory

KW - Heterogeneous AutoRegressive (HAR) Models

KW - Limit Order Book (LOB) Data

KW - Dow Jones Corporate News

KW - Big Data

U2 - 10.2139/ssrn.3707796

DO - 10.2139/ssrn.3707796

M3 - Working paper

BT - Machine Learning for Realised Volatility Forecasting

PB - Social Science Research Network

ER -

Machine Learning for Realised Volatility Forecasting

Abstract

Keywords

Research Beacons, Institutes and Platforms

Access to Document

Fingerprint

Cite this