Cargando…

Stock Index Spot–Futures Arbitrage Prediction Using Machine Learning Models

With the development of quantitative finance, machine learning methods used in the financial fields have been given significant attention among researchers, investors, and traders. However, in the field of stock index spot–futures arbitrage, relevant work is still rare. Furthermore, existing work is...

Descripción completa

Detalles Bibliográficos
Autores principales: Sheng, Yankai, Ma, Ding
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9601484/
https://www.ncbi.nlm.nih.gov/pubmed/37420482
http://dx.doi.org/10.3390/e24101462
_version_ 1784817076998766592
author Sheng, Yankai
Ma, Ding
author_facet Sheng, Yankai
Ma, Ding
author_sort Sheng, Yankai
collection PubMed
description With the development of quantitative finance, machine learning methods used in the financial fields have been given significant attention among researchers, investors, and traders. However, in the field of stock index spot–futures arbitrage, relevant work is still rare. Furthermore, existing work is mostly retrospective, rather than anticipatory of arbitrage opportunities. To close the gap, this study uses machine learning approaches based on historical high-frequency data to forecast spot–futures arbitrage opportunities for the China Security Index (CSI) 300. Firstly, the possibility of spot–futures arbitrage opportunities is identified through econometric models. Then, Exchange-Traded-Fund (ETF)-based portfolios are built to fit the movements of CSI 300 with the least tracking errors. A strategy consisting of non-arbitrage intervals and unwinding timing indicators is derived and proven profitable in a back-test. In forecasting, four machine learning methods are adopted to predict the indicator we acquired, namely Least Absolute Shrinkage and Selection Operator (LASSO), Extreme Gradient Boosting (XGBoost), Back Propagation Neural Network (BPNN), and Long Short-Term Memory neural network (LSTM). The performance of each algorithm is compared from two perspectives. One is an error perspective based on the Root-Mean-Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), and goodness of fit (R(2)). Another is a return perspective based on the trade yield and the number of arbitrage opportunities captured. Finally, a performance heterogeneity analysis is conducted based on the separation of bull and bear markets. The results show that LSTM outperforms all other algorithms over the entire time period, with an RMSE of 0.00813, MAPE of 0.70 percent, R(2) of 92.09 percent, and an arbitrage return of 58.18 percent. Meanwhile, in certain market conditions, namely both the bull market and bear market separately with a shorter period, LASSO can outperform.
format Online
Article
Text
id pubmed-9601484
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96014842022-10-27 Stock Index Spot–Futures Arbitrage Prediction Using Machine Learning Models Sheng, Yankai Ma, Ding Entropy (Basel) Article With the development of quantitative finance, machine learning methods used in the financial fields have been given significant attention among researchers, investors, and traders. However, in the field of stock index spot–futures arbitrage, relevant work is still rare. Furthermore, existing work is mostly retrospective, rather than anticipatory of arbitrage opportunities. To close the gap, this study uses machine learning approaches based on historical high-frequency data to forecast spot–futures arbitrage opportunities for the China Security Index (CSI) 300. Firstly, the possibility of spot–futures arbitrage opportunities is identified through econometric models. Then, Exchange-Traded-Fund (ETF)-based portfolios are built to fit the movements of CSI 300 with the least tracking errors. A strategy consisting of non-arbitrage intervals and unwinding timing indicators is derived and proven profitable in a back-test. In forecasting, four machine learning methods are adopted to predict the indicator we acquired, namely Least Absolute Shrinkage and Selection Operator (LASSO), Extreme Gradient Boosting (XGBoost), Back Propagation Neural Network (BPNN), and Long Short-Term Memory neural network (LSTM). The performance of each algorithm is compared from two perspectives. One is an error perspective based on the Root-Mean-Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), and goodness of fit (R(2)). Another is a return perspective based on the trade yield and the number of arbitrage opportunities captured. Finally, a performance heterogeneity analysis is conducted based on the separation of bull and bear markets. The results show that LSTM outperforms all other algorithms over the entire time period, with an RMSE of 0.00813, MAPE of 0.70 percent, R(2) of 92.09 percent, and an arbitrage return of 58.18 percent. Meanwhile, in certain market conditions, namely both the bull market and bear market separately with a shorter period, LASSO can outperform. MDPI 2022-10-13 /pmc/articles/PMC9601484/ /pubmed/37420482 http://dx.doi.org/10.3390/e24101462 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sheng, Yankai
Ma, Ding
Stock Index Spot–Futures Arbitrage Prediction Using Machine Learning Models
title Stock Index Spot–Futures Arbitrage Prediction Using Machine Learning Models
title_full Stock Index Spot–Futures Arbitrage Prediction Using Machine Learning Models
title_fullStr Stock Index Spot–Futures Arbitrage Prediction Using Machine Learning Models
title_full_unstemmed Stock Index Spot–Futures Arbitrage Prediction Using Machine Learning Models
title_short Stock Index Spot–Futures Arbitrage Prediction Using Machine Learning Models
title_sort stock index spot–futures arbitrage prediction using machine learning models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9601484/
https://www.ncbi.nlm.nih.gov/pubmed/37420482
http://dx.doi.org/10.3390/e24101462
work_keys_str_mv AT shengyankai stockindexspotfuturesarbitragepredictionusingmachinelearningmodels
AT mading stockindexspotfuturesarbitragepredictionusingmachinelearningmodels