Cargando…
Novel modelling strategies for high-frequency stock trading data
Full electronic automation in stock exchanges has recently become popular, generating high-frequency intraday data and motivating the development of near real-time price forecasting methods. Machine learning algorithms are widely applied to mid-price stock predictions. Processing raw data as inputs...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9842577/ https://www.ncbi.nlm.nih.gov/pubmed/36687790 http://dx.doi.org/10.1186/s40854-022-00431-9 |
_version_ | 1784870165241921536 |
---|---|
author | Zhang, Xuekui Huang, Yuying Xu, Ke Xing, Li |
author_facet | Zhang, Xuekui Huang, Yuying Xu, Ke Xing, Li |
author_sort | Zhang, Xuekui |
collection | PubMed |
description | Full electronic automation in stock exchanges has recently become popular, generating high-frequency intraday data and motivating the development of near real-time price forecasting methods. Machine learning algorithms are widely applied to mid-price stock predictions. Processing raw data as inputs for prediction models (e.g., data thinning and feature engineering) can primarily affect the performance of the prediction methods. However, researchers rarely discuss this topic. This motivated us to propose three novel modelling strategies for processing raw data. We illustrate how our novel modelling strategies improve forecasting performance by analyzing high-frequency data of the Dow Jones 30 component stocks. In these experiments, our strategies often lead to statistically significant improvement in predictions. The three strategies improve the F1 scores of the SVM models by 0.056, 0.087, and 0.016, respectively. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40854-022-00431-9. |
format | Online Article Text |
id | pubmed-9842577 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-98425772023-01-18 Novel modelling strategies for high-frequency stock trading data Zhang, Xuekui Huang, Yuying Xu, Ke Xing, Li Financ Innov Research Full electronic automation in stock exchanges has recently become popular, generating high-frequency intraday data and motivating the development of near real-time price forecasting methods. Machine learning algorithms are widely applied to mid-price stock predictions. Processing raw data as inputs for prediction models (e.g., data thinning and feature engineering) can primarily affect the performance of the prediction methods. However, researchers rarely discuss this topic. This motivated us to propose three novel modelling strategies for processing raw data. We illustrate how our novel modelling strategies improve forecasting performance by analyzing high-frequency data of the Dow Jones 30 component stocks. In these experiments, our strategies often lead to statistically significant improvement in predictions. The three strategies improve the F1 scores of the SVM models by 0.056, 0.087, and 0.016, respectively. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40854-022-00431-9. Springer Berlin Heidelberg 2023-01-17 2023 /pmc/articles/PMC9842577/ /pubmed/36687790 http://dx.doi.org/10.1186/s40854-022-00431-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Zhang, Xuekui Huang, Yuying Xu, Ke Xing, Li Novel modelling strategies for high-frequency stock trading data |
title | Novel modelling strategies for high-frequency stock trading data |
title_full | Novel modelling strategies for high-frequency stock trading data |
title_fullStr | Novel modelling strategies for high-frequency stock trading data |
title_full_unstemmed | Novel modelling strategies for high-frequency stock trading data |
title_short | Novel modelling strategies for high-frequency stock trading data |
title_sort | novel modelling strategies for high-frequency stock trading data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9842577/ https://www.ncbi.nlm.nih.gov/pubmed/36687790 http://dx.doi.org/10.1186/s40854-022-00431-9 |
work_keys_str_mv | AT zhangxuekui novelmodellingstrategiesforhighfrequencystocktradingdata AT huangyuying novelmodellingstrategiesforhighfrequencystocktradingdata AT xuke novelmodellingstrategiesforhighfrequencystocktradingdata AT xingli novelmodellingstrategiesforhighfrequencystocktradingdata |