Prediction of stock price movement using an improved NSGA-II-RF algorithm with a three-stage feature engineering process


Bibliographic Details
Main authors: Zeng, Xiaohua; Cai, Jieping; Liang, Changzhou; Yuan, Chiping
Format: Online Article Text
Language: English
Published: Public Library of Science 2023
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10306235/
https://www.ncbi.nlm.nih.gov/pubmed/37379318
http://dx.doi.org/10.1371/journal.pone.0287754
collection PubMed
description Stock price prediction has long been a hot topic in artificial intelligence. In recent years, computational intelligence methods such as machine learning and deep learning have been explored for prediction systems. However, accurately predicting the direction of stock prices remains a major challenge because stock prices are affected by nonlinear, nonstationary, and high-dimensional features. Previous work has largely overlooked feature engineering, and selecting the optimal feature set that affects stock price is a promising solution. This article therefore proposes an improved multi-objective optimization algorithm integrated with random forest (I-NSGA-II-RF), together with a three-stage feature engineering process, to reduce computational complexity and improve the accuracy of the prediction system. The model is optimized in two directions: maximizing accuracy and minimizing the optimal solution set. The I-NSGA-II population is initialized with integrated information from two filter feature selection methods, and a multiple-chromosome hybrid encoding is used to select features and optimize model parameters simultaneously. Finally, the selected feature subset and parameters are passed to the RF for training, prediction, and iterative optimization. Experimental results show that, compared with the unmodified multi-objective feature selection algorithm and the single-objective feature selection algorithm, I-NSGA-II-RF achieves the highest average accuracy, the smallest optimal solution set, and the shortest running time. Compared with the deep learning model, it is interpretable, more accurate, and faster.
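The ideas named in the abstract (filter-informed initialization, a hybrid chromosome, and two competing objectives) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The names `Chromosome`, `init_population`, `objectives`, and `pareto_front`, the two RF hyperparameters and their ranges, and the reading of "minimizing the optimal solution set" as "minimizing the number of selected features" are all assumptions made for illustration. The accuracy value is supplied by the caller; in practice it would presumably come from training a random forest on the masked features.

```python
import random
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class Chromosome:
    mask: Tuple[int, ...]  # binary part: 1 keeps the feature, 0 drops it
    n_trees: int           # integer part: hypothetical RF hyperparameter
    max_depth: int         # integer part: hypothetical RF hyperparameter


def init_population(score_a: Sequence[float], score_b: Sequence[float],
                    size: int, rng: random.Random) -> List[Chromosome]:
    """Bias initial masks toward features that two filter scores rate highly.

    Averaging the min-max normalized scores is an assumed way of
    "integrating" the two filter methods, not the paper's stated rule.
    """
    def normalize(scores: Sequence[float]) -> List[float]:
        lo, hi = min(scores), max(scores)
        return [(s - lo) / (hi - lo) if hi > lo else 0.5 for s in scores]

    keep_prob = [(a + b) / 2
                 for a, b in zip(normalize(score_a), normalize(score_b))]
    population = []
    for _ in range(size):
        mask = tuple(int(rng.random() < p) for p in keep_prob)
        population.append(
            Chromosome(mask, rng.randint(10, 200), rng.randint(2, 20)))
    return population


def objectives(chrom: Chromosome, accuracy: float) -> Tuple[float, int]:
    """Both objectives are minimized: error rate and number of kept features."""
    return (1.0 - accuracy, sum(chrom.mask))


def dominates(p: Sequence[float], q: Sequence[float]) -> bool:
    """p dominates q if it is no worse everywhere and strictly better somewhere."""
    return (all(a <= b for a, b in zip(p, q))
            and any(a < b for a, b in zip(p, q)))


def pareto_front(points: Sequence[Sequence[float]]) -> List[int]:
    """Indices of points that no other point dominates (the first NSGA-II front)."""
    return [i for i, p in enumerate(points)
            if not any(dominates(q, p)
                       for j, q in enumerate(points) if j != i)]
```

A full NSGA-II loop would add crowding-distance sorting, crossover, and mutation on top of these pieces. The sketch only shows how one chromosome can carry both a feature mask and model parameters, and how accuracy and subset size trade off on a Pareto front.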
id pubmed-10306235
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-10306235 2023-06-29 PLoS One Research Article
Public Library of Science 2023-06-28 http://dx.doi.org/10.1371/journal.pone.0287754 Text en © 2023 Zeng et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.