
The research of ARIMA, GM(1,1), and LSTM models for prediction of TB cases in China

BACKGROUND AND OBJECTIVE: Tuberculosis (TB) is a public health problem in China that not only endangers the population’s health but also affects economic and social development. Accurate prediction is needed to give policymakers early warning and provide ef...


Bibliographic Details
Main authors: Zhao, Daren, Zhang, Huiwu, Cao, Qing, Wang, Zhiyi, He, Sizhang, Zhou, Minghua, Zhang, Ruihua
Format: Online Article Text
Language: English
Published: Public Library of Science 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8865644/
https://www.ncbi.nlm.nih.gov/pubmed/35196309
http://dx.doi.org/10.1371/journal.pone.0262734
author Zhao, Daren
Zhang, Huiwu
Cao, Qing
Wang, Zhiyi
He, Sizhang
Zhou, Minghua
Zhang, Ruihua
collection PubMed
description BACKGROUND AND OBJECTIVE: Tuberculosis (TB) is a public health problem in China that not only endangers the population’s health but also affects economic and social development. Accurate prediction is needed to give policymakers early warning and to support effective precautionary measures. In this study, ARIMA, GM(1,1), and LSTM models were constructed and compared. The results showed that the LSTM was the optimal model, achieving satisfactory performance in predicting TB cases in mainland China. METHODS: Data on tuberculosis cases in mainland China were extracted from the website of the National Health Commission of the People’s Republic of China. Based on the characteristics of the TB data and the sample requirements, we built ARIMA, GM(1,1), and LSTM models to predict the prevalence trend of TB. The mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) were used to evaluate model fitting and prediction accuracy. RESULTS: There were 3,021,995 tuberculosis cases in mainland China from January 2018 to December 2020, and overall TB cases showed a downward trend. The optimal ARIMA model was ARIMA(0,1,0) × (0,1,0)_12. The equation for the GM(1,1) model was X(k+1) = -10057053.55 e^(-0.01k) + 10153178.55; the mean square deviation ratio C was 0.49, and the small error probability P was 0.94. The LSTM model consisted of an input layer, a hidden layer, and an output layer, with the number of epochs and the learning rate set to 60 and 0.01, respectively. The MAE, RMSE, and MAPE values of the LSTM model were smaller than those of the GM(1,1) and ARIMA models. CONCLUSIONS: Our findings showed that the LSTM model was the optimal model, with higher accuracy than the ARIMA and GM(1,1) models. Its predictions can serve as a tool to inform TB prevention measures in mainland China.
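The quantities named in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' code. The function names are hypothetical, the seasonal ARIMA forecast assumes a model with no constant term, and the GM(1,1) function assumes the quoted equation is the standard time-response function for the accumulated (1-AGO) series, with coefficients as rounded in the abstract.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean square error."""
    return math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    )


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be nonzero)."""
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


def sarima_010_010_12_forecast(series):
    """One-step forecast for ARIMA(0,1,0)x(0,1,0)_12 without a constant.

    The model (1 - B)(1 - B^12) y_t = e_t implies
    y_hat[t] = y[t-1] + y[t-12] - y[t-13].
    """
    if len(series) < 13:
        raise ValueError("need at least 13 monthly observations")
    return series[-1] + series[-12] - series[-13]


def gm11_accumulated(k):
    """GM(1,1) equation as quoted in the abstract (rounded coefficients).

    Assumption: this is the accumulated-series response X(k+1).
    """
    return -10057053.55 * math.exp(-0.01 * k) + 10153178.55


def gm11_restored(k):
    """Restored (non-accumulated) value for step k+1, k >= 1.

    Assumption: standard GM(1,1) restoration by first difference.
    """
    return gm11_accumulated(k) - gm11_accumulated(k - 1)
```

Because the exponent in the abstract is rounded to 0.01, values computed from gm11_accumulated are only indicative and should not be expected to reproduce the paper's fitted series.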
format Online
Article
Text
id pubmed-8865644
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-8865644 2022-02-24 PLoS One Research Article Public Library of Science 2022-02-23 /pmc/articles/PMC8865644/ /pubmed/35196309 http://dx.doi.org/10.1371/journal.pone.0262734 Text en © 2022 Zhao et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title The research of ARIMA, GM(1,1), and LSTM models for prediction of TB cases in China
topic Research Article