Cargando…

Predicting incidence of hepatitis E using machine learning in Jiangsu Province, China

Hepatitis E is an increasingly serious worldwide public health problem that has attracted extensive attention. It is necessary to accurately predict the incidence of hepatitis E to better plan ahead for future medical care. In this study, we developed a Bi-LSTM model that incorporated meteorological...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Xiaoqing, Liu, Wendong, Zhang, Xuefeng, Wang, Minghao, Bao, Changjun, Wu, Tianxing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cambridge University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9386790/
https://www.ncbi.nlm.nih.gov/pubmed/35899849
http://dx.doi.org/10.1017/S0950268822001303
_version_ 1784769888508706816
author Cheng, Xiaoqing
Liu, Wendong
Zhang, Xuefeng
Wang, Minghao
Bao, Changjun
Wu, Tianxing
author_facet Cheng, Xiaoqing
Liu, Wendong
Zhang, Xuefeng
Wang, Minghao
Bao, Changjun
Wu, Tianxing
author_sort Cheng, Xiaoqing
collection PubMed
description Hepatitis E is an increasingly serious worldwide public health problem that has attracted extensive attention. It is necessary to accurately predict the incidence of hepatitis E to better plan ahead for future medical care. In this study, we developed a Bi-LSTM model that incorporated meteorological factors to predict the prevalence of hepatitis E. The hepatitis E data used in this study are collected from January 2005 to March 2017 by Jiangsu Provincial Center for Disease Control and Prevention. ARIMA, GBDT, SVM, LSTM and Bi-LSTM models are adopted in this study. The data from January 2009 to September 2014 are used as the training set to fit models, and data from October 2014 to March 2017 are used as the testing set to evaluate the predicting accuracy of different models. Selecting models and evaluating the effectiveness of the models are based on mean absolute per cent error (MAPE), root mean square error (RMSE) and mean absolute error (MAE). A total of 44 923 cases of hepatitis E are detected in Jiangsu Province from January 2005 to March 2017. The average monthly incidence rate is 0.35 per 100 000 persons in Jiangsu Province. Incorporating meteorological factors of temperature, water vapour pressure, and rainfall as a combination into the Bi-LSTM Model achieved the state-of-the-art performance in predicting the monthly incidence of hepatitis E, in which RMSE is 0.044, MAPE is 11.88%, and MAE is 0.0377. The Bi-LSTM model with the meteorological factors of temperature, water vapour pressure, and rainfall can fully extract the linear and non-linear information in the hepatitis E incidence data, and has significantly improved the interpretability, learning ability, generalisability and prediction accuracy.
format Online
Article
Text
id pubmed-9386790
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cambridge University Press
record_format MEDLINE/PubMed
spelling pubmed-93867902022-08-23 Predicting incidence of hepatitis E using machine learning in Jiangsu Province, China Cheng, Xiaoqing Liu, Wendong Zhang, Xuefeng Wang, Minghao Bao, Changjun Wu, Tianxing Epidemiol Infect Original Paper Hepatitis E is an increasingly serious worldwide public health problem that has attracted extensive attention. It is necessary to accurately predict the incidence of hepatitis E to better plan ahead for future medical care. In this study, we developed a Bi-LSTM model that incorporated meteorological factors to predict the prevalence of hepatitis E. The hepatitis E data used in this study are collected from January 2005 to March 2017 by Jiangsu Provincial Center for Disease Control and Prevention. ARIMA, GBDT, SVM, LSTM and Bi-LSTM models are adopted in this study. The data from January 2009 to September 2014 are used as the training set to fit models, and data from October 2014 to March 2017 are used as the testing set to evaluate the predicting accuracy of different models. Selecting models and evaluating the effectiveness of the models are based on mean absolute per cent error (MAPE), root mean square error (RMSE) and mean absolute error (MAE). A total of 44 923 cases of hepatitis E are detected in Jiangsu Province from January 2005 to March 2017. The average monthly incidence rate is 0.35 per 100 000 persons in Jiangsu Province. Incorporating meteorological factors of temperature, water vapour pressure, and rainfall as a combination into the Bi-LSTM Model achieved the state-of-the-art performance in predicting the monthly incidence of hepatitis E, in which RMSE is 0.044, MAPE is 11.88%, and MAE is 0.0377. The Bi-LSTM model with the meteorological factors of temperature, water vapour pressure, and rainfall can fully extract the linear and non-linear information in the hepatitis E incidence data, and has significantly improved the interpretability, learning ability, generalisability and prediction accuracy. Cambridge University Press 2022-07-28 /pmc/articles/PMC9386790/ /pubmed/35899849 http://dx.doi.org/10.1017/S0950268822001303 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
spellingShingle Original Paper
Cheng, Xiaoqing
Liu, Wendong
Zhang, Xuefeng
Wang, Minghao
Bao, Changjun
Wu, Tianxing
Predicting incidence of hepatitis E using machine learning in Jiangsu Province, China
title Predicting incidence of hepatitis E using machine learning in Jiangsu Province, China
title_full Predicting incidence of hepatitis E using machine learning in Jiangsu Province, China
title_fullStr Predicting incidence of hepatitis E using machine learning in Jiangsu Province, China
title_full_unstemmed Predicting incidence of hepatitis E using machine learning in Jiangsu Province, China
title_short Predicting incidence of hepatitis E using machine learning in Jiangsu Province, China
title_sort predicting incidence of hepatitis e using machine learning in jiangsu province, china
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9386790/
https://www.ncbi.nlm.nih.gov/pubmed/35899849
http://dx.doi.org/10.1017/S0950268822001303
work_keys_str_mv AT chengxiaoqing predictingincidenceofhepatitiseusingmachinelearninginjiangsuprovincechina
AT liuwendong predictingincidenceofhepatitiseusingmachinelearninginjiangsuprovincechina
AT zhangxuefeng predictingincidenceofhepatitiseusingmachinelearninginjiangsuprovincechina
AT wangminghao predictingincidenceofhepatitiseusingmachinelearninginjiangsuprovincechina
AT baochangjun predictingincidenceofhepatitiseusingmachinelearninginjiangsuprovincechina
AT wutianxing predictingincidenceofhepatitiseusingmachinelearninginjiangsuprovincechina