Cargando…

Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants

Background: Tuberculosis (TB) is a public health problem worldwide, and the influence of meteorological and air pollutants on the incidence of tuberculosis have been attracting interest from researchers. It is of great importance to use machine learning to build a prediction model of tuberculosis in...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Na, Yuan, Maoxiang, Chen, Zhijun, Ma, Jian, Sun, Rui, Yang, Yide, He, Quanyuan, Guo, Xiaowei, Hu, Shixiong, Zhou, Junhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002212/
https://www.ncbi.nlm.nih.gov/pubmed/36900920
http://dx.doi.org/10.3390/ijerph20053910
_version_ 1784904335167062016
author Tang, Na
Yuan, Maoxiang
Chen, Zhijun
Ma, Jian
Sun, Rui
Yang, Yide
He, Quanyuan
Guo, Xiaowei
Hu, Shixiong
Zhou, Junhua
author_facet Tang, Na
Yuan, Maoxiang
Chen, Zhijun
Ma, Jian
Sun, Rui
Yang, Yide
He, Quanyuan
Guo, Xiaowei
Hu, Shixiong
Zhou, Junhua
author_sort Tang, Na
collection PubMed
description Background: Tuberculosis (TB) is a public health problem worldwide, and the influence of meteorological and air pollutants on the incidence of tuberculosis have been attracting interest from researchers. It is of great importance to use machine learning to build a prediction model of tuberculosis incidence influenced by meteorological and air pollutants for timely and applicable measures of both prevention and control. Methods: The data of daily TB notifications, meteorological factors and air pollutants in Changde City, Hunan Province ranging from 2010 to 2021 were collected. Spearman rank correlation analysis was conducted to analyze the correlation between the daily TB notifications and the meteorological factors or air pollutants. Based on the correlation analysis results, machine learning methods, including support vector regression, random forest regression and a BP neural network model, were utilized to construct the incidence prediction model of tuberculosis. RMSE, MAE and MAPE were performed to evaluate the constructed model for selecting the best prediction model. Results: (1) From the year 2010 to 2021, the overall incidence of tuberculosis in Changde City showed a downward trend. (2) The daily TB notifications was positively correlated with average temperature (r = 0.231), maximum temperature (r = 0.194), minimum temperature (r = 0.165), sunshine duration (r = 0.329), PM(2.5) (r = 0.097), PM(10) (r = 0.215) and O(3) (r = 0.084) (p < 0.05). However, there was a significant negative correlation between the daily TB notifications and mean air pressure (r = −0.119), precipitation (r = −0.063), relative humidity (r = −0.084), CO (r = −0.038) and SO(2) (r = −0.034) (p < 0.05). (3) The random forest regression model had the best fitting effect, while the BP neural network model exhibited the best prediction. (4) The validation set of the BP neural network model, including average daily temperature, sunshine hours and PM(10), showed the lowest root mean square error, mean absolute error and mean absolute percentage error, followed by support vector regression. Conclusions: The prediction trend of the BP neural network model, including average daily temperature, sunshine hours and PM(10), successfully mimics the actual incidence, and the peak incidence highly coincides with the actual aggregation time, with a high accuracy and a minimum error. Taken together, these data suggest that the BP neural network model can predict the incidence trend of tuberculosis in Changde City.
format Online
Article
Text
id pubmed-10002212
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100022122023-03-11 Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants Tang, Na Yuan, Maoxiang Chen, Zhijun Ma, Jian Sun, Rui Yang, Yide He, Quanyuan Guo, Xiaowei Hu, Shixiong Zhou, Junhua Int J Environ Res Public Health Article Background: Tuberculosis (TB) is a public health problem worldwide, and the influence of meteorological and air pollutants on the incidence of tuberculosis have been attracting interest from researchers. It is of great importance to use machine learning to build a prediction model of tuberculosis incidence influenced by meteorological and air pollutants for timely and applicable measures of both prevention and control. Methods: The data of daily TB notifications, meteorological factors and air pollutants in Changde City, Hunan Province ranging from 2010 to 2021 were collected. Spearman rank correlation analysis was conducted to analyze the correlation between the daily TB notifications and the meteorological factors or air pollutants. Based on the correlation analysis results, machine learning methods, including support vector regression, random forest regression and a BP neural network model, were utilized to construct the incidence prediction model of tuberculosis. RMSE, MAE and MAPE were performed to evaluate the constructed model for selecting the best prediction model. Results: (1) From the year 2010 to 2021, the overall incidence of tuberculosis in Changde City showed a downward trend. (2) The daily TB notifications was positively correlated with average temperature (r = 0.231), maximum temperature (r = 0.194), minimum temperature (r = 0.165), sunshine duration (r = 0.329), PM(2.5) (r = 0.097), PM(10) (r = 0.215) and O(3) (r = 0.084) (p < 0.05). However, there was a significant negative correlation between the daily TB notifications and mean air pressure (r = −0.119), precipitation (r = −0.063), relative humidity (r = −0.084), CO (r = −0.038) and SO(2) (r = −0.034) (p < 0.05). (3) The random forest regression model had the best fitting effect, while the BP neural network model exhibited the best prediction. (4) The validation set of the BP neural network model, including average daily temperature, sunshine hours and PM(10), showed the lowest root mean square error, mean absolute error and mean absolute percentage error, followed by support vector regression. Conclusions: The prediction trend of the BP neural network model, including average daily temperature, sunshine hours and PM(10), successfully mimics the actual incidence, and the peak incidence highly coincides with the actual aggregation time, with a high accuracy and a minimum error. Taken together, these data suggest that the BP neural network model can predict the incidence trend of tuberculosis in Changde City. MDPI 2023-02-22 /pmc/articles/PMC10002212/ /pubmed/36900920 http://dx.doi.org/10.3390/ijerph20053910 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tang, Na
Yuan, Maoxiang
Chen, Zhijun
Ma, Jian
Sun, Rui
Yang, Yide
He, Quanyuan
Guo, Xiaowei
Hu, Shixiong
Zhou, Junhua
Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants
title Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants
title_full Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants
title_fullStr Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants
title_full_unstemmed Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants
title_short Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants
title_sort machine learning prediction model of tuberculosis incidence based on meteorological factors and air pollutants
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002212/
https://www.ncbi.nlm.nih.gov/pubmed/36900920
http://dx.doi.org/10.3390/ijerph20053910
work_keys_str_mv AT tangna machinelearningpredictionmodeloftuberculosisincidencebasedonmeteorologicalfactorsandairpollutants
AT yuanmaoxiang machinelearningpredictionmodeloftuberculosisincidencebasedonmeteorologicalfactorsandairpollutants
AT chenzhijun machinelearningpredictionmodeloftuberculosisincidencebasedonmeteorologicalfactorsandairpollutants
AT majian machinelearningpredictionmodeloftuberculosisincidencebasedonmeteorologicalfactorsandairpollutants
AT sunrui machinelearningpredictionmodeloftuberculosisincidencebasedonmeteorologicalfactorsandairpollutants
AT yangyide machinelearningpredictionmodeloftuberculosisincidencebasedonmeteorologicalfactorsandairpollutants
AT hequanyuan machinelearningpredictionmodeloftuberculosisincidencebasedonmeteorologicalfactorsandairpollutants
AT guoxiaowei machinelearningpredictionmodeloftuberculosisincidencebasedonmeteorologicalfactorsandairpollutants
AT hushixiong machinelearningpredictionmodeloftuberculosisincidencebasedonmeteorologicalfactorsandairpollutants
AT zhoujunhua machinelearningpredictionmodeloftuberculosisincidencebasedonmeteorologicalfactorsandairpollutants