Cargando…

Spatial Prediction of COVID-19 in China Based on Machine Learning Algorithms and Geographically Weighted Regression

COVID-19 has swept through the world since December 2019 and caused a large number of patients and deaths. Spatial prediction on the spread of the epidemic is greatly important for disease control and management. In this study, we predicted the cumulative confirmed cases (CCCs) from Jan 17 to Mar 1,...

Descripción completa

Detalles Bibliográficos
Autores principales: Shao, Qi, Xu, Yongming, Wu, Hanyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8528585/
https://www.ncbi.nlm.nih.gov/pubmed/34691241
http://dx.doi.org/10.1155/2021/7196492
_version_ 1784586278305529856
author Shao, Qi
Xu, Yongming
Wu, Hanyi
author_facet Shao, Qi
Xu, Yongming
Wu, Hanyi
author_sort Shao, Qi
collection PubMed
description COVID-19 has swept through the world since December 2019 and caused a large number of patients and deaths. Spatial prediction on the spread of the epidemic is greatly important for disease control and management. In this study, we predicted the cumulative confirmed cases (CCCs) from Jan 17 to Mar 1, 2020, in mainland China at the city level, using machine learning algorithms, geographically weighted regression (GWR), and partial least squares regression (PLSR) based on population flow, geolocation, meteorological, and socioeconomic variables. The validation results showed that machine learning algorithms and GWR achieved good performances. These models could not effectively predict CCCs in Wuhan, the first city that reported COVID-19 cases in China, but performed well in other cities. Random Forest (RF) outperformed other methods with a CV‐R(2) of 0.84. In this model, the population flow from Wuhan to other cities (WP) was the most important feature and the other features also made considerable contributions to the prediction accuracy. Compared with RF, GWR showed a slightly worse performance (CV‐R(2) = 0.81) but required fewer spatial independent variables. This study explored the spatial prediction of the epidemic based on multisource spatial independent variables, providing references for the estimation of CCCs in the regions lacking accurate and timely.
format Online
Article
Text
id pubmed-8528585
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-85285852021-10-21 Spatial Prediction of COVID-19 in China Based on Machine Learning Algorithms and Geographically Weighted Regression Shao, Qi Xu, Yongming Wu, Hanyi Comput Math Methods Med Research Article COVID-19 has swept through the world since December 2019 and caused a large number of patients and deaths. Spatial prediction on the spread of the epidemic is greatly important for disease control and management. In this study, we predicted the cumulative confirmed cases (CCCs) from Jan 17 to Mar 1, 2020, in mainland China at the city level, using machine learning algorithms, geographically weighted regression (GWR), and partial least squares regression (PLSR) based on population flow, geolocation, meteorological, and socioeconomic variables. The validation results showed that machine learning algorithms and GWR achieved good performances. These models could not effectively predict CCCs in Wuhan, the first city that reported COVID-19 cases in China, but performed well in other cities. Random Forest (RF) outperformed other methods with a CV‐R(2) of 0.84. In this model, the population flow from Wuhan to other cities (WP) was the most important feature and the other features also made considerable contributions to the prediction accuracy. Compared with RF, GWR showed a slightly worse performance (CV‐R(2) = 0.81) but required fewer spatial independent variables. This study explored the spatial prediction of the epidemic based on multisource spatial independent variables, providing references for the estimation of CCCs in the regions lacking accurate and timely. Hindawi 2021-10-13 /pmc/articles/PMC8528585/ /pubmed/34691241 http://dx.doi.org/10.1155/2021/7196492 Text en Copyright © 2021 Qi Shao et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Shao, Qi
Xu, Yongming
Wu, Hanyi
Spatial Prediction of COVID-19 in China Based on Machine Learning Algorithms and Geographically Weighted Regression
title Spatial Prediction of COVID-19 in China Based on Machine Learning Algorithms and Geographically Weighted Regression
title_full Spatial Prediction of COVID-19 in China Based on Machine Learning Algorithms and Geographically Weighted Regression
title_fullStr Spatial Prediction of COVID-19 in China Based on Machine Learning Algorithms and Geographically Weighted Regression
title_full_unstemmed Spatial Prediction of COVID-19 in China Based on Machine Learning Algorithms and Geographically Weighted Regression
title_short Spatial Prediction of COVID-19 in China Based on Machine Learning Algorithms and Geographically Weighted Regression
title_sort spatial prediction of covid-19 in china based on machine learning algorithms and geographically weighted regression
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8528585/
https://www.ncbi.nlm.nih.gov/pubmed/34691241
http://dx.doi.org/10.1155/2021/7196492
work_keys_str_mv AT shaoqi spatialpredictionofcovid19inchinabasedonmachinelearningalgorithmsandgeographicallyweightedregression
AT xuyongming spatialpredictionofcovid19inchinabasedonmachinelearningalgorithmsandgeographicallyweightedregression
AT wuhanyi spatialpredictionofcovid19inchinabasedonmachinelearningalgorithmsandgeographicallyweightedregression