Cargando…

Analyzing COVID‐19 Using Multisource Data: An Integrated Approach of Visualization, Spatial Regression, and Machine Learning

Coronavirus disease 2019 (COVID‐19), caused by severe acute respiratory syndrome coronavirus 2, was first identified in Wuhan, China, in December 2019. As the number of COVID‐19 infections and deaths worldwide continues to increase rapidly, the prevention and control of COVID‐19 remains urgent. This...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Chao, Zhou, Mengjie, Liu, Pengyu, Yang, Mengjie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8335962/
https://www.ncbi.nlm.nih.gov/pubmed/34377880
http://dx.doi.org/10.1029/2021GH000439
_version_ 1783733233251254272
author Wu, Chao
Zhou, Mengjie
Liu, Pengyu
Yang, Mengjie
author_facet Wu, Chao
Zhou, Mengjie
Liu, Pengyu
Yang, Mengjie
author_sort Wu, Chao
collection PubMed
description Coronavirus disease 2019 (COVID‐19), caused by severe acute respiratory syndrome coronavirus 2, was first identified in Wuhan, China, in December 2019. As the number of COVID‐19 infections and deaths worldwide continues to increase rapidly, the prevention and control of COVID‐19 remains urgent. This article aims to analyze COVID‐19 from a geographical perspective, and this information can provide useful insights for rapid visualization of spatial‐temporal epidemic information and identification of the factors important to the spread of COVID‐19. A new type of vitalization method, called the point grid map, is integrated with calendar‐based visualization to show the spatial‐temporal variations in COVID‐19. The combination of mixed geographically weighted regression (mixed GWR) and extreme gradient boosting (XGBoost) is used to identify the potential factors and the corresponding importance. The visualization results clearly reflect the spatial‐temporal patterns of COVID‐19. The quantified results reveal that the impact of population outflow from Wuhan is the most important factor and indicate statistically significant spatial heterogeneity. Our results provide insights into how multisource big geodata can be employed within the framework of integrating visualization and analytical methods to characterize COVID‐19 trends. In addition, this work can help understand the influential factors for controlling and preventing epidemics, which is important for policy design and effective decision‐making for controlling COVID‐19. The results reveal that one of the most effective ways to control COVID‐19 include controlling the source of infection, cutting off the transmission route, and protecting vulnerable groups.
format Online
Article
Text
id pubmed-8335962
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-83359622021-08-09 Analyzing COVID‐19 Using Multisource Data: An Integrated Approach of Visualization, Spatial Regression, and Machine Learning Wu, Chao Zhou, Mengjie Liu, Pengyu Yang, Mengjie Geohealth Research Article Coronavirus disease 2019 (COVID‐19), caused by severe acute respiratory syndrome coronavirus 2, was first identified in Wuhan, China, in December 2019. As the number of COVID‐19 infections and deaths worldwide continues to increase rapidly, the prevention and control of COVID‐19 remains urgent. This article aims to analyze COVID‐19 from a geographical perspective, and this information can provide useful insights for rapid visualization of spatial‐temporal epidemic information and identification of the factors important to the spread of COVID‐19. A new type of vitalization method, called the point grid map, is integrated with calendar‐based visualization to show the spatial‐temporal variations in COVID‐19. The combination of mixed geographically weighted regression (mixed GWR) and extreme gradient boosting (XGBoost) is used to identify the potential factors and the corresponding importance. The visualization results clearly reflect the spatial‐temporal patterns of COVID‐19. The quantified results reveal that the impact of population outflow from Wuhan is the most important factor and indicate statistically significant spatial heterogeneity. Our results provide insights into how multisource big geodata can be employed within the framework of integrating visualization and analytical methods to characterize COVID‐19 trends. In addition, this work can help understand the influential factors for controlling and preventing epidemics, which is important for policy design and effective decision‐making for controlling COVID‐19. The results reveal that one of the most effective ways to control COVID‐19 include controlling the source of infection, cutting off the transmission route, and protecting vulnerable groups. John Wiley and Sons Inc. 2021-08-01 /pmc/articles/PMC8335962/ /pubmed/34377880 http://dx.doi.org/10.1029/2021GH000439 Text en © 2021. The Authors. GeoHealth published by Wiley Periodicals LLC on behalf of American Geophysical Union. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wu, Chao
Zhou, Mengjie
Liu, Pengyu
Yang, Mengjie
Analyzing COVID‐19 Using Multisource Data: An Integrated Approach of Visualization, Spatial Regression, and Machine Learning
title Analyzing COVID‐19 Using Multisource Data: An Integrated Approach of Visualization, Spatial Regression, and Machine Learning
title_full Analyzing COVID‐19 Using Multisource Data: An Integrated Approach of Visualization, Spatial Regression, and Machine Learning
title_fullStr Analyzing COVID‐19 Using Multisource Data: An Integrated Approach of Visualization, Spatial Regression, and Machine Learning
title_full_unstemmed Analyzing COVID‐19 Using Multisource Data: An Integrated Approach of Visualization, Spatial Regression, and Machine Learning
title_short Analyzing COVID‐19 Using Multisource Data: An Integrated Approach of Visualization, Spatial Regression, and Machine Learning
title_sort analyzing covid‐19 using multisource data: an integrated approach of visualization, spatial regression, and machine learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8335962/
https://www.ncbi.nlm.nih.gov/pubmed/34377880
http://dx.doi.org/10.1029/2021GH000439
work_keys_str_mv AT wuchao analyzingcovid19usingmultisourcedataanintegratedapproachofvisualizationspatialregressionandmachinelearning
AT zhoumengjie analyzingcovid19usingmultisourcedataanintegratedapproachofvisualizationspatialregressionandmachinelearning
AT liupengyu analyzingcovid19usingmultisourcedataanintegratedapproachofvisualizationspatialregressionandmachinelearning
AT yangmengjie analyzingcovid19usingmultisourcedataanintegratedapproachofvisualizationspatialregressionandmachinelearning