Cargando…

Unraveling the dynamic importance of county-level features in trajectory of COVID-19

The objective of this study was to investigate the importance of multiple county-level features in the trajectory of COVID-19. We examined feature importance across 2787 counties in the United States using data-driven machine learning models. Existing mathematical models of disease spread usually fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Qingchun, Yang, Yang, Wang, Wanqiu, Lee, Sanghyeon, Xiao, Xin, Gao, Xinyu, Oztekin, Bora, Fan, Chao, Mostafavi, Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8219723/
https://www.ncbi.nlm.nih.gov/pubmed/34158571
http://dx.doi.org/10.1038/s41598-021-92634-w
_version_ 1783710997594243072
author Li, Qingchun
Yang, Yang
Wang, Wanqiu
Lee, Sanghyeon
Xiao, Xin
Gao, Xinyu
Oztekin, Bora
Fan, Chao
Mostafavi, Ali
author_facet Li, Qingchun
Yang, Yang
Wang, Wanqiu
Lee, Sanghyeon
Xiao, Xin
Gao, Xinyu
Oztekin, Bora
Fan, Chao
Mostafavi, Ali
author_sort Li, Qingchun
collection PubMed
description The objective of this study was to investigate the importance of multiple county-level features in the trajectory of COVID-19. We examined feature importance across 2787 counties in the United States using data-driven machine learning models. Existing mathematical models of disease spread usually focused on the case prediction with different infection rates without incorporating multiple heterogeneous features that could impact the spatial and temporal trajectory of COVID-19. Recognizing this, we trained a data-driven model using 23 features representing six key influencing factors affecting the pandemic spread: social demographics of counties, population activities, mobility within the counties, movement across counties, disease attributes, and social network structure. Also, we categorized counties into multiple groups according to their population densities, and we divided the trajectory of COVID-19 into three stages: the outbreak stage, the social distancing stage, and the reopening stage. The study aimed to answer two research questions: (1) The extent to which the importance of heterogeneous features evolved at different stages; (2) The extent to which the importance of heterogeneous features varied across counties with different characteristics. We fitted a set of random forest models to determine weekly feature importance. The results showed that: (1) Social demographic features, such as gross domestic product, population density, and minority status maintained high-importance features throughout stages of COVID-19 across 2787 studied counties; (2) Within-county mobility features had the highest importance in counties with higher population densities; (3) The feature reflecting the social network structure (Facebook, social connectedness index), had higher importance for counties with higher population densities. The results showed that the data-driven machine learning models could provide important insights to inform policymakers regarding feature importance for counties with various population densities and at different stages of a pandemic life cycle.
format Online
Article
Text
id pubmed-8219723
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-82197232021-06-24 Unraveling the dynamic importance of county-level features in trajectory of COVID-19 Li, Qingchun Yang, Yang Wang, Wanqiu Lee, Sanghyeon Xiao, Xin Gao, Xinyu Oztekin, Bora Fan, Chao Mostafavi, Ali Sci Rep Article The objective of this study was to investigate the importance of multiple county-level features in the trajectory of COVID-19. We examined feature importance across 2787 counties in the United States using data-driven machine learning models. Existing mathematical models of disease spread usually focused on the case prediction with different infection rates without incorporating multiple heterogeneous features that could impact the spatial and temporal trajectory of COVID-19. Recognizing this, we trained a data-driven model using 23 features representing six key influencing factors affecting the pandemic spread: social demographics of counties, population activities, mobility within the counties, movement across counties, disease attributes, and social network structure. Also, we categorized counties into multiple groups according to their population densities, and we divided the trajectory of COVID-19 into three stages: the outbreak stage, the social distancing stage, and the reopening stage. The study aimed to answer two research questions: (1) The extent to which the importance of heterogeneous features evolved at different stages; (2) The extent to which the importance of heterogeneous features varied across counties with different characteristics. We fitted a set of random forest models to determine weekly feature importance. The results showed that: (1) Social demographic features, such as gross domestic product, population density, and minority status maintained high-importance features throughout stages of COVID-19 across 2787 studied counties; (2) Within-county mobility features had the highest importance in counties with higher population densities; (3) The feature reflecting the social network structure (Facebook, social connectedness index), had higher importance for counties with higher population densities. The results showed that the data-driven machine learning models could provide important insights to inform policymakers regarding feature importance for counties with various population densities and at different stages of a pandemic life cycle. Nature Publishing Group UK 2021-06-22 /pmc/articles/PMC8219723/ /pubmed/34158571 http://dx.doi.org/10.1038/s41598-021-92634-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Li, Qingchun
Yang, Yang
Wang, Wanqiu
Lee, Sanghyeon
Xiao, Xin
Gao, Xinyu
Oztekin, Bora
Fan, Chao
Mostafavi, Ali
Unraveling the dynamic importance of county-level features in trajectory of COVID-19
title Unraveling the dynamic importance of county-level features in trajectory of COVID-19
title_full Unraveling the dynamic importance of county-level features in trajectory of COVID-19
title_fullStr Unraveling the dynamic importance of county-level features in trajectory of COVID-19
title_full_unstemmed Unraveling the dynamic importance of county-level features in trajectory of COVID-19
title_short Unraveling the dynamic importance of county-level features in trajectory of COVID-19
title_sort unraveling the dynamic importance of county-level features in trajectory of covid-19
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8219723/
https://www.ncbi.nlm.nih.gov/pubmed/34158571
http://dx.doi.org/10.1038/s41598-021-92634-w
work_keys_str_mv AT liqingchun unravelingthedynamicimportanceofcountylevelfeaturesintrajectoryofcovid19
AT yangyang unravelingthedynamicimportanceofcountylevelfeaturesintrajectoryofcovid19
AT wangwanqiu unravelingthedynamicimportanceofcountylevelfeaturesintrajectoryofcovid19
AT leesanghyeon unravelingthedynamicimportanceofcountylevelfeaturesintrajectoryofcovid19
AT xiaoxin unravelingthedynamicimportanceofcountylevelfeaturesintrajectoryofcovid19
AT gaoxinyu unravelingthedynamicimportanceofcountylevelfeaturesintrajectoryofcovid19
AT oztekinbora unravelingthedynamicimportanceofcountylevelfeaturesintrajectoryofcovid19
AT fanchao unravelingthedynamicimportanceofcountylevelfeaturesintrajectoryofcovid19
AT mostafaviali unravelingthedynamicimportanceofcountylevelfeaturesintrajectoryofcovid19