Cargando…

Who was at risk for COVID-19 late in the US pandemic? Insights from a population health machine learning model

Notable discrepancies in vulnerability to COVID-19 infection have been identified between specific population groups and regions in the USA. The purpose of this study was to estimate the likelihood of COVID-19 infection using a machine-learning algorithm that can be updated continuously based on hea...

Descripción completa

Detalles Bibliográficos
Autores principales: Adeoye, Elijah A., Rozenfeld, Yelena, Beam, Jennifer, Boudreau, Karen, Cox, Emily J., Scanlan, James M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9090454/
https://www.ncbi.nlm.nih.gov/pubmed/35538201
http://dx.doi.org/10.1007/s11517-022-02549-5
_version_ 1784704727091511296
author Adeoye, Elijah A.
Rozenfeld, Yelena
Beam, Jennifer
Boudreau, Karen
Cox, Emily J.
Scanlan, James M.
author_facet Adeoye, Elijah A.
Rozenfeld, Yelena
Beam, Jennifer
Boudreau, Karen
Cox, Emily J.
Scanlan, James M.
author_sort Adeoye, Elijah A.
collection PubMed
description Notable discrepancies in vulnerability to COVID-19 infection have been identified between specific population groups and regions in the USA. The purpose of this study was to estimate the likelihood of COVID-19 infection using a machine-learning algorithm that can be updated continuously based on health care data. Patient records were extracted for all COVID-19 nasal swab PCR tests performed within the Providence St. Joseph Health system from February to October of 2020. A total of 316,599 participants were included in this study, and approximately 7.7% (n = 24,358) tested positive for COVID-19. A gradient boosting model, LightGBM (LGBM), predicted risk of initial infection with an area under the receiver operating characteristic curve of 0.819. Factors that predicted infection were cough, fever, being a member of the Hispanic or Latino community, being Spanish speaking, having a history of diabetes or dementia, and living in a neighborhood with housing insecurity. A model trained on sociodemographic, environmental, and medical history data performed well in predicting risk of a positive COVID-19 test. This model could be used to tailor education, public health policy, and resources for communities that are at the greatest risk of infection. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11517-022-02549-5.
format Online
Article
Text
id pubmed-9090454
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-90904542022-05-11 Who was at risk for COVID-19 late in the US pandemic? Insights from a population health machine learning model Adeoye, Elijah A. Rozenfeld, Yelena Beam, Jennifer Boudreau, Karen Cox, Emily J. Scanlan, James M. Med Biol Eng Comput Original Article Notable discrepancies in vulnerability to COVID-19 infection have been identified between specific population groups and regions in the USA. The purpose of this study was to estimate the likelihood of COVID-19 infection using a machine-learning algorithm that can be updated continuously based on health care data. Patient records were extracted for all COVID-19 nasal swab PCR tests performed within the Providence St. Joseph Health system from February to October of 2020. A total of 316,599 participants were included in this study, and approximately 7.7% (n = 24,358) tested positive for COVID-19. A gradient boosting model, LightGBM (LGBM), predicted risk of initial infection with an area under the receiver operating characteristic curve of 0.819. Factors that predicted infection were cough, fever, being a member of the Hispanic or Latino community, being Spanish speaking, having a history of diabetes or dementia, and living in a neighborhood with housing insecurity. A model trained on sociodemographic, environmental, and medical history data performed well in predicting risk of a positive COVID-19 test. This model could be used to tailor education, public health policy, and resources for communities that are at the greatest risk of infection. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11517-022-02549-5. Springer Berlin Heidelberg 2022-05-11 2022 /pmc/articles/PMC9090454/ /pubmed/35538201 http://dx.doi.org/10.1007/s11517-022-02549-5 Text en © International Federation for Medical and Biological Engineering 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Article
Adeoye, Elijah A.
Rozenfeld, Yelena
Beam, Jennifer
Boudreau, Karen
Cox, Emily J.
Scanlan, James M.
Who was at risk for COVID-19 late in the US pandemic? Insights from a population health machine learning model
title Who was at risk for COVID-19 late in the US pandemic? Insights from a population health machine learning model
title_full Who was at risk for COVID-19 late in the US pandemic? Insights from a population health machine learning model
title_fullStr Who was at risk for COVID-19 late in the US pandemic? Insights from a population health machine learning model
title_full_unstemmed Who was at risk for COVID-19 late in the US pandemic? Insights from a population health machine learning model
title_short Who was at risk for COVID-19 late in the US pandemic? Insights from a population health machine learning model
title_sort who was at risk for covid-19 late in the us pandemic? insights from a population health machine learning model
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9090454/
https://www.ncbi.nlm.nih.gov/pubmed/35538201
http://dx.doi.org/10.1007/s11517-022-02549-5
work_keys_str_mv AT adeoyeelijaha whowasatriskforcovid19lateintheuspandemicinsightsfromapopulationhealthmachinelearningmodel
AT rozenfeldyelena whowasatriskforcovid19lateintheuspandemicinsightsfromapopulationhealthmachinelearningmodel
AT beamjennifer whowasatriskforcovid19lateintheuspandemicinsightsfromapopulationhealthmachinelearningmodel
AT boudreaukaren whowasatriskforcovid19lateintheuspandemicinsightsfromapopulationhealthmachinelearningmodel
AT coxemilyj whowasatriskforcovid19lateintheuspandemicinsightsfromapopulationhealthmachinelearningmodel
AT scanlanjamesm whowasatriskforcovid19lateintheuspandemicinsightsfromapopulationhealthmachinelearningmodel