Cargando…

Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning

There have been over 621 million cases of COVID-19 worldwide with over 6.5 million deaths. Despite the high secondary attack rate of COVID-19 in shared households, some exposed individuals do not contract the virus. In addition, little is known about whether the occurrence of COVID-19 resistance dif...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Kai-Wen K., Paris, Chloé F., Gorman, Kevin T., Rattsev, Ilia, Yoo, Rebecca H., Chen, Yijia, Desman, Jacob M., Wei, Tony Y., Greenstein, Joseph L., Taylor, Casey Overby, Ray, Stuart C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9946212/
https://www.ncbi.nlm.nih.gov/pubmed/36812214
http://dx.doi.org/10.1371/journal.pone.0278466
_version_ 1784892284501753856
author Yang, Kai-Wen K.
Paris, Chloé F.
Gorman, Kevin T.
Rattsev, Ilia
Yoo, Rebecca H.
Chen, Yijia
Desman, Jacob M.
Wei, Tony Y.
Greenstein, Joseph L.
Taylor, Casey Overby
Ray, Stuart C.
author_facet Yang, Kai-Wen K.
Paris, Chloé F.
Gorman, Kevin T.
Rattsev, Ilia
Yoo, Rebecca H.
Chen, Yijia
Desman, Jacob M.
Wei, Tony Y.
Greenstein, Joseph L.
Taylor, Casey Overby
Ray, Stuart C.
author_sort Yang, Kai-Wen K.
collection PubMed
description There have been over 621 million cases of COVID-19 worldwide with over 6.5 million deaths. Despite the high secondary attack rate of COVID-19 in shared households, some exposed individuals do not contract the virus. In addition, little is known about whether the occurrence of COVID-19 resistance differs among people by health characteristics as stored in the electronic health records (EHR). In this retrospective analysis, we develop a statistical model to predict COVID-19 resistance in 8,536 individuals with prior COVID-19 exposure using demographics, diagnostic codes, outpatient medication orders, and count of Elixhauser comorbidities in EHR data from the COVID-19 Precision Medicine Platform Registry. Cluster analyses identified 5 patterns of diagnostic codes that distinguished resistant from non-resistant patients in our study population. In addition, our models showed modest performance in predicting COVID-19 resistance (best performing model AUROC = 0.61). Monte Carlo simulations conducted indicated that the AUROC results are statistically significant (p < 0.001) for the testing set. We hope to validate the features found to be associated with resistance/non-resistance through more advanced association studies.
format Online
Article
Text
id pubmed-9946212
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-99462122023-02-23 Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning Yang, Kai-Wen K. Paris, Chloé F. Gorman, Kevin T. Rattsev, Ilia Yoo, Rebecca H. Chen, Yijia Desman, Jacob M. Wei, Tony Y. Greenstein, Joseph L. Taylor, Casey Overby Ray, Stuart C. PLoS One Research Article There have been over 621 million cases of COVID-19 worldwide with over 6.5 million deaths. Despite the high secondary attack rate of COVID-19 in shared households, some exposed individuals do not contract the virus. In addition, little is known about whether the occurrence of COVID-19 resistance differs among people by health characteristics as stored in the electronic health records (EHR). In this retrospective analysis, we develop a statistical model to predict COVID-19 resistance in 8,536 individuals with prior COVID-19 exposure using demographics, diagnostic codes, outpatient medication orders, and count of Elixhauser comorbidities in EHR data from the COVID-19 Precision Medicine Platform Registry. Cluster analyses identified 5 patterns of diagnostic codes that distinguished resistant from non-resistant patients in our study population. In addition, our models showed modest performance in predicting COVID-19 resistance (best performing model AUROC = 0.61). Monte Carlo simulations conducted indicated that the AUROC results are statistically significant (p < 0.001) for the testing set. We hope to validate the features found to be associated with resistance/non-resistance through more advanced association studies. Public Library of Science 2023-02-22 /pmc/articles/PMC9946212/ /pubmed/36812214 http://dx.doi.org/10.1371/journal.pone.0278466 Text en © 2023 Yang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Yang, Kai-Wen K.
Paris, Chloé F.
Gorman, Kevin T.
Rattsev, Ilia
Yoo, Rebecca H.
Chen, Yijia
Desman, Jacob M.
Wei, Tony Y.
Greenstein, Joseph L.
Taylor, Casey Overby
Ray, Stuart C.
Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning
title Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning
title_full Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning
title_fullStr Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning
title_full_unstemmed Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning
title_short Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning
title_sort factors associated with resistance to sars-cov-2 infection discovered using large-scale medical record data and machine learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9946212/
https://www.ncbi.nlm.nih.gov/pubmed/36812214
http://dx.doi.org/10.1371/journal.pone.0278466
work_keys_str_mv AT yangkaiwenk factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning
AT parischloef factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning
AT gormankevint factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning
AT rattsevilia factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning
AT yoorebeccah factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning
AT chenyijia factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning
AT desmanjacobm factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning
AT weitonyy factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning
AT greensteinjosephl factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning
AT taylorcaseyoverby factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning
AT raystuartc factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning