Cargando…
Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning
There have been over 621 million cases of COVID-19 worldwide with over 6.5 million deaths. Despite the high secondary attack rate of COVID-19 in shared households, some exposed individuals do not contract the virus. In addition, little is known about whether the occurrence of COVID-19 resistance dif...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9946212/ https://www.ncbi.nlm.nih.gov/pubmed/36812214 http://dx.doi.org/10.1371/journal.pone.0278466 |
_version_ | 1784892284501753856 |
---|---|
author | Yang, Kai-Wen K. Paris, Chloé F. Gorman, Kevin T. Rattsev, Ilia Yoo, Rebecca H. Chen, Yijia Desman, Jacob M. Wei, Tony Y. Greenstein, Joseph L. Taylor, Casey Overby Ray, Stuart C. |
author_facet | Yang, Kai-Wen K. Paris, Chloé F. Gorman, Kevin T. Rattsev, Ilia Yoo, Rebecca H. Chen, Yijia Desman, Jacob M. Wei, Tony Y. Greenstein, Joseph L. Taylor, Casey Overby Ray, Stuart C. |
author_sort | Yang, Kai-Wen K. |
collection | PubMed |
description | There have been over 621 million cases of COVID-19 worldwide with over 6.5 million deaths. Despite the high secondary attack rate of COVID-19 in shared households, some exposed individuals do not contract the virus. In addition, little is known about whether the occurrence of COVID-19 resistance differs among people by health characteristics as stored in the electronic health records (EHR). In this retrospective analysis, we develop a statistical model to predict COVID-19 resistance in 8,536 individuals with prior COVID-19 exposure using demographics, diagnostic codes, outpatient medication orders, and count of Elixhauser comorbidities in EHR data from the COVID-19 Precision Medicine Platform Registry. Cluster analyses identified 5 patterns of diagnostic codes that distinguished resistant from non-resistant patients in our study population. In addition, our models showed modest performance in predicting COVID-19 resistance (best performing model AUROC = 0.61). Monte Carlo simulations conducted indicated that the AUROC results are statistically significant (p < 0.001) for the testing set. We hope to validate the features found to be associated with resistance/non-resistance through more advanced association studies. |
format | Online Article Text |
id | pubmed-9946212 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-99462122023-02-23 Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning Yang, Kai-Wen K. Paris, Chloé F. Gorman, Kevin T. Rattsev, Ilia Yoo, Rebecca H. Chen, Yijia Desman, Jacob M. Wei, Tony Y. Greenstein, Joseph L. Taylor, Casey Overby Ray, Stuart C. PLoS One Research Article There have been over 621 million cases of COVID-19 worldwide with over 6.5 million deaths. Despite the high secondary attack rate of COVID-19 in shared households, some exposed individuals do not contract the virus. In addition, little is known about whether the occurrence of COVID-19 resistance differs among people by health characteristics as stored in the electronic health records (EHR). In this retrospective analysis, we develop a statistical model to predict COVID-19 resistance in 8,536 individuals with prior COVID-19 exposure using demographics, diagnostic codes, outpatient medication orders, and count of Elixhauser comorbidities in EHR data from the COVID-19 Precision Medicine Platform Registry. Cluster analyses identified 5 patterns of diagnostic codes that distinguished resistant from non-resistant patients in our study population. In addition, our models showed modest performance in predicting COVID-19 resistance (best performing model AUROC = 0.61). Monte Carlo simulations conducted indicated that the AUROC results are statistically significant (p < 0.001) for the testing set. We hope to validate the features found to be associated with resistance/non-resistance through more advanced association studies. Public Library of Science 2023-02-22 /pmc/articles/PMC9946212/ /pubmed/36812214 http://dx.doi.org/10.1371/journal.pone.0278466 Text en © 2023 Yang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Yang, Kai-Wen K. Paris, Chloé F. Gorman, Kevin T. Rattsev, Ilia Yoo, Rebecca H. Chen, Yijia Desman, Jacob M. Wei, Tony Y. Greenstein, Joseph L. Taylor, Casey Overby Ray, Stuart C. Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning |
title | Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning |
title_full | Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning |
title_fullStr | Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning |
title_full_unstemmed | Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning |
title_short | Factors associated with resistance to SARS-CoV-2 infection discovered using large-scale medical record data and machine learning |
title_sort | factors associated with resistance to sars-cov-2 infection discovered using large-scale medical record data and machine learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9946212/ https://www.ncbi.nlm.nih.gov/pubmed/36812214 http://dx.doi.org/10.1371/journal.pone.0278466 |
work_keys_str_mv | AT yangkaiwenk factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning AT parischloef factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning AT gormankevint factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning AT rattsevilia factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning AT yoorebeccah factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning AT chenyijia factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning AT desmanjacobm factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning AT weitonyy factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning AT greensteinjosephl factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning AT taylorcaseyoverby factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning AT raystuartc factorsassociatedwithresistancetosarscov2infectiondiscoveredusinglargescalemedicalrecorddataandmachinelearning |