Cargando…
A machine learning approach utilizing DNA methylation as an accurate classifier of COVID-19 disease severity
Since the onset of the COVID-19 pandemic, increasing cases with variable outcomes continue globally because of variants and despite vaccines and therapies. There is a need to identify at-risk individuals early that would benefit from timely medical interventions. DNA methylation provides an opportun...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580434/ https://www.ncbi.nlm.nih.gov/pubmed/36261477 http://dx.doi.org/10.1038/s41598-022-22201-4 |
_version_ | 1784812383586222080 |
---|---|
author | Bowler, Scott Papoutsoglou, Georgios Karanikas, Aristides Tsamardinos, Ioannis Corley, Michael J. Ndhlovu, Lishomwa C. |
author_facet | Bowler, Scott Papoutsoglou, Georgios Karanikas, Aristides Tsamardinos, Ioannis Corley, Michael J. Ndhlovu, Lishomwa C. |
author_sort | Bowler, Scott |
collection | PubMed |
description | Since the onset of the COVID-19 pandemic, increasing cases with variable outcomes continue globally because of variants and despite vaccines and therapies. There is a need to identify at-risk individuals early that would benefit from timely medical interventions. DNA methylation provides an opportunity to identify an epigenetic signature of individuals at increased risk. We utilized machine learning to identify DNA methylation signatures of COVID-19 disease from data available through NCBI Gene Expression Omnibus. A training cohort of 460 individuals (164 COVID-19-infected and 296 non-infected) and an external validation dataset of 128 individuals (102 COVID-19-infected and 26 non-COVID-associated pneumonia) were reanalyzed. Data was processed using ChAMP and beta values were logit transformed. The JADBio AutoML platform was leveraged to identify a methylation signature associated with severe COVID-19 disease. We identified a random forest classification model from 4 unique methylation sites with the power to discern individuals with severe COVID-19 disease. The average area under the curve of receiver operator characteristic (AUC-ROC) of the model was 0.933 and the average area under the precision-recall curve (AUC-PRC) was 0.965. When applied to our external validation, this model produced an AUC-ROC of 0.898 and an AUC-PRC of 0.864. These results further our understanding of the utility of DNA methylation in COVID-19 disease pathology and serve as a platform to inform future COVID-19 related studies. |
format | Online Article Text |
id | pubmed-9580434 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-95804342022-10-19 A machine learning approach utilizing DNA methylation as an accurate classifier of COVID-19 disease severity Bowler, Scott Papoutsoglou, Georgios Karanikas, Aristides Tsamardinos, Ioannis Corley, Michael J. Ndhlovu, Lishomwa C. Sci Rep Article Since the onset of the COVID-19 pandemic, increasing cases with variable outcomes continue globally because of variants and despite vaccines and therapies. There is a need to identify at-risk individuals early that would benefit from timely medical interventions. DNA methylation provides an opportunity to identify an epigenetic signature of individuals at increased risk. We utilized machine learning to identify DNA methylation signatures of COVID-19 disease from data available through NCBI Gene Expression Omnibus. A training cohort of 460 individuals (164 COVID-19-infected and 296 non-infected) and an external validation dataset of 128 individuals (102 COVID-19-infected and 26 non-COVID-associated pneumonia) were reanalyzed. Data was processed using ChAMP and beta values were logit transformed. The JADBio AutoML platform was leveraged to identify a methylation signature associated with severe COVID-19 disease. We identified a random forest classification model from 4 unique methylation sites with the power to discern individuals with severe COVID-19 disease. The average area under the curve of receiver operator characteristic (AUC-ROC) of the model was 0.933 and the average area under the precision-recall curve (AUC-PRC) was 0.965. When applied to our external validation, this model produced an AUC-ROC of 0.898 and an AUC-PRC of 0.864. These results further our understanding of the utility of DNA methylation in COVID-19 disease pathology and serve as a platform to inform future COVID-19 related studies. Nature Publishing Group UK 2022-10-19 /pmc/articles/PMC9580434/ /pubmed/36261477 http://dx.doi.org/10.1038/s41598-022-22201-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Bowler, Scott Papoutsoglou, Georgios Karanikas, Aristides Tsamardinos, Ioannis Corley, Michael J. Ndhlovu, Lishomwa C. A machine learning approach utilizing DNA methylation as an accurate classifier of COVID-19 disease severity |
title | A machine learning approach utilizing DNA methylation as an accurate classifier of COVID-19 disease severity |
title_full | A machine learning approach utilizing DNA methylation as an accurate classifier of COVID-19 disease severity |
title_fullStr | A machine learning approach utilizing DNA methylation as an accurate classifier of COVID-19 disease severity |
title_full_unstemmed | A machine learning approach utilizing DNA methylation as an accurate classifier of COVID-19 disease severity |
title_short | A machine learning approach utilizing DNA methylation as an accurate classifier of COVID-19 disease severity |
title_sort | machine learning approach utilizing dna methylation as an accurate classifier of covid-19 disease severity |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580434/ https://www.ncbi.nlm.nih.gov/pubmed/36261477 http://dx.doi.org/10.1038/s41598-022-22201-4 |
work_keys_str_mv | AT bowlerscott amachinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT papoutsoglougeorgios amachinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT karanikasaristides amachinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT tsamardinosioannis amachinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT corleymichaelj amachinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT ndhlovulishomwac amachinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT bowlerscott machinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT papoutsoglougeorgios machinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT karanikasaristides machinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT tsamardinosioannis machinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT corleymichaelj machinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity AT ndhlovulishomwac machinelearningapproachutilizingdnamethylationasanaccurateclassifierofcovid19diseaseseverity |