Cargando…
A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning
Although many biomarkers associated with coronavirus disease 2019 (COVID-19) were found, a novel signature relevant to immune cells has not been developed. In this work, the “CIBERSORT” algorithm was used to assess the fraction of immune infiltrating cells in GSE152641 and GSE171110. Key modules ass...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9498787/ https://www.ncbi.nlm.nih.gov/pubmed/36140771 http://dx.doi.org/10.3390/genes13091602 |
_version_ | 1784794846890819584 |
---|---|
author | Lai, Guichuan Liu, Hui Deng, Jielian Li, Kangjie Xie, Biao |
author_facet | Lai, Guichuan Liu, Hui Deng, Jielian Li, Kangjie Xie, Biao |
author_sort | Lai, Guichuan |
collection | PubMed |
description | Although many biomarkers associated with coronavirus disease 2019 (COVID-19) were found, a novel signature relevant to immune cells has not been developed. In this work, the “CIBERSORT” algorithm was used to assess the fraction of immune infiltrating cells in GSE152641 and GSE171110. Key modules associated with important immune cells were selected by the “WGCNA” package. The “GO” enrichment analysis was used to reveal the biological function associated with COVID-19. The “Boruta” algorithm was used to screen candidate genes, and the “LASSO” algorithm was used for collinearity reduction. A novel gene signature was developed based on multivariate logistic regression analysis. Subsequently, M0 macrophages (PR(AUC) = 0.948 in GSE152641 and PR(AUC) = 0.981 in GSE171110) and neutrophils (PR(AUC) = 0.892 in GSE152641 and PR(AUC) = 0.960 in GSE171110) were considered as important immune cells. Forty-three intersected genes from two modules were selected, which mainly participated in some immune-related activities. Finally, a three-gene signature comprising CLEC4D, DUSP13, and UNC5A that can accurately distinguish COVID-19 patients and healthy controls in three datasets was constructed. The ROC(AUC) was 0.974 in the training set, 0.946 in the internal test set, and 0.709 in the external test set. In conclusion, we constructed a three-gene signature to identify COVID-19, and CLEC4D, DUSP13, and UNC5A may be potential biomarkers for COVID-19 patients. |
format | Online Article Text |
id | pubmed-9498787 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-94987872022-09-23 A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning Lai, Guichuan Liu, Hui Deng, Jielian Li, Kangjie Xie, Biao Genes (Basel) Article Although many biomarkers associated with coronavirus disease 2019 (COVID-19) were found, a novel signature relevant to immune cells has not been developed. In this work, the “CIBERSORT” algorithm was used to assess the fraction of immune infiltrating cells in GSE152641 and GSE171110. Key modules associated with important immune cells were selected by the “WGCNA” package. The “GO” enrichment analysis was used to reveal the biological function associated with COVID-19. The “Boruta” algorithm was used to screen candidate genes, and the “LASSO” algorithm was used for collinearity reduction. A novel gene signature was developed based on multivariate logistic regression analysis. Subsequently, M0 macrophages (PR(AUC) = 0.948 in GSE152641 and PR(AUC) = 0.981 in GSE171110) and neutrophils (PR(AUC) = 0.892 in GSE152641 and PR(AUC) = 0.960 in GSE171110) were considered as important immune cells. Forty-three intersected genes from two modules were selected, which mainly participated in some immune-related activities. Finally, a three-gene signature comprising CLEC4D, DUSP13, and UNC5A that can accurately distinguish COVID-19 patients and healthy controls in three datasets was constructed. The ROC(AUC) was 0.974 in the training set, 0.946 in the internal test set, and 0.709 in the external test set. In conclusion, we constructed a three-gene signature to identify COVID-19, and CLEC4D, DUSP13, and UNC5A may be potential biomarkers for COVID-19 patients. MDPI 2022-09-08 /pmc/articles/PMC9498787/ /pubmed/36140771 http://dx.doi.org/10.3390/genes13091602 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lai, Guichuan Liu, Hui Deng, Jielian Li, Kangjie Xie, Biao A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning |
title | A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning |
title_full | A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning |
title_fullStr | A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning |
title_full_unstemmed | A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning |
title_short | A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning |
title_sort | novel 3-gene signature for identifying covid-19 patients based on bioinformatics and machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9498787/ https://www.ncbi.nlm.nih.gov/pubmed/36140771 http://dx.doi.org/10.3390/genes13091602 |
work_keys_str_mv | AT laiguichuan anovel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning AT liuhui anovel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning AT dengjielian anovel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning AT likangjie anovel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning AT xiebiao anovel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning AT laiguichuan novel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning AT liuhui novel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning AT dengjielian novel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning AT likangjie novel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning AT xiebiao novel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning |