Cargando…

A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning

Although many biomarkers associated with coronavirus disease 2019 (COVID-19) were found, a novel signature relevant to immune cells has not been developed. In this work, the “CIBERSORT” algorithm was used to assess the fraction of immune infiltrating cells in GSE152641 and GSE171110. Key modules ass...

Descripción completa

Detalles Bibliográficos
Autores principales: Lai, Guichuan, Liu, Hui, Deng, Jielian, Li, Kangjie, Xie, Biao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9498787/
https://www.ncbi.nlm.nih.gov/pubmed/36140771
http://dx.doi.org/10.3390/genes13091602
_version_ 1784794846890819584
author Lai, Guichuan
Liu, Hui
Deng, Jielian
Li, Kangjie
Xie, Biao
author_facet Lai, Guichuan
Liu, Hui
Deng, Jielian
Li, Kangjie
Xie, Biao
author_sort Lai, Guichuan
collection PubMed
description Although many biomarkers associated with coronavirus disease 2019 (COVID-19) were found, a novel signature relevant to immune cells has not been developed. In this work, the “CIBERSORT” algorithm was used to assess the fraction of immune infiltrating cells in GSE152641 and GSE171110. Key modules associated with important immune cells were selected by the “WGCNA” package. The “GO” enrichment analysis was used to reveal the biological function associated with COVID-19. The “Boruta” algorithm was used to screen candidate genes, and the “LASSO” algorithm was used for collinearity reduction. A novel gene signature was developed based on multivariate logistic regression analysis. Subsequently, M0 macrophages (PR(AUC) = 0.948 in GSE152641 and PR(AUC) = 0.981 in GSE171110) and neutrophils (PR(AUC) = 0.892 in GSE152641 and PR(AUC) = 0.960 in GSE171110) were considered as important immune cells. Forty-three intersected genes from two modules were selected, which mainly participated in some immune-related activities. Finally, a three-gene signature comprising CLEC4D, DUSP13, and UNC5A that can accurately distinguish COVID-19 patients and healthy controls in three datasets was constructed. The ROC(AUC) was 0.974 in the training set, 0.946 in the internal test set, and 0.709 in the external test set. In conclusion, we constructed a three-gene signature to identify COVID-19, and CLEC4D, DUSP13, and UNC5A may be potential biomarkers for COVID-19 patients.
format Online
Article
Text
id pubmed-9498787
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94987872022-09-23 A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning Lai, Guichuan Liu, Hui Deng, Jielian Li, Kangjie Xie, Biao Genes (Basel) Article Although many biomarkers associated with coronavirus disease 2019 (COVID-19) were found, a novel signature relevant to immune cells has not been developed. In this work, the “CIBERSORT” algorithm was used to assess the fraction of immune infiltrating cells in GSE152641 and GSE171110. Key modules associated with important immune cells were selected by the “WGCNA” package. The “GO” enrichment analysis was used to reveal the biological function associated with COVID-19. The “Boruta” algorithm was used to screen candidate genes, and the “LASSO” algorithm was used for collinearity reduction. A novel gene signature was developed based on multivariate logistic regression analysis. Subsequently, M0 macrophages (PR(AUC) = 0.948 in GSE152641 and PR(AUC) = 0.981 in GSE171110) and neutrophils (PR(AUC) = 0.892 in GSE152641 and PR(AUC) = 0.960 in GSE171110) were considered as important immune cells. Forty-three intersected genes from two modules were selected, which mainly participated in some immune-related activities. Finally, a three-gene signature comprising CLEC4D, DUSP13, and UNC5A that can accurately distinguish COVID-19 patients and healthy controls in three datasets was constructed. The ROC(AUC) was 0.974 in the training set, 0.946 in the internal test set, and 0.709 in the external test set. In conclusion, we constructed a three-gene signature to identify COVID-19, and CLEC4D, DUSP13, and UNC5A may be potential biomarkers for COVID-19 patients. MDPI 2022-09-08 /pmc/articles/PMC9498787/ /pubmed/36140771 http://dx.doi.org/10.3390/genes13091602 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lai, Guichuan
Liu, Hui
Deng, Jielian
Li, Kangjie
Xie, Biao
A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning
title A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning
title_full A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning
title_fullStr A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning
title_full_unstemmed A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning
title_short A Novel 3-Gene Signature for Identifying COVID-19 Patients Based on Bioinformatics and Machine Learning
title_sort novel 3-gene signature for identifying covid-19 patients based on bioinformatics and machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9498787/
https://www.ncbi.nlm.nih.gov/pubmed/36140771
http://dx.doi.org/10.3390/genes13091602
work_keys_str_mv AT laiguichuan anovel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning
AT liuhui anovel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning
AT dengjielian anovel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning
AT likangjie anovel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning
AT xiebiao anovel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning
AT laiguichuan novel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning
AT liuhui novel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning
AT dengjielian novel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning
AT likangjie novel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning
AT xiebiao novel3genesignatureforidentifyingcovid19patientsbasedonbioinformaticsandmachinelearning