Cargando…

Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies

Coronavirus disease 2019 (COVID-19) arising from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in a global pandemic since its first report in December 2019. So far, SARS-CoV-2 nucleic acid detection has been deemed as the golden standard of COVID-19 diagnosis. However, th...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Shuai, Qu, Renliang, Wang, Pengyan, Wang, Shenghan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8485143/
https://www.ncbi.nlm.nih.gov/pubmed/34603483
http://dx.doi.org/10.1155/2021/2203636
_version_ 1784577476758863872
author Zhang, Shuai
Qu, Renliang
Wang, Pengyan
Wang, Shenghan
author_facet Zhang, Shuai
Qu, Renliang
Wang, Pengyan
Wang, Shenghan
author_sort Zhang, Shuai
collection PubMed
description Coronavirus disease 2019 (COVID-19) arising from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in a global pandemic since its first report in December 2019. So far, SARS-CoV-2 nucleic acid detection has been deemed as the golden standard of COVID-19 diagnosis. However, this detection method often leads to false negatives, thus triggering missed COVID-19 diagnosis. Therefore, it is urgent to find new biomarkers to increase the accuracy of COVID-19 diagnosis. To explore new biomarkers of COVID-19 in this study, expression profiles were firstly accessed from the GEO database. On this basis, 500 feature genes were screened by the minimum-redundancy maximum-relevancy (mRMR) feature selection method. Afterwards, the incremental feature selection (IFS) method was used to choose a classifier with the best performance from different feature gene-based support vector machine (SVM) classifiers. The corresponding 66 feature genes were set as the optimal feature genes. Lastly, the optimal feature genes were subjected to GO functional enrichment analysis, principal component analysis (PCA), and protein-protein interaction (PPI) network analysis. All in all, it was posited that the 66 feature genes could effectively classify positive and negative COVID-19 and work as new biomarkers of the disease.
format Online
Article
Text
id pubmed-8485143
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-84851432021-10-02 Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies Zhang, Shuai Qu, Renliang Wang, Pengyan Wang, Shenghan Comput Math Methods Med Research Article Coronavirus disease 2019 (COVID-19) arising from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in a global pandemic since its first report in December 2019. So far, SARS-CoV-2 nucleic acid detection has been deemed as the golden standard of COVID-19 diagnosis. However, this detection method often leads to false negatives, thus triggering missed COVID-19 diagnosis. Therefore, it is urgent to find new biomarkers to increase the accuracy of COVID-19 diagnosis. To explore new biomarkers of COVID-19 in this study, expression profiles were firstly accessed from the GEO database. On this basis, 500 feature genes were screened by the minimum-redundancy maximum-relevancy (mRMR) feature selection method. Afterwards, the incremental feature selection (IFS) method was used to choose a classifier with the best performance from different feature gene-based support vector machine (SVM) classifiers. The corresponding 66 feature genes were set as the optimal feature genes. Lastly, the optimal feature genes were subjected to GO functional enrichment analysis, principal component analysis (PCA), and protein-protein interaction (PPI) network analysis. All in all, it was posited that the 66 feature genes could effectively classify positive and negative COVID-19 and work as new biomarkers of the disease. Hindawi 2021-09-27 /pmc/articles/PMC8485143/ /pubmed/34603483 http://dx.doi.org/10.1155/2021/2203636 Text en Copyright © 2021 Shuai Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Shuai
Qu, Renliang
Wang, Pengyan
Wang, Shenghan
Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title_full Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title_fullStr Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title_full_unstemmed Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title_short Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title_sort identification of novel covid-19 biomarkers by multiple feature selection strategies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8485143/
https://www.ncbi.nlm.nih.gov/pubmed/34603483
http://dx.doi.org/10.1155/2021/2203636
work_keys_str_mv AT zhangshuai identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies
AT qurenliang identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies
AT wangpengyan identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies
AT wangshenghan identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies