Cargando…

Identifying novel host-based diagnostic biomarker panels for COVID-19: a whole-blood/nasopharyngeal transcriptome meta-analysis

BACKGROUND: Regardless of improvements in controlling the COVID-19 pandemic, the lack of comprehensive insight into SARS-COV-2 pathogenesis is still a sophisticated challenge. In order to deal with this challenge, we utilized advanced bioinformatics and machine learning algorithms to reveal more cha...

Descripción completa

Detalles Bibliográficos
Autores principales: Maleknia, Samaneh, Tavassolifar, Mohammad Javad, Mottaghitalab, Faezeh, Zali, Mohammad Reza, Meyfour, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9347150/
https://www.ncbi.nlm.nih.gov/pubmed/35922752
http://dx.doi.org/10.1186/s10020-022-00513-5
_version_ 1784761803109040128
author Maleknia, Samaneh
Tavassolifar, Mohammad Javad
Mottaghitalab, Faezeh
Zali, Mohammad Reza
Meyfour, Anna
author_facet Maleknia, Samaneh
Tavassolifar, Mohammad Javad
Mottaghitalab, Faezeh
Zali, Mohammad Reza
Meyfour, Anna
author_sort Maleknia, Samaneh
collection PubMed
description BACKGROUND: Regardless of improvements in controlling the COVID-19 pandemic, the lack of comprehensive insight into SARS-COV-2 pathogenesis is still a sophisticated challenge. In order to deal with this challenge, we utilized advanced bioinformatics and machine learning algorithms to reveal more characteristics of SARS-COV-2 pathogenesis and introduce novel host response-based diagnostic biomarker panels. METHODS: In the present study, eight published RNA-Seq datasets related to whole-blood (WB) and nasopharyngeal (NP) swab samples of patients with COVID-19, other viral and non-viral acute respiratory illnesses (ARIs), and healthy controls (HCs) were integrated. To define COVID-19 meta-signatures, Gene Ontology and pathway enrichment analyses were applied to compare COVID-19 with other similar diseases. Additionally, CIBERSORTx was executed in WB samples to detect the immune cell landscape. Furthermore, the optimum WB- and NP-based diagnostic biomarkers were identified via all the combinations of 3 to 9 selected features and the 2-phases machine learning (ML) method which implemented k-fold cross validation and independent test set validation. RESULTS: The host gene meta-signatures obtained for SARS-COV-2 infection were different in the WB and NP samples. The gene ontology and enrichment results of the WB dataset represented the enhancement in inflammatory host response, cell cycle, and interferon signature in COVID-19 patients. Furthermore, NP samples of COVID-19 in comparison with HC and non-viral ARIs showed the significant upregulation of genes associated with cytokine production and defense response to the virus. In contrast, these pathways in COVID-19 compared to other viral ARIs were strikingly attenuated. Notably, immune cell proportions of WB samples altered in COVID-19 versus HC. Moreover, the optimum WB- and NP-based diagnostic panels after two phases of ML-based validation included 6 and 8 markers with an accuracy of 97% and 88%, respectively. CONCLUSIONS: Based on the distinct gene expression profiles of WB and NP, our results indicated that SARS-COV-2 function is body-site-specific, although according to the common signature in WB and NP COVID-19 samples versus controls, this virus also induces a global and systematic host response to some extent. We also introduced and validated WB- and NP-based diagnostic biomarkers using ML methods which can be applied as a complementary tool to diagnose the COVID-19 infection from non-COVID cases. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s10020-022-00513-5.
format Online
Article
Text
id pubmed-9347150
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-93471502022-08-04 Identifying novel host-based diagnostic biomarker panels for COVID-19: a whole-blood/nasopharyngeal transcriptome meta-analysis Maleknia, Samaneh Tavassolifar, Mohammad Javad Mottaghitalab, Faezeh Zali, Mohammad Reza Meyfour, Anna Mol Med Research Article BACKGROUND: Regardless of improvements in controlling the COVID-19 pandemic, the lack of comprehensive insight into SARS-COV-2 pathogenesis is still a sophisticated challenge. In order to deal with this challenge, we utilized advanced bioinformatics and machine learning algorithms to reveal more characteristics of SARS-COV-2 pathogenesis and introduce novel host response-based diagnostic biomarker panels. METHODS: In the present study, eight published RNA-Seq datasets related to whole-blood (WB) and nasopharyngeal (NP) swab samples of patients with COVID-19, other viral and non-viral acute respiratory illnesses (ARIs), and healthy controls (HCs) were integrated. To define COVID-19 meta-signatures, Gene Ontology and pathway enrichment analyses were applied to compare COVID-19 with other similar diseases. Additionally, CIBERSORTx was executed in WB samples to detect the immune cell landscape. Furthermore, the optimum WB- and NP-based diagnostic biomarkers were identified via all the combinations of 3 to 9 selected features and the 2-phases machine learning (ML) method which implemented k-fold cross validation and independent test set validation. RESULTS: The host gene meta-signatures obtained for SARS-COV-2 infection were different in the WB and NP samples. The gene ontology and enrichment results of the WB dataset represented the enhancement in inflammatory host response, cell cycle, and interferon signature in COVID-19 patients. Furthermore, NP samples of COVID-19 in comparison with HC and non-viral ARIs showed the significant upregulation of genes associated with cytokine production and defense response to the virus. In contrast, these pathways in COVID-19 compared to other viral ARIs were strikingly attenuated. Notably, immune cell proportions of WB samples altered in COVID-19 versus HC. Moreover, the optimum WB- and NP-based diagnostic panels after two phases of ML-based validation included 6 and 8 markers with an accuracy of 97% and 88%, respectively. CONCLUSIONS: Based on the distinct gene expression profiles of WB and NP, our results indicated that SARS-COV-2 function is body-site-specific, although according to the common signature in WB and NP COVID-19 samples versus controls, this virus also induces a global and systematic host response to some extent. We also introduced and validated WB- and NP-based diagnostic biomarkers using ML methods which can be applied as a complementary tool to diagnose the COVID-19 infection from non-COVID cases. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s10020-022-00513-5. BioMed Central 2022-08-03 /pmc/articles/PMC9347150/ /pubmed/35922752 http://dx.doi.org/10.1186/s10020-022-00513-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Maleknia, Samaneh
Tavassolifar, Mohammad Javad
Mottaghitalab, Faezeh
Zali, Mohammad Reza
Meyfour, Anna
Identifying novel host-based diagnostic biomarker panels for COVID-19: a whole-blood/nasopharyngeal transcriptome meta-analysis
title Identifying novel host-based diagnostic biomarker panels for COVID-19: a whole-blood/nasopharyngeal transcriptome meta-analysis
title_full Identifying novel host-based diagnostic biomarker panels for COVID-19: a whole-blood/nasopharyngeal transcriptome meta-analysis
title_fullStr Identifying novel host-based diagnostic biomarker panels for COVID-19: a whole-blood/nasopharyngeal transcriptome meta-analysis
title_full_unstemmed Identifying novel host-based diagnostic biomarker panels for COVID-19: a whole-blood/nasopharyngeal transcriptome meta-analysis
title_short Identifying novel host-based diagnostic biomarker panels for COVID-19: a whole-blood/nasopharyngeal transcriptome meta-analysis
title_sort identifying novel host-based diagnostic biomarker panels for covid-19: a whole-blood/nasopharyngeal transcriptome meta-analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9347150/
https://www.ncbi.nlm.nih.gov/pubmed/35922752
http://dx.doi.org/10.1186/s10020-022-00513-5
work_keys_str_mv AT malekniasamaneh identifyingnovelhostbaseddiagnosticbiomarkerpanelsforcovid19awholebloodnasopharyngealtranscriptomemetaanalysis
AT tavassolifarmohammadjavad identifyingnovelhostbaseddiagnosticbiomarkerpanelsforcovid19awholebloodnasopharyngealtranscriptomemetaanalysis
AT mottaghitalabfaezeh identifyingnovelhostbaseddiagnosticbiomarkerpanelsforcovid19awholebloodnasopharyngealtranscriptomemetaanalysis
AT zalimohammadreza identifyingnovelhostbaseddiagnosticbiomarkerpanelsforcovid19awholebloodnasopharyngealtranscriptomemetaanalysis
AT meyfouranna identifyingnovelhostbaseddiagnosticbiomarkerpanelsforcovid19awholebloodnasopharyngealtranscriptomemetaanalysis