Cargando…

Computational identification of differentially-expressed genes as suggested novel COVID-19 biomarkers: A bioinformatics analysis of expression profiles

COVID-19 was declared a pandemic in March 2020, and since then, it has not stopped spreading like wildfire in almost every corner of the world, despite the many efforts made to stem its spread. SARS-CoV-2 has one of the biggest genomes among RNA viruses and presents unique characteristics that diffe...

Descripción completa

Detalles Bibliográficos
Autores principales: Di Salvatore, Valentina, Crispino, Elena, Maleki, Avisa, Nicotra, Giulia, Russo, Giulia, Pappalardo, Francesco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10259169/
https://www.ncbi.nlm.nih.gov/pubmed/37347079
http://dx.doi.org/10.1016/j.csbj.2023.06.007
_version_ 1785057611617402880
author Di Salvatore, Valentina
Crispino, Elena
Maleki, Avisa
Nicotra, Giulia
Russo, Giulia
Pappalardo, Francesco
author_facet Di Salvatore, Valentina
Crispino, Elena
Maleki, Avisa
Nicotra, Giulia
Russo, Giulia
Pappalardo, Francesco
author_sort Di Salvatore, Valentina
collection PubMed
description COVID-19 was declared a pandemic in March 2020, and since then, it has not stopped spreading like wildfire in almost every corner of the world, despite the many efforts made to stem its spread. SARS-CoV-2 has one of the biggest genomes among RNA viruses and presents unique characteristics that differentiate it from other coronaviruses, making it even more challenging to find a cure or vaccine that is efficient enough. This work aims, using RNA sequencing (RNA-Seq) data, to evaluate whether the expression of specific human genes in the host can vary in different grades of disease severity and to determine the molecular origins of the differences in response to SARS-CoV-2 infection in different patients. In addition to quantifying gene expression, data coming from RNA-Seq allow for the discovery of new transcripts, the identification of alternative splicing events, the detection of allele-specific expression, and the detection of post-transcriptional alterations. For this reason, we performed differential expression analysis on different expression profiles of COVID-19 patients, using RNA-Seq data coming from NCBI public repository, and we obtained the lists of all differentially expressed genes (DEGs) emerging from 7 experimental conditions. We performed a Gene Set Enrichment Analysis (GSEA) on these genes to find possible correlations between DEGs and known disease phenotypes. We mainly focused on DEGs coming out from the analysis of the contrasts involving severe conditions to infer any possible relation between a worsening of the clinical picture and an over-representation of specific genes. Based on the obtained results, this study indicates a small group of genes that result up-regulated in the severe form of the disease. EXOSC5, MESD, REXO2, and TRMT2A genes are not differentially expressed or not present in the other conditions, being for that reason, good biomarkers candidates for the severe form of COVID-19 disease. The use of specific over-expressed genes, whether up-regulated or down-regulated, which have an individual role in each different condition of COVID-19 as a biomarker, can assist in early diagnosis.
format Online
Article
Text
id pubmed-10259169
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-102591692023-06-12 Computational identification of differentially-expressed genes as suggested novel COVID-19 biomarkers: A bioinformatics analysis of expression profiles Di Salvatore, Valentina Crispino, Elena Maleki, Avisa Nicotra, Giulia Russo, Giulia Pappalardo, Francesco Comput Struct Biotechnol J Research Article COVID-19 was declared a pandemic in March 2020, and since then, it has not stopped spreading like wildfire in almost every corner of the world, despite the many efforts made to stem its spread. SARS-CoV-2 has one of the biggest genomes among RNA viruses and presents unique characteristics that differentiate it from other coronaviruses, making it even more challenging to find a cure or vaccine that is efficient enough. This work aims, using RNA sequencing (RNA-Seq) data, to evaluate whether the expression of specific human genes in the host can vary in different grades of disease severity and to determine the molecular origins of the differences in response to SARS-CoV-2 infection in different patients. In addition to quantifying gene expression, data coming from RNA-Seq allow for the discovery of new transcripts, the identification of alternative splicing events, the detection of allele-specific expression, and the detection of post-transcriptional alterations. For this reason, we performed differential expression analysis on different expression profiles of COVID-19 patients, using RNA-Seq data coming from NCBI public repository, and we obtained the lists of all differentially expressed genes (DEGs) emerging from 7 experimental conditions. We performed a Gene Set Enrichment Analysis (GSEA) on these genes to find possible correlations between DEGs and known disease phenotypes. We mainly focused on DEGs coming out from the analysis of the contrasts involving severe conditions to infer any possible relation between a worsening of the clinical picture and an over-representation of specific genes. Based on the obtained results, this study indicates a small group of genes that result up-regulated in the severe form of the disease. EXOSC5, MESD, REXO2, and TRMT2A genes are not differentially expressed or not present in the other conditions, being for that reason, good biomarkers candidates for the severe form of COVID-19 disease. The use of specific over-expressed genes, whether up-regulated or down-regulated, which have an individual role in each different condition of COVID-19 as a biomarker, can assist in early diagnosis. Research Network of Computational and Structural Biotechnology 2023-06-12 /pmc/articles/PMC10259169/ /pubmed/37347079 http://dx.doi.org/10.1016/j.csbj.2023.06.007 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Di Salvatore, Valentina
Crispino, Elena
Maleki, Avisa
Nicotra, Giulia
Russo, Giulia
Pappalardo, Francesco
Computational identification of differentially-expressed genes as suggested novel COVID-19 biomarkers: A bioinformatics analysis of expression profiles
title Computational identification of differentially-expressed genes as suggested novel COVID-19 biomarkers: A bioinformatics analysis of expression profiles
title_full Computational identification of differentially-expressed genes as suggested novel COVID-19 biomarkers: A bioinformatics analysis of expression profiles
title_fullStr Computational identification of differentially-expressed genes as suggested novel COVID-19 biomarkers: A bioinformatics analysis of expression profiles
title_full_unstemmed Computational identification of differentially-expressed genes as suggested novel COVID-19 biomarkers: A bioinformatics analysis of expression profiles
title_short Computational identification of differentially-expressed genes as suggested novel COVID-19 biomarkers: A bioinformatics analysis of expression profiles
title_sort computational identification of differentially-expressed genes as suggested novel covid-19 biomarkers: a bioinformatics analysis of expression profiles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10259169/
https://www.ncbi.nlm.nih.gov/pubmed/37347079
http://dx.doi.org/10.1016/j.csbj.2023.06.007
work_keys_str_mv AT disalvatorevalentina computationalidentificationofdifferentiallyexpressedgenesassuggestednovelcovid19biomarkersabioinformaticsanalysisofexpressionprofiles
AT crispinoelena computationalidentificationofdifferentiallyexpressedgenesassuggestednovelcovid19biomarkersabioinformaticsanalysisofexpressionprofiles
AT malekiavisa computationalidentificationofdifferentiallyexpressedgenesassuggestednovelcovid19biomarkersabioinformaticsanalysisofexpressionprofiles
AT nicotragiulia computationalidentificationofdifferentiallyexpressedgenesassuggestednovelcovid19biomarkersabioinformaticsanalysisofexpressionprofiles
AT russogiulia computationalidentificationofdifferentiallyexpressedgenesassuggestednovelcovid19biomarkersabioinformaticsanalysisofexpressionprofiles
AT pappalardofrancesco computationalidentificationofdifferentiallyexpressedgenesassuggestednovelcovid19biomarkersabioinformaticsanalysisofexpressionprofiles