Cargando…

Genome-Wide Co-Expression Distributions as a Metric to Prioritize Genes of Functional Importance

Genome-wide gene expression analysis are routinely used to gain a systems-level understanding of complex processes, including network connectivity. Network connectivity tends to be built on a small subset of extremely high co-expression signals that are deemed significant, but this overlooks the vas...

Descripción completa

Detalles Bibliográficos
Autores principales: Alexandre, Pâmela A., Hudson, Nicholas J., Lehnert, Sigrid A., Fortes, Marina R. S., Naval-Sánchez, Marina, Nguyen, Loan T., Porto-Neto, Laercio R., Reverter, Antonio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7593939/
https://www.ncbi.nlm.nih.gov/pubmed/33092259
http://dx.doi.org/10.3390/genes11101231
_version_ 1783601511770619904
author Alexandre, Pâmela A.
Hudson, Nicholas J.
Lehnert, Sigrid A.
Fortes, Marina R. S.
Naval-Sánchez, Marina
Nguyen, Loan T.
Porto-Neto, Laercio R.
Reverter, Antonio
author_facet Alexandre, Pâmela A.
Hudson, Nicholas J.
Lehnert, Sigrid A.
Fortes, Marina R. S.
Naval-Sánchez, Marina
Nguyen, Loan T.
Porto-Neto, Laercio R.
Reverter, Antonio
author_sort Alexandre, Pâmela A.
collection PubMed
description Genome-wide gene expression analysis are routinely used to gain a systems-level understanding of complex processes, including network connectivity. Network connectivity tends to be built on a small subset of extremely high co-expression signals that are deemed significant, but this overlooks the vast majority of pairwise signals. Here, we developed a computational pipeline to assign to every gene its pair-wise genome-wide co-expression distribution to one of 8 template distributions shapes varying between unimodal, bimodal, skewed, or symmetrical, representing different proportions of positive and negative correlations. We then used a hypergeometric test to determine if specific genes (regulators versus non-regulators) and properties (differentially expressed or not) are associated with a particular distribution shape. We applied our methodology to five publicly available RNA sequencing (RNA-seq) datasets from four organisms in different physiological conditions and tissues. Our results suggest that genes can be assigned consistently to pre-defined distribution shapes, regarding the enrichment of differential expression and regulatory genes, in situations involving contrasting phenotypes, time-series, or physiological baseline data. There is indeed a striking additional biological signal present in the genome-wide distribution of co-expression values which would be overlooked by currently adopted approaches. Our method can be applied to extract further information from transcriptomic data and help uncover the molecular mechanisms involved in the regulation of complex biological process and phenotypes.
format Online
Article
Text
id pubmed-7593939
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75939392020-10-30 Genome-Wide Co-Expression Distributions as a Metric to Prioritize Genes of Functional Importance Alexandre, Pâmela A. Hudson, Nicholas J. Lehnert, Sigrid A. Fortes, Marina R. S. Naval-Sánchez, Marina Nguyen, Loan T. Porto-Neto, Laercio R. Reverter, Antonio Genes (Basel) Article Genome-wide gene expression analysis are routinely used to gain a systems-level understanding of complex processes, including network connectivity. Network connectivity tends to be built on a small subset of extremely high co-expression signals that are deemed significant, but this overlooks the vast majority of pairwise signals. Here, we developed a computational pipeline to assign to every gene its pair-wise genome-wide co-expression distribution to one of 8 template distributions shapes varying between unimodal, bimodal, skewed, or symmetrical, representing different proportions of positive and negative correlations. We then used a hypergeometric test to determine if specific genes (regulators versus non-regulators) and properties (differentially expressed or not) are associated with a particular distribution shape. We applied our methodology to five publicly available RNA sequencing (RNA-seq) datasets from four organisms in different physiological conditions and tissues. Our results suggest that genes can be assigned consistently to pre-defined distribution shapes, regarding the enrichment of differential expression and regulatory genes, in situations involving contrasting phenotypes, time-series, or physiological baseline data. There is indeed a striking additional biological signal present in the genome-wide distribution of co-expression values which would be overlooked by currently adopted approaches. Our method can be applied to extract further information from transcriptomic data and help uncover the molecular mechanisms involved in the regulation of complex biological process and phenotypes. MDPI 2020-10-20 /pmc/articles/PMC7593939/ /pubmed/33092259 http://dx.doi.org/10.3390/genes11101231 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Alexandre, Pâmela A.
Hudson, Nicholas J.
Lehnert, Sigrid A.
Fortes, Marina R. S.
Naval-Sánchez, Marina
Nguyen, Loan T.
Porto-Neto, Laercio R.
Reverter, Antonio
Genome-Wide Co-Expression Distributions as a Metric to Prioritize Genes of Functional Importance
title Genome-Wide Co-Expression Distributions as a Metric to Prioritize Genes of Functional Importance
title_full Genome-Wide Co-Expression Distributions as a Metric to Prioritize Genes of Functional Importance
title_fullStr Genome-Wide Co-Expression Distributions as a Metric to Prioritize Genes of Functional Importance
title_full_unstemmed Genome-Wide Co-Expression Distributions as a Metric to Prioritize Genes of Functional Importance
title_short Genome-Wide Co-Expression Distributions as a Metric to Prioritize Genes of Functional Importance
title_sort genome-wide co-expression distributions as a metric to prioritize genes of functional importance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7593939/
https://www.ncbi.nlm.nih.gov/pubmed/33092259
http://dx.doi.org/10.3390/genes11101231
work_keys_str_mv AT alexandrepamelaa genomewidecoexpressiondistributionsasametrictoprioritizegenesoffunctionalimportance
AT hudsonnicholasj genomewidecoexpressiondistributionsasametrictoprioritizegenesoffunctionalimportance
AT lehnertsigrida genomewidecoexpressiondistributionsasametrictoprioritizegenesoffunctionalimportance
AT fortesmarinars genomewidecoexpressiondistributionsasametrictoprioritizegenesoffunctionalimportance
AT navalsanchezmarina genomewidecoexpressiondistributionsasametrictoprioritizegenesoffunctionalimportance
AT nguyenloant genomewidecoexpressiondistributionsasametrictoprioritizegenesoffunctionalimportance
AT portonetolaercior genomewidecoexpressiondistributionsasametrictoprioritizegenesoffunctionalimportance
AT reverterantonio genomewidecoexpressiondistributionsasametrictoprioritizegenesoffunctionalimportance